Job Responsibilities
1. Participate in the development of Agent, a large-scale model for aerial robots, creating advanced autonomous agents with capabilities in image and video understanding, task decision-making, intelligent search, and tool invocation for aerial robotics scenarios;
2. Contribute to the research, development, and optimization of foundational large-scale models, building multimodal foundational models tailored for embodied systems, particularly in the field of aerial robotics;
3. Resolve complex challenges encountered during development and collaborate with team members to drive project progress.
Job Requirements
1. Bachelor's degree or higher in Computer Science, Software Engineering, Artificial Intelligence, Robotics, or related fields;
2. Proficiency in Python/C++ or similar development languages, with strong programming skills and algorithmic foundations;
3. In-depth understanding of the technical principles behind mainstream large language models (e.g., GPT, Qwen, LLaMA, GLM series) and multimodal models (e.g., Qwen-VL, InternVL). Preference given to candidates with application development and model fine-tuning experience;
4. Demonstrate proactive learning mindset, strong hands-on capabilities, and dedication to research, along with excellent communication and teamwork skills;
5. Preferred qualifications:
a. Experience in data analysis, machine learning, or related fields; preference given to winners of renowned algorithmic competitions like ACM Programming Contest or Kaggle;
b. Familiarity with cloud-native technologies and tools like Kubernetes and Docker;
c. Knowledge of large robotics models (e.g., ACT, OpenVLA, pi0, RT2) and understanding of cutting-edge developments in vision-language-navigation models;
d. Prioritized if you have open-source projects or technical blogs in relevant fields, or published papers in AI or robotics.