About The Team
This position is responsible for researching and building the company's LLMs. The role involves exploring new applications and solutions for related technologies in areas such as search, recommendation, advertising, content creation, and customer service.
The goal is to meet the increasing demand for intelligent interactions from users and to significantly enhance their lifestyle and communication in the future.
The primary job responsibilities include : 1. Explore effective ways of evaluating the abilities of the model at various stages.
Open the black box of LLMs, understand the source of various abilities of LLMs, and provide guidance for the model iteration.
2. Synthesize large-scale, high-quality data through methods such as rewriting, augmentation, and generation, to improve the abilities of the pre-training model in various dimensions.
3. Explore effective training methods (such as active learning, curriculum learning, and effective training objectives for LLMs, and optimize the scaling laws.
1. Excellent coding ability, familiar with data structures, and fundamental algorithm skills, proficient in C / C++ or Python, winners of competitions such as ACM / ICPC, NOI / IOI, Top Coder, Kaggle, etc.
are preferred;2. Experience with deep learning, NLP, CV and familiarity with large-scale model training are preferred;3.
Candidates who have led influential projects or papers in NLP, deep learning are preferred;4. Excellent problem analysis and solving skills, able to deeply solve problems in large-scale model training and application;
5. Good communication and collaboration skills, able to explore new technologies with the team and promote technological progress.