LLM Data Engineer-Mandarin Speaker

Remote: 
Full Remote
Contract: 
Work from: 

Offer summary

Qualifications:

Proficient in spoken English and Chinese., Bachelor's degree or above in Computer Science, Artificial Intelligence, Data Science, or related fields., Proficiency in Python programming language., Familiarity with data annotation processes and quality control methods..

Key responsibilities:

  • Build and optimize data processing pipelines for large language models.
  • Manage and process multi-source data while ensuring data quality.
  • Collaborate closely with algorithm engineers to define data requirements.
  • Drive data-driven model optimization.

MyShell.ai logo
MyShell.ai Startup https://myshell.ai/
11 - 50 Employees
See all jobs

Job description

About MyShell

MyShell is revolutionizing the AI landscape by building an open ecosystem for AI-native apps. Our powerful platform and intuitive toolkit empower anyone to create, access, and benefit from AI-powered applications. Launched in April 2023, MyShell has quickly gained global traction, attracting a diverse community of creators and users.

Our team of talented individuals from top institutions like MIT, Princeton, and Oxford is committed to fostering innovation in a supportive and transparent work environment. With funding from leading VCs, MyShell is poised to reshape the future of AI, making it accessible and integral to everyone's daily life. Join us on this thrilling journey as we redefine what's possible with AI.

 

 

Responsibilities:
  1. Build and optimize data processing pipelines for large language models, including data collection, cleaning, annotation, augmentation, and format conversion.
  2. Manage and process multi-source data while ensuring data quality.
  3. Collaborate closely with algorithm engineers to define data requirements and drive data-driven model optimization.
Qualifications:
  1. Proficient in spoken English & Chinese.
  2. Open to both fresh graduates and experienced professionals; Bachelor's degree or above in Computer Science, Artificial Intelligence, Data Science, or related fields.
  3. Proficiency in Python.
  4. Familiarity with data annotation processes and quality control methods. Candidates with hands-on experience in building evaluation and test datasets will be preferred.

What We Offer

  • Competitive salary and equity package, commensurate with experience and location.
  • Flexible working hours and a fully remote work environment, with the ability to collaborate effectively across time zones.
  • A dynamic and collaborative work environment that fosters innovation, growth, and professional development.
  • The opportunity to work on cutting-edge technologies and help shape the future of AI, transforming industries and making a global impact.

Required profile

Experience

Spoken language(s):
English
Check out the description to know which languages are mandatory.

Other Skills

  • Collaboration
  • Communication

Data Engineer Related jobs