Offer summary
Qualifications:
Bachelor's degree in Computer Science or related field, 7 years of experience in distributed computing systems, 5 years of experience in developing AI and ML algorithms, 3 years of involvement in machine learning development lifecycle, Proficiency in Python or C/C++.
Key responsabilities:
- Architect resilient systems for long-duration training tasks
- Build infrastructure for efficient model deployment
- Implement training clusters optimizing storage and network systems
- Develop benchmarks to assess AI system performance
- Design applications incorporating large language and foundation models