Note: This is a remote position open to candidates in the USA. Creospan Inc. is a growing tech collective that offers innovative solutions to propel businesses into a better tomorrow. They are seeking a Software Engineer III who will design data engines and evaluation frameworks to support AI research and integrate breakthrough technologies into widely used products.
Responsibilities
- Design and develop multimodal datasets for training and evaluating AI models across image, video, audio, and text modalities
- Build and execute model evaluation frameworks, benchmarks, and performance metrics
- Fine-tune machine learning models using supervised and preference-based training techniques
- Utilize multimodal LLMs to generate, augment, balance, and improve training datasets
- Develop internal annotation tools and user interfaces using React and TypeScript
- Build and maintain scalable data ingestion and processing pipelines
- Implement reliable large-scale data movement, batching, deduplication, retry logic, and parallel processing
- Collaborate closely with AI researchers, scientists, and engineers to advance state-of-the-art AI systems
Skills
- 5+ years of professional software engineering experience
- Strong understanding of machine learning fundamentals, including:
  - Model fine-tuning (SFT, preference tuning)
  - Prompt engineering
  - Model evaluation methodologies
  - Understanding of model failure modes
- Experience building multimodal datasets involving image, video, audio, and text data
- Strong Python programming skills
- Hands-on experience with:
  - PyTorch
  - Hugging Face ecosystem
  - Model training and inference workflows
- Experience building production-grade web applications using:
  - React
  - TypeScript
- Strong SQL skills and experience developing large-scale data pipelines
- Experience designing reliable data processing systems including batching, deduplication, retries, and parallel execution
- Master's or PhD in Computer Science, Artificial Intelligence, Machine Learning, or related field
- Experience working on multimodal large language models (MLLMs)
- Research publications or significant contributions to AI research projects
- Open-source contributions to AI or machine learning projects
- Experience working in mid-sized technology companies or large-scale technology organizations
Company Overview
Creospan is an information technology company that offers IT consulting and networking engineering services. It was founded in 1999 and is headquartered in Schaumburg, Illinois, USA, with a workforce of 201-500 employees. Its website is http://creospan.com.