Location: Remote, Onsite in San Francisco or Australia/New Zealand   |   Full-Time
Python Machine Learning Data Analysis AI Benchmarking Model Evaluation Statistical Analysis Data Science

Artificial Analysis is an independent AI benchmarking and insights provider. We help engineers and companies understand AI by providing comprehensive benchmarks and evaluations. We are seeking an ML Engineer to design and execute complex benchmarking methodologies across various AI models and technologies.

Key Responsibilities:

  • Develop and maintain automated testing frameworks for AI model evaluations
  • Design experiments to measure model performance across different dimensions
  • Process and analyze large datasets to identify trends and patterns
  • Optimize benchmarking pipelines for efficiency and accuracy
  • Collaborate with engineering teams to integrate benchmarking capabilities

Required Skills:

  • Strong proficiency in Python for data manipulation and analysis
  • Experience with machine learning frameworks and model evaluation techniques
  • Knowledge of statistical analysis and data visualization
  • Familiarity with CI/CD pipelines for reproducible research
  • Understanding of various AI model architectures

We offer competitive compensation, equity options, and the opportunity to work with state-of-the-art AI technologies in a collaborative environment. This role requires strong analytical skills and a passion for advancing AI transparency through rigorous benchmarking.

Post Date: July 14, 2025