Menu
Turing

SwarmBench Task Engineer — Reasoning / Math

Turing
full time remote mid

Required Skills

PythonPhD MathsGrad Maths

Job Description

Build challenging multi-agent tasks requiring advanced mathematical reasoning.

About Turing:

Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers in two ways: first, by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialize in coding, reasoning, STEM, multilinguality, multimodality, and agents; and second, by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L


Role Overview:

We are seeking a highly analytical and computationally proficient individual to join our team with a strong research background. You will be instrumental in contributing to this role by either crafting challenging and insightful problems in your respective research domain, devising elegant computational solutions.


Responsibilities:

  • Build multi-agent benchmark tasks that require multi-step mathematical reasoning, proof construction, or algorithmic problem-solving
  • Design problems that are genuinely hard for a single agent but decomposable — competition math, numerical analysis, combinatorial optimization, statistical inference
  • Create verification scripts that check mathematical correctness — numerical answers with appropriate tolerance, proof step validity, algorithm output correctness
  • Write clear problem statements with precise notation, definitions, and output format
  • Create decomposition guides that split problems into independent sub-computations or parallel solution strategies

Required Qualifications:

  • 5+ years of experience in mathematics, quantitative research, or computational science (e.g., competition math, university-level mathematics, or quantitative research)
  • Strong Python programming skills, including NumPy, SciPy, or symbolic computation (SymPy)
  • Experience writing mathematical proofs or formal derivations
  • Ability to create problems with precise, verifiable answers (not subjective or open-ended)
  • Familiarity with AI coding benchmarks such as SWE-bench and Terminal-bench
  • Comfortable with Docker (writing Dockerfiles, building images, debugging containers)
  • Understanding of numerical methods, including floating-point tolerance, convergence criteria, and error bounds

Nice to have:

  • Experience creating math competition problems (e.g., AMC, AIME, Putnam, IMO, or similar)
  • Research experience in mathematics, theoretical computer science, or quantitative fields
  • Experience with automated theorem proving or formal verification
  • Knowledge of AI reasoning benchmarks (e.g., GSM8K, MATH, AIME, GPQA, ARC-AGI)
  • Experience with large-scale numerical computation or scientific computing

Perks of Freelancing With Turing:

  • Work in a fully remote environment.
  • Opportunity to work on cutting-edge AI projects with leading LLM companies.
  • Potential for contract extension based on performance and project needs.

Offer Details:

  • Commitments Required : 40 hours /week with 4 hours of PST Overlap
  • Engagement type : Contractor assignment/freelancer (no medical/paid leave)
  • Duration of contract : 1 month; [expected start date is next week]
  • Location: Bangladesh, Brazil, Colombia, Egypt, Ghana, India, Indonesia, Kenya, Nigeria, Turkey, Vietnam