Software Engineer – AI Tools & Evals

Job Details

Working Arrangement: On-site

Location: New York City,

Salary: $170K/yr - $190K/yr

Share this job

Our client is hiring a Full-Stack Engineer to build the internal tools that power a fast-moving AI research team working on solutions to prevent AI misuse through advanced AI detection models.

The company is developing a transformer-based text classification model and runs continuous experiments to improve performance across different data slices, languages, and domains. Your role will be to design and ship the web apps, dashboards, and data tooling that make this research possible.

 

What you’ll build:

  • Lightweight UIs to browse, filter, and inspect labeled datasets
  • Dashboards and visualizations to surface model failures and eval results
  • Data pipelines to collect, transform, and prepare experiment datasets
  • Internal tools that help researchers iterate quickly on training and evaluation

What we’re looking for:

  • Strong full-stack engineering skills (frontend + backend)
  • Experience building internal tools for ML or research teams
  • Familiarity with model evaluation workflows and performance tracking
  • Comfort working with structured/unstructured data pipelines
  • Ability to ship quickly in a fast-paced AI environment

Nice to have:

  • Experience in data annotation or model training ecosystems
  • Background working closely with ML researchers
  • Portfolio or side projects demonstrating independent shipping

This is a high-ownership role embedded directly with research — ideal for someone who enjoys building practical, high-leverage tools that accelerate model development.

This field is for validation purposes and should be left unchanged.
Accepted file types: doc, docx, pdf, Max. file size: 2 MB.
Scroll to Top