Software Engineer – AI Tools & Evals

Job Details

Sector: AI Research, AI/ML, Software

Type: Permanent

Working Arrangement: On-site

Location: New York City

Salary: $170K/yr - $190K/yr

Our client is hiring a Full-Stack Engineer to build the internal tools that power a fast-moving AI research team. The team builds advanced AI detection models to help prevent AI misuse.

The company is developing a transformer-based text classification model and runs continuous experiments to improve performance across different data slices, languages, and domains. Your role will be to design and ship the web apps, dashboards, and data tooling that make this research possible.

What you’ll build:

Lightweight UIs to browse, filter, and inspect labeled datasets
Dashboards and visualizations to surface model failures and eval results
Data pipelines to collect, transform, and prepare experiment datasets
Internal tools that help researchers iterate quickly on training and evaluation

What we’re looking for:

Strong full-stack engineering skills (frontend + backend)
Experience building internal tools for ML or research teams
Familiarity with model evaluation workflows and performance tracking
Comfort working with structured/unstructured data pipelines
Ability to ship quickly in a fast-paced AI environment

Nice to have:

Experience in data annotation or model training ecosystems
Background working closely with ML researchers
Portfolio or side projects demonstrating independent shipping

This is a high-ownership role embedded directly with research — ideal for someone who enjoys building practical, high-leverage tools that accelerate model development.