We’re building a gamified developer platform where tens of thousands of engineers create high‑fidelity datasets that push LLM frontiers. This role owns the technical lifecycle of data pipelines—from defining new data formats with partner labs to shipping the tooling, environments, docs, and QA that make those formats real at scale.
Own projects end-to-end , from initial prototyping to ongoing maintenance, bug fixing, and iteration based on feedback.
Own developer experience pipelines end‑to‑end: Prototype tooling for collecting new data formats → productionize workflow → iterate from developer experience
Champion DX: Create clear, concise guidelines and documentation to empower our data contributors and ensure high-quality inputs for your projects.
Quality & governance: Develop and manage the quality standards for your projects, which includes training and aligning content reviewers to ensure data consistency and accuracy. Implement automated checks, eval harnesses, reviewer workflows, and data quality bars; be hands on and in the weeds to align with reviewers on standards.
Maintain & iterate: Monitor, debug, and continuously improve reliability, latency, and contributor success rates.
Define Frontier data formats: Co‑author specs/RFCs with frontier lab researchers; design schemas, metadata, and versioning for new task/trajectory formats.
Build developer tooling & environments: Ship tooling, sandboxes, CLIs/SDKs, and capture/instrumentation to make contribution flows fast and safe.
Excellent written communication skills, with a proven ability to explain complex concepts to a less technical audience.
An organized and process-oriented mindset – you enjoy bringing structure to ambiguous problems and are meticulous about quality.
Foundational full-stack skills, with experience in React and at least one modern backend language (e.g., Python, Node.js, Go).
Strong technical judgment and a pragmatic mindset – you know how to balance speed with quality, recognizing the need for a scrappy solution versus when to invest in a robust architecture.
A deep resourcefulness with AI – you are highly adept at prompt engineering and using AI tools to find the fastest path to a solution.
Curiosity, pride in your work, desire to push the frontiers
Experience designing or running evaluations for LLM outputs to measure and track quality, accuracy, or other performance metrics.
Familiarity with building tools for other developers, such as CLIs, SDKs, or internal dashboards.
Experience with cloud infrastructure (AWS), Docker, and CI/CD pipelines
Join to apply for the 1099 Process Server role at Proof2 days ago Be among the first 25 applicants... ...Employment type Employment type Full-timeJob function Job function Legal... ...roles. Remote Data Entry Clerk - Typing - Part Time Entry Birmingham, AL $30,000.00-$36...
Adecco is assisting a local client recruiting for French Translator opportunities remote nationwide. This is an excellent opportunity to join a winning culture and get your foot in the door with a company that takes pride in their employees and their work. If French Translator...
...mission to automate sales, marketing, and customer success for B2B companies. We build the most... ...We are looking for a Customer Success Manager who is passionate about helping... ...position is uniquely flexible-offering remote options for exceptional candidates based...
...715pm, 36 hours/week Shift: Days Location: 9100 W 74 th Street, Merriam, KS The role you'll contribute: The Patient Safety Attendant (PSA) participates in high-quality, patient-centered care by providing continuous observation and monitoring for high risk...
...on 12 hour shifts and 35 hours per week (subject to confirmation) with tax-free stipend amount to be determined. Posted job title: COTA About Core Medical Group CoreMedical Group is one of the largest healthcare staffing agencies in the country. We have jobs nationwide...