MLOps Engineer

This listing is synced directly from the company ATS.

Role Overview

This MLOps Engineer role involves developing and supporting ML pipelines for large-scale data generation, preprocessing, and ETL workflows, working collaboratively with AI and MLOps teams to transform research into reproducible datasets. It appears to be a junior to mid-level position, focusing on building and running batch data jobs, validating datasets, and debugging pipeline issues to scale the models underlying the company's core product. The hire will directly contribute to rapidly scaling ML pipelines, ensuring data quality and supporting the company's deepfake detection platform.

Perks & Benefits

The role is fully remote with no explicit time zone restrictions, offering comprehensive benefits including 100% premium coverage for healthcare, dental, vision, disability, and life insurance for employees, plus partial coverage for dependents. Additional perks include equity compensation, 20 days of PTO, 12 weeks of parental leave, a learning and development budget, monthly wellness benefits, and an annual company offsite, fostering a supportive and growth-oriented culture. For NYC-based employees, there are in-office benefits like daily lunch and remote Fridays, but the remote setup implies flexibility and a focus on work-life balance.

Full Job Description

Who we are.

Reality Defender is an award-winning cybersecurity company helping enterprises and governments detect deepfakes and AI-generated media. Utilizing a patented multi-model approach, Reality Defender is robust against the bleeding edge of generative platforms producing video, audio, imagery, and text media. Reality Defender's API-first deepfake detection platform empowers teams and developers alike to identify fraud, disinformation campaigns, and harmful deepfakes in real time.

Backed by world class investors including DCVC, Illuminate Financial, Y Combinator, Booz Allen Hamilton, IBM, Accenture, Rackhouse, and Argon VC, Reality Defender works with leading enterprise clients, financial institutions, and governments in order to ensure AI-generated media is not used for malicious purposes.

Youtube: Reality Defender Wins RSA Most Innovative Startup

The ML OPs Engineer Role.

We are hiring a MLOps engineer to assist in development and support of our ML pipelines including large scale data generation, preprocessing, and ETL workflows. In this role, you will work collaboratively with both our AI and MLOps teams to transform research questions into reproducible, well-structured datasets using our existing MLOps and data platform infrastructure. You will have the opportunity to directly contribute to rapidly scaling ML pipelines that support the models underlying our core product.

What you will do.

  • Work with ML engineers and researchers to ensure delivered datasets are usable and correctly scoped

  • Build and run batch data generation and preprocessing jobs for image, video, and audio data

  • Execute preprocessing pipelines using shared batch orchestration tools

  • Design and run ETL jobs to ingest, transform, and organize data in our warehouse.

  • Validate input and output datasets (schema, metadata, basic quality checks)

  • Collect, organize, and deliver processed datasets using established conventions

  • Support creation of development and prototype datasets ahead of large-scale backfills

  • Maintain version control of data processing repositories following industry best practices

  • Debug data pipeline failures, ETL issues, and data quality problems

Required qualifications.

  • Bachelor's degree in computer science, machine learning, or a related field

  • 1+ year of experience working with large datasets in a production environment or academic setting

  • Strong command of Python fundamentals and data wrangling (pandas, scikit-learn, matplotlib)

  • Basic experience with batch data pipelines and ETL workflows

  • Familiarity with cloud object storage (AWS S3 or equivalent) and structured data organization

  • Basic understanding of structured data organization and common associated issues

  • Ability to follow structured workflows and deliver reproducible results

  • Attention to detail and strong ownership of data quality

  • Basic experience working with cloud services

Nice to have.

  • Master’s degree in computer science, machine learning, or a related field

  • Exposure to vision or audio data processing techniques

  • Experience with data lake technologies or distributed processing systems

  • Familiarity with Docker or containerized batch jobs

  • Understanding of dataset versioning and development vs training data separation

  • Experience with ML-related data pipelines or training workflows

  • Experience with our python data processing tech stack is ideal but not required

    • uv for project and dependency management

    • polars for dataframe workflows

    • DynamoDB and PostgreSQL for live data management

    • pydantic and pyright for data typing

What we offer.

Reality Defender offers the following benefits to all our employees, regardless of location:

  • Healthcare plans with 100% premium coverage for employees and partial coverage available for dependents

  • Dental and Vision plans with 100% premium coverage for employees and their dependents

  • Short/Long-term disability and life insurance plans with 100% premium coverage for employees

  • FSA/HSA and 401k programs

  • Equity compensation

  • 20 days of PTO per year

  • 12 weeks of Parental Leave

  • Learning and Development budget

  • Monthly wellness benefits

  • Annual company-sponsored offsite

For employees working from Reality Defender’s HQ in NYC, we offer the following benefits:

  • Daily in-office lunch through UberEats

  • Commuter benefits

  • Remote Fridays

  • Happy Hours and other local events

Similar jobs

Found 6 similar jobs