Data Engineering Intern

Role Overview

The Data Engineering Intern at RefinedScience will assist in building and maintaining data pipelines that support advanced analytics in healthcare. This junior role involves collaborating with data scientists and bioinformaticians to integrate and optimize data workflows and to ensure data quality and reliability. The intern will also contribute to the design and improvement of the data infrastructure that supports the company's research initiatives.

Perks & Benefits

This remote internship allows flexibility in work hours, accommodating students' schedules. Interns can expect a collaborative team environment, with opportunities for mentorship and skill development in data engineering. The company culture emphasizes learning and innovation, making it a great opportunity for career growth in the tech and healthcare sectors.

⚠️ This job was posted over 5 months ago and may no longer be open. We recommend checking the company's site for the latest status.

Full Job Description

Data Engineering Intern

At RefinedScience, our mission is to advance care by bringing together the best science, data and minds – disease by disease, patient by patient, cell by cell to discover pathways to life beyond disease.

WHAT WE ARE LOOKING FOR

We are seeking a motivated Data Engineering Intern to join our team. This internship is open to undergraduate and graduate students who are interested in building data infrastructure that supports advanced analytics, data science, and AI-driven insights in healthcare and life sciences. You will work closely with data scientists, bioinformaticians, and engineers to help design, build, and improve data pipelines and platforms that power RefinedScience's research and analytics initiatives.

KEY ACTIVITIES

Assist in building and maintaining data pipelines for ingesting, transforming, and validating clinical, biological, and real-world data
Support integration of data from multiple sources (e.g., clinical data, analytics outputs, external datasets)
Help develop and optimize ETL/ELT workflows to ensure data quality and reliability
Collaborate with data science and bioinformatics teams to support analytics and machine learning workflows
Contribute to data modeling, documentation, and best practices for data infrastructure
Participate in code reviews, testing, and performance improvements
Participate in quality reviews and troubleshooting
Communicate progress and findings to cross-functional teams

MUST HAVES

Currently enrolled in a Bachelor's, Master's, or Ph.D. program in Data Engineering, Computer Science, Data Science, Software Engineering, or a related field
Experience with Python and/or SQL through coursework, projects, or internships
Basic understanding of data pipelines, databases, and data transformation concepts
Familiarity with version control (e.g., Git)
Strong analytical thinking and problem-solving skills
Ability to learn quickly and work collaboratively in a team environment

RefinedScience

refinedscience.com

RefinedScience develops advanced scientific software and data analysis tools that help researchers and organizations process complex datasets. Their typical customers include academic institutions, pharmaceutical companies, and research laboratories that require sophisticated computational solutions. The company's main products include specialized data visualization platforms, machine learning algorithms for scientific discovery, and workflow automation tools for research teams. As a remote-friendly organization, RefinedScience emphasizes distributed collaboration with team members working across different time zones while maintaining strong communication through digital platforms.

Industry

Technology/Scientific Software

Fully remote or hybrid with strong remote support

2 open positions