Data Engineer II
Insight Global
Job Description
Insight Global is seeking a Data Scientist for a remote direct placement opportunity with an employee-focused digital learning company. The Data Engineer is expected to design, build, and optimize ETL pipelines that power analytics, data science, and machine learning workflows using tools such as Databricks, PySpark, and Airflow. This role involves developing and maintaining labeling and retraining pipelines for ML models to ensure quality, reproducibility, and observability, while implementing MLOps practices like model versioning, CI/CD for ML, and production monitoring. You will collaborate closely with data scientists to productionize and scale model training, inference, and evaluation pipelines, and contribute to the evolution of our data lakehouse through schema design, partitioning strategies, and performance optimization. The position requires documenting and communicating data architecture, lineage, and dependencies for transparency across teams, championing data quality and governance, and leveraging infrastructure-as-code and containerization for reproducible environments. Additionally, you will participate in code reviews and drive continuous improvement of engineering best practices within the team. You’ll collaborate closely with Data Science, Business Intelligence, and other teams to enable the next generation of data-driven products and AI capabilities. This is a full-time, permanent opportunity with the best data science team in Ed Tech.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Skills and Requirements
Bachelor’s degree in Computer Science, Engineering, or related field.
2–4 years of experience building and operating large-scale data systems.
Proficiency in Python and SQL, with experience in PySpark, pandas, or similar data processing frameworks.
Experience with DBT and modern data warehousing and lakehouse platforms, preferably Databricks.
Hands-on experience with workflow orchestration tools such as Airflow, Dagster, or Prefect.
Experience with AWS data and compute services (S3, Lambda, ECS, CloudWatch, etc.) or equivalent cloud platforms.
Familiarity with MLOps concepts (e.g., feature stores, model registries, CI/CD for ML).
Experience using Infrastructure as Code Terraform Experience
Experience with supporting analytics and ML workloads
Experience in the education space
Confirmar seu email: Enviar Email
Todos os Empregos de Insight Global