Junior Data Engineer - Remote Job, 1+ Year Experience

3 Days ago • 1-2 Years • Data Analyst

About the job

Job Description

This remote Junior Data Engineer role at Patterned Learning focuses on building and scaling machine learning and AI solutions for a CVS Health AIOps platform. Key responsibilities include data pipeline development (design, implementation, and management), data modeling for scalability and efficiency, data integration from diverse sources (structured and unstructured), utilizing big data technologies (e.g., Kafka), implementing data security measures, and performance optimization. The ideal candidate will possess strong programming skills (Python, Java, SQL), ETL tool experience, and database management expertise (relational and non-relational). Experience with data modeling techniques, data quality assessment, and data cleansing is essential. The role demands a team player with excellent communication and problem-solving skills.
Must have:
  • 2+ years programming (Python, Java, SQL)
  • 2+ years ETL tools & database management
  • 2+ years data modeling experience
  • Data pipeline development & optimization
  • Data integration & security implementation
Good to have:
  • Big data technologies (PySpark, Databricks, Azure Synapse)
  • Cloud platform experience

This is a remote position.

Junior Data Engineer  - Remote Job, 1+ Year Experience


Annual Income: $63K - $77K


A valid work permit is necessary in the US


About us: Patterned Learning is a platform that aims to help developers code faster and more efficiently. It offers features such as collaborative coding, real-time multiplayer editing, and the ability to build, test, and deploy directly from the browser. The platform also provides tightly integrated code generation, editing, and output capabilities.




Position Summary

Join the fast-paced, innovative, and collaborative environment focused on providing an AIOps platform that enhances the intelligence of the CVS Health infrastructure. Work closely with subject matter experts and colleagues to build and scale out machine learning and AI solutions that will detect, predict, and recommend solutions to correct issues before system impact and enhance the efficiency, reliability, and performance of CVS Health’s IT operations. 

Key Responsibilities include:

  • Data pipeline development: Designed, implemented, and managed data pipelines for extracting, transforming, and loading data from various sources into data lakes for processing, analytics, and correlation.

  • Data modeling: Create and maintain data models ensuring data quality, scalability, and efficiency

  • Develop and automate processes to clean, transform, and prepare data for analytics, ensuring data accuracy and consistency

  • Data Integration: Integrate data from disparate sources, both structured and unstructured to provide a unified view of key infrastructure platform and application data

  • Utilize big data technologies such as Kafka to process and analyze large volumes of data efficiently

  • Implement data security measures to protect sensitive information and ensure compliance with data and privacy regulation

  • Create/maintain documentation for data processes, data flows, and system configurations

  • Performance Optimization- Monitor and optimize data pipelines and systems for performance, scalability and cost-effectiveness

Characteristics of this role:

  • Team Player: Willing to teach, share knowledge, and work with others to make the team successful.

  • Communication: Exceptional verbal, written, organizational, presentation, and communication skills.

  • Creativity: Ability to take written and verbal requirements and come up with other innovative ideas.

  • Attention to detail: Systematically and accurately research future solutions and current problems.

  • Strong work ethic: The innate drive to do work extremely well.

  • Passion: A drive to deliver better products and services than expected to customers.


Required Qualifications

  • 2+ years of programming experience in languages such as Python, Java, SQL

  • 2+ years of experience with ETL tools and database management (relational, non-relational)

  • 2+ years of experience in data modeling techniques and tools to design efficient scalable data structures

  • Skills in data quality assessment, data cleansing, and data validation


Preferred Qualifications

  • Knowledge of big data technologies and cloud platforms

  • Experience with technologies like PySpark, Databricks, and Azure Synapse.


Education

Bachelor’s degree in Computer Science, Information Technology, or related field, or equivalent working experience


Why Patterned Learning LLC?


Patterned Learning can provide intelligent suggestions, automate repetitive tasks, and assist developers in writing code more effectively. This can help reduce coding errors, improve productivity, and accelerate the development process.


The pattern recognition is particularly relevant in the context of coding. Neural networks, especially deep learning models, are commonly employed for pattern detection and classification tasks. These models simulate human decision-making and can identify patterns in data, making them well-suited for tasks like code analysis and generation.




View Full Job Description
$63.0K - $77.0K/yr (Outscal est.)
$70.0K/yr avg.
Worldwide

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

Similar Jobs

ION - Lead Software Engineer, Italy

ION, Italy (On-Site)

Microsoft - Customer Engineer

Microsoft, United States (On-Site)

Microsoft - Principal Software Engineering Manager

Microsoft, Australia (On-Site)

Nolimit City - Senior Software Engineer

Nolimit City, Sweden (Hybrid)

Dream Sports - Senior ML Scientist

Dream Sports, India (On-Site)

Sabre India - Sr Data Scientist

Sabre India, India (On-Site)

EXUSIA - Informatica - Sr. Data Engineer

EXUSIA, India (Remote)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Get notifed when new similar jobs are uploaded

Data Analyst Jobs

HP - Senior Data Engineer

HP, Spain (On-Site)

Wargaming - Game Data Analyst  (World of Warships)

Wargaming, Serbia (Hybrid)

AGCO Corporation - Data Scientist II

AGCO Corporation, India (On-Site)

Social Discovery Group - Senior Web Analyst

Social Discovery Group, Serbia (Remote)

N-iX - Senior Python Data Engineer (#2299)

N-iX, Colombia (On-Site)

Rank group - BI (Insights) Manager

Rank group, Mauritius (On-Site)

Luxoft - Senior PySpark Data Engineer

Luxoft, United States (Remote)

Get notifed when new similar jobs are uploaded