Data Engineer

2 Weeks ago • 3 Years + • Data Analyst

About the job

Summary

Varonis seeks an experienced Data Engineer to design and build scalable data pipelines for AI, ML, SLM, LLM, and advanced analytics. Responsibilities include designing ETL/ELT pipelines using Databricks and Azure Data Factory, optimizing for performance and cost. The role involves collaboration with data scientists and engineers, implementing data solutions for model training and deployment, and working with a cybersecurity research team to leverage advanced data analytics. Prompt engineering for an LLM-based labeling platform is also key, requiring optimization for high-quality responses. Experience with cloud-based data solutions, Python (PySpark preferred), and big data technologies is essential. The role demands strong analytical and problem-solving skills and excellent communication within a multidisciplinary team.
Must have:
  • 3+ years data engineering experience
  • Cloud-based data solutions expertise
  • Proficiency in Python, PySpark a plus
  • Databricks & Azure Data Factory experience
  • Prompt engineering experience
  • Strong analytical & problem-solving skills
Good to have:
  • Experience with vector databases
  • Experience with embedding techniques
  • Experience with MLOps
Perks:
  • Hybrid work model
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

Description

Summary 
Data has never been more valuable and vulnerable. As cybercriminals become more sophisticated and regulations more strict, organizations struggle to answer one key question: “Is my data safe?
At Varonis, we see the world of cybersecurity differently. Instead of chasing threats, we believe the most practical approach is protecting data from the inside out. We’ve built the industry’s first fully autonomous Data Security Platform to help our customers dramatically reduce risk with minimal human effort.
At Varonis, we move fast. We’re an ultra-collaborative company with brilliant people who care deeply about the details. Together, we’re solving interesting and complex puzzles to keep the world’s data safe.
We work in a flexible, hybrid model, so you can choose the home-office balance that works best for you. 
 
We are looking for an experienced Data Engineer with expertise in modern data architectures, pipeline engineering, and cloud-based data ecosystems. In this role, you will design and build efficient, scalable data pipelines that enable robust data integration for AI, ML, SLM, LLM, and advanced analytics initiatives. You will also work closely with our data scientists and engineers to ensure high-quality, high-performance data flow and contribute to our data-driven culture.
Responsibilities: 
  • Design, build, and maintain scalable ETL/ELT pipelines to integrate data from diverse sources, optimizing for performance and cost efficiency.
  • Leverage Databricks and other modern data platforms to manage, transform, and process data for ML and AI models, supporting both real-time and batch processing workflows.
  • Work with cross-functional teams to implement data solutions that support model training, monitoring, and production deployment.
  • Collaborate with a cybersecurity research team to understand emerging threats and develop solutions that leverage advanced data analytics.
  • Design and develop innovative prompts and instruction sets to enhance our autonomous LLM-based labeling platform.
  • Optimize prompts to generate high-quality, coherent, and contextually relevant responses.
  • Collaborate with software and data engineers to integrate ML/LLM techniques into production systems.
 
Requirements:  
  • 3+ years of experience in data engineering, including cloud-based data solutions
  • Proven expertise in implementing large-scale data solutions.
  • Proficiency in Python. PySpark is a plus.
  • Experience with cloud and big data technologies such as Databricks and Azure Data factory.
  • Experience with prompt engineering techniques.
  • Experience with vector databases and embedding techniques is a plus.
  • Experience with MLOps is a plus.
  • Strong analytical and problem-solving skills, with the ability to evaluate and interpret complex data.
  • Excellent communication and collaboration skills, with the ability to work effectively in a multidisciplinary team.
  • Proven track record of delivering high-quality results in a fast-paced and dynamic environment.

 

We invite you to check out our Instagram Page to gain further insight into the Varonis culture!
@VaronisLife 
Varonis is an equal opportunity employer. We evaluate qualified applicants without regard to race, color, religion, sex, national origin, disability, veteran status, and other legally protected characteristics.
#LI-Hybrid

View Full Job Description

About The Company

Oregon, United States (On-Site)

North Carolina, United States (On-Site)

District Of Columbia, United States (On-Site)

Texas, United States (On-Site)

Tel Aviv District, Israel (Hybrid)

United States (Remote)

Tel Aviv District, Israel (Hybrid)

Tel Aviv District, Israel (Hybrid)

Tel Aviv District, Israel (Hybrid)

Tel Aviv District, Israel (Hybrid)

View All Jobs

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug