Staff Site Reliability Engineer - Cloud Engineering

1 Month ago • 5-6 Years • DevOps • $124,700 PA - $180,650 PA

Job Summary

Job Description

Visa seeks a Staff Site Reliability Engineer to support their Data Platform and cloud-based Big Data and Kafka platforms. Responsibilities include designing, building, and managing Big Data and Kafka infrastructure on AWS, GCP, and Azure; optimizing clusters for performance and scalability; developing monitoring tools; collaborating with other teams on solutions; ensuring platform security and compliance; conducting root cause analysis; planning capacity expansions and upgrades; automating tasks; tuning alerting and observability; and creating standard operating procedures. The role involves working with DevOps tools and collaborating with Level 3 teams to improve platform robustness and reliability. The ideal candidate will have strong experience with cloud platforms, Big Data tools (like Spark and Kafka), scripting languages (Python, Bash), and a passion for building scalable and reliable systems.
Must have:
  • 5+ years experience
  • AWS/GCP experience
  • Big Data & Kafka expertise
  • Scripting (Python, Bash)
  • System architecture knowledge
  • Problem-solving skills
Good to have:
  • Docker, Kubernetes, Ansible, Terraform
  • Observability tools (Grafana, Splunk)
  • SQL proficiency
  • Java/Python programming
Perks:
  • Comprehensive benefits package
  • Bonus and equity eligibility

Job Details

Company Description

Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose – to uplift everyone, everywhere by being the best way to pay and be paid.

Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa.

Job Description

Visa’s Technology Organization is a community of problem solvers and innovators reshaping the future of commerce. We operate the world’s most sophisticated processing networks capable of handling more than 65k secure transactions a second across 80M merchants, 15k Financial Institutions, and billions of everyday people. While working with us you’ll get to work on complex distributed systems and solve massive scale problems centered on new payment flows, business and data solutions, cyber security, and B2C platforms.

 

The Opportunity:

As a Staff Site Reliability Engineer in Product Reliability Engineering, you will be part of a team that maintains and supports Visa's Data Platform and provides support for key cloud based Big data and Kafka Platforms. You will be responsible for driving innovation for our partners and clients, within Visa and globally. You will work on open-source Big Data and Kafka clusters focusing on Cloud, ensuring their availability, performance, reliability, and improving operational efficiency.

 

The Work itself:

Essential Functions:

· Design, build and manage Big Data and Kafka infrastructure on AWS, GCP and Azure.

· Manage and optimize Apache Big Data and Kafka clusters for high performance, reliability, and scalability.

· Develop tools and processes to monitor and analyze system performance and to identify potential issues.

· Collaborate with other teams to design and implement Solutions to improve reliability and efficiency of the Big data cloud platforms.

· Ensure security and compliance of the platforms within organizational guidelines.

· Other responsibilities include effective root cause analysis of major production incidents and the development of learning documentation. The person will identify and implement high-availability solutions for services with a single point of failure.

· The role involves planning and performing capacity expansions and upgrades in a timely manner to avoid any scaling issues and bugs. This includes automating repetitive tasks to reduce manual effort and prevent human errors.

· The successful candidate will tune alerting and set up observability to proactively identify issues and performance problems. They will also work closely with Level 3 teams in reviewing new use cases and cluster hardening techniques to build robust and reliable platforms.

· The role involves creating standard operating procedure documents and guidelines on effectively managing and utilizing the platforms. The person will leverage DevOps tools, disciplines (Incident, problem, and change management), and standards in day-to-day operations.

· The individual will ensure that the platforms can effectively meet performance and service level agreement requirements. They will also perform security remediation, automation, and self-healing as per the requirement.

· The individual will concentrate on developing automations and reports to minimize manual effort. This can be achieved through various automation tools such as Shell scripting, Ansible, or Python scripting, or by using any other programming language.

 

The Skills You Bring:

· Energy and Experience: A growth mindset that is curious and passionate about technologies and enjoys challenging projects on a global scale.

·  Challenge the Status Quo: Comfort in pushing the boundaries, “hacking” beyond traditional solutions.

·  Language Expertise: Expertise in one or more general development languages (e.g., Java, python)

· Builder: Experience building and deploying distributed systems.

·  Learner: Constant drive to learn new technologies such as cloud technologies, Kubernetes, MLOPS.

· Partnership: Experience collaborating with Engineering, Application and Other functional teams.

 

**We do not expect that any single candidate would fulfill all these characteristics. For instance, we have awesome team members who are really focused on building scalable systems but didn’t work with payments technology or web applications before joining Visa.

This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs.

Qualifications

Basic Qualifications
· 5 or more years of relevant work experience with a Bachelors Degree or at least 2 years of work experience with an Advanced degree (e.g. Masters, MBA, JD, MD) or 0 years of work experience with a PhD

Preferred Qualifications:
· 6 or more years of work experience with a Bachelors Degree or 4 or more years of relevant experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or up to 3 years of relevant experience with a PhD
· Demonstrated experience with AWS and GCP cloud platforms.
· Experience with managing and optimizing Big Data and Kafka clusters.
· Proficient in scripting languages (Python, Bash) and SQL.
· Familiarity with big data tools (Big Data, Spark, Kafka, etc.) and frameworks (HDFS, MapReduce, etc.).
· Strong knowledge in system architecture and design patterns for high-performance computing.
· Good understanding of data security and privacy concerns.
· Experience with infrastructure automation technologies like Docker, Kubernetes, Ansible, Terraform is a plus.
· Excellent problem-solving and troubleshooting skills.
· Strong communication and collaboration skills.
· Observability: knowledge on observability tools like Grafana, opera and Splunk.
· Linux: understanding of Linux, networking, CPU, memory, and storage.
· Programming Languages: Knowledge of and ability to code or program in one of Java, python or a widely used coding language.
· Communication: Excellent interpersonal skills, along with superior verbal and written communication abilities.

Additional Information

Work Hours: Varies upon the needs of the department.

Travel Requirements: This position requires travel 5-10% of the time.

Mental/Physical Requirements: This position will be performed in an office setting.  The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers.

Visa is an EEO Employer.  Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status.  Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law.

Visa will consider for employment qualified applicants with criminal histories in a manner consistent with applicable local law, including the requirements of Article 49 of the San Francisco Police Code.

U.S. APPLICANTS ONLY: The estimated salary range for a new hire into this position is 124,700.00 to 180,650.00 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401 (k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.

Similar Jobs

Rocket - Technical Support Engineer

Rocket

Bengaluru, Karnataka, India (On-Site)
5 Years ago
N-iX - SENIOR COMPUTER VISION ENGINEER (#2655)

N-iX

Poland (Remote)
1 Month ago
Zoox - Senior/Staff Software Engineer - Simulation Workload Orchestration

Zoox

Seattle, Washington, United States (Hybrid)
3 Months ago
Seedify - Senior VFX Artist

Seedify

China (Remote)
2 Months ago
PwC - IN-Senior Associate_Devops_FS Tech_Advisory _Mumbai

PwC

Mumbai, Maharashtra, India (On-Site)
2 Months ago
Rackspace Technology - Sr Cloud Architect

Rackspace Technology

India (Remote)
1 Month ago
Google - Systems Development Engineer, Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
Moon Active - DevOps Engineer

Moon Active

Warsaw, Masovian Voivodeship, Poland (Hybrid)
4 Months ago
Aristocrat Gaming - Head of DevOps

Aristocrat Gaming

Austin, Texas, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Animoca Brands - Backend Developer

Animoca Brands

Malaysia (Remote)
4 Months ago
Enphase Energy - Principal Data Scientist

Enphase Energy

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Calix - Senior Software Test Engineer

Calix

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Rivos - Silicon DFT - Full time

Rivos

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Zoox - Test Engineer, Manufacturing Test & Diagnostics

Zoox

San Carlos, California, United States (On-Site)
3 Months ago
Nintendo - Security Engineer

Nintendo

Redmond, Washington, United States (Hybrid)
2 Months ago
ByteDance - Software Engineer in ML Systems Graduate (AML - Machine Learning Systems) - 2024 Start (BS/MS)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
PwC - IN_Associate_Azure Cloud Data Engineer_OneCloud _Advisory _Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Ubisoft - Tools Programmer

Ubisoft

Shanghai, Shanghai, China (On_site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Austin, Texas, United States

ByteDance - Senior Software Development Engineer - Distributed NoSQL Database Systems

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Netflix - ML Engineer L5 - Ads Platform Engineering (Forecasting)

Netflix

United States (Remote)
3 Months ago
Nintendo - Supervisor. Packaging & Distribution

Nintendo

North Bend, Washington, United States (On-Site)
2 Months ago
Google - Senior Software Engineer, Embedded Systems/Firmware, Google Cloud Platforms

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
Blizzard Entertainment - Senior Software Engineer, Online - Diablo IV | Irvine, CA or Albany, NY

Blizzard Entertainment

Irvine, California, United States (Hybrid)
3 Months ago
CloudHire - Senior Database Engineer (PostgreSQL)

CloudHire

Jersey City, New Jersey, United States (Hybrid)
3 Months ago
Fabric - Applied Researcher, Cryptography Hardware

Fabric

New York, New York, United States (Remote)
3 Months ago
Gala - Senior Infrastructure Platform Engineer

Gala

Green Bay, Wisconsin, United States (On-Site)
6 Months ago
Life church - Director of Product Design

Life church

Edmond, Oklahoma, United States (On-Site)
3 Months ago
Netflix - Data Engineer (L5) - Security

Netflix

United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Telnyx - Infrastructure Engineer (Data)

Telnyx

India (Remote)
4 Months ago
KingsIsle Entertainment - Build and Tools Software Engineer

KingsIsle Entertainment

Texas, United States (Hybrid)
2 Months ago
Extreme Network - Staff/Principal Software Engineer – Edge compute -Containerization 9401

Extreme Network

Toronto, Ontario, Canada (Hybrid)
3 Months ago
Redhorse Corp - Data Engineer

Redhorse Corp

Arlington, Virginia, United States (On-Site)
2 Months ago
Trek - DevOps Engineer

Trek

Haryana, India (Hybrid)
2 Months ago
Google - Systems Engineer III, Site Reliability Engineering

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
Axalta - Senior Infrastructure Engineer

Axalta

Gurugram, Haryana, India (On-Site)
3 Months ago
Microsoft - Technical Support Engineer - Identity & Security (Entra)

Microsoft

Seoul, South Korea (Hybrid)
1 Month ago
Lytx,  Inc  - Sr. Technical Project Manager

Lytx, Inc

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Google - Data Cloud Consultant, Professional Services, Google Cloud

Google

Mexico City, Mexico City, Mexico (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Austin, Texas, United States (Hybrid)

Austin, Texas, United States (Hybrid)

Auckland, Auckland, New Zealand (On-Site)

Auckland, Auckland, New Zealand (Hybrid)

Atlanta, Georgia, United States (Hybrid)

Ashburn, Virginia, United States (Hybrid)

Ashburn, Virginia, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by VISA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug