Software Engineer - GPU performance

3 Months ago • All levels • Research & Development

Job Summary

Job Description

As a GPU developer, you will be responsible for optimizing the entire Vision Algorithm & Learning Software Stack for performance on GPUs. You will build and translate the code into a performance-optimized block and create mathematical models that are better represented in GPU. You will also be involved in designing debugging, profiling, and image visualization tools for GPU. This role involves breaking down the processing pipeline into optimized blocks and kernels, discovering the most efficient mathematical models for different algorithms, and building a team as the product evolves.
Must have:
  • Experienced with Low-level CUDA APIs
  • Strong C++/C fundamentals
  • Adept with Visual Studio
  • Low-level performance analysis and optimization
  • Understanding of GPU HW architecture
  • Proficiency with GPU profiling tools
  • Optimizing Time Continuous kernels
  • Designing Pipelined Image processing CUDA core optimization
  • Dynamic Load balancing between kernels and functions
  • Interleaving processing between CPU and GPU
  • Experience with NvidiaDirect
  • Constructing Direct Visualization of GPU Memory
  • Designing and optimizing foundational neural networks
  • Understanding of GPU-based application development
  • Knowledge of CUDA
  • State machine architecture
  • Realtime computing
  • Memory architectures and optimizations
  • MIMD, SIMD
Good to have:
  • Experience with Compiler working and construction
  • CPU architectures – x86, x64 & ARM
  • Hardware-associated driver development
  • OS and layers (Board Support Packages, BIOS, UEFI, BootLoader)
  • UI-based deployable application development
  • Exposure to Omniverse

Job Details

About the job

About CynLr

Just like a baby’s brain, CynLr Visual Intelligence stack makes Robots to instinctively see & pick any object under any ambience, without any training. (a demo video link).


Today, we don’t have a robot that can fit a screw into a nut without slipping a thread. Imagine what it would take for a robot to assemble a Smartphone or a car by putting together 1000s of parts with varied shapes and weights, all in random orientations. Thus factories become complex, needing heavy customization of their environment.


CynLr enabled visual robots intuitively handles any object, even from a clutter – a universal alternative to custom machines, simplifying factory lines into modular LEGO blocks of micro-factories. Simplifying factories with robots that can pick & place any object has been a 40 year old pipe dream - touted as The Holy Grail of Robotics.


As a GPU developer, you will be responsible for building and translating the entire Vision Algorithm & Learning SW Stack into a performance-optimized code block and build mathematical models that are better represented in GPU.


Requirements in Practice:

  • Experienced with Low-level CUDA API
  • Strong with fundamentals of C++/C.
  • Adept with Visual Studio developer toolchain.
  • Experience in low-level performance analysis and optimization with a strong understanding of the GPU HW architecture and HW-oriented performance optimization.
  • including proficiency using GPU profiling tools such as NVIDIA Visual Profiler, NVIDIA Nsight Compute and Graphics Developer tools for debugging
  • Optimizing Time Continuous kernels - not just High-Level Kernel optimizations that are shipped with CUDA.
  • Design the framework of Pipelined Image processing
  • CUDA core optimization to achieve maximum performance for a pipelined processing between multiple blocks of functions executing simultaneously.
  • Dynamic Load balancing between kernels and functions.
  • Interleaving processing between CPU and GPU and runtime modification of GPU processing control flow from CPU.
  • Practice with NvidiaDirect to access memory directly from Peripheral devices (PCIe), Display and USB, bypassing the CPU
  • Practice with constructing Direct Visualization of GPU Memory for Debugging without CPU transfer
  • Experienced with designing and optimising foundational neural networks and modelling neurons (basically optimizing mathematical models that involve time-weighted kernels) ground up.
  • Exposure to Omniverse is a Plus


Must have an understanding of :

  • GPU-based application development. Knowledge of CUDA (Excellency is not necessary)
  • State machine architecture
  • Realtime computing
  • Memory architectures and optimizations.
  • MIMD, SIMD


Good to have experience and practice with

  • Compiler working and construction.
  • CPU architectures – x86, x64 & ARM
  • Hardware-associated driver development.
  • OS and layers (Board Support Packages, BIOS, UEFI, BootLoader)
  • UI-based deployable application development



Team Structure:

The engineering team will comprise of – Algo Team, GPU Team, SW Dev Team & HW Team. Members of other teams will be passive members of each team apart from the team they lead. The Algo Team will provide the Neural Models & Vision algorithms, while the GPU Team will provide the GPU optimizations for the algos, HW team will provide the HW integration and SW team with translate GPU optimized algos into SW blocks. Each team will split the implementation among other teams and guide them through the implementation. Every team member will be a passive member of all other teams.


What will you do?

Simplistically put – you will think all the algorithms that the Neuroscience team comes up with through GPU for maximum performance. You will break down the entire pipeline of processing that imitates the visual pathway into optimized blocks and kernels of processing in GPU. You will meticulously discover the mathematical models that gives the maximum timing performance for every Neural Model/algorithm that the Vision and Neuro team comes up with.

You will also be building some aspects of Debugging, profiling and Image visualizing tools for GPU.


How will you Do?

You have complete freedom here, but you will be subjected to reviews. Since this is a startup and the product is not yet well-defined, you would be the one with the responsibility of defining it. Expect things to be not orderly and requirements to not be solid. Part of your design effort largely involves requirements building too and developing architectures that are agnostic to such requirement changes. The SW part of the product significantly evolves as per your thought process and will henceforth carry your signature in it.

You will also be building a team as the product evolves to maintain and develop further. Though confined to a focused area, the work is pretty much expected to be entrepreneurial with the exact advantages and difficulties of a startup.

Similar Jobs

G5 Games - 2D UI/UX Artist (Hidden objects project)

G5 Games

Astana, Astana, Kazakhstan (Remote)
1 Month ago
Homa games - Senior MLOps Engineer

Homa games

Paris, Île-de-France, France (On-Site)
2 Months ago
ByteDance - Research Scientist Graduate (AML- AI-for-Science) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Resemble AI - Deep Learning Speech Researcher

Resemble AI

Mountain View, California, United States (On-Site)
6 Months ago
Cadence - IT -Sr Staff Systems Engineer

Cadence

Noida, Uttar Pradesh, India (On-Site)
4 Months ago
Virtuos - QA Lead

Virtuos

China (On-Site)
2 Months ago
Google - Senior Software Engineer, Machine Learning, YouTube

Google

Mountain View, California, United States (On-Site)
3 Months ago
Adept Global - 3D Geometry Engineer

Adept Global

Bengaluru, Karnataka, India (On-Site)
4 Months ago
ByteDance - SOC System Architect

ByteDance

San Jose, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Playrix - Feature Owner (LiveOps)

Playrix

Portugal (Remote)
3 Months ago
GT - ML Engineer

GT

(Remote)
1 Day ago
Evolution - Technical Game Artist

Evolution

Riga, Latvia (On-Site)
1 Week ago
Patterned Learning Career - Vice President, Software Development, Automation, Material Handling

Patterned Learning Career

(Remote)
1 Day ago
Playrix - Game Director

Playrix

Serbia (Remote)
3 Months ago
Sinch - Senior Machine Learning Engineer

Sinch

Flanders, Belgium (Hybrid)
1 Month ago
Luxoft - Senior ML Engineer

Luxoft

Poland, Ohio, United States (Remote)
1 Month ago
LeoVegas - Data Scientist - Sportsbook

LeoVegas

Stockholm, Stockholm County, Sweden (Hybrid)
1 Month ago
Zoox - Collision Avoidance System, Machine Learning Internship/Co-op

Zoox

Foster City, California, United States (On-Site)
3 Months ago
Sabre India - Data Scientist

Sabre India

Bengaluru, Karnataka, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

Saviynt - Senior Manager- Self-Service & Knowledge

Saviynt

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
PwC - Senior Associate - SAP ABAP - RDC

PwC

Kolkata, West Bengal, India (On-Site)
4 Months ago
Luxoft - Business Analyst - Risk & Finance

Luxoft

Gurugram, Haryana, India (On-Site)
2 Months ago
JioSaavn - Associate/Senior Associate – Music Research (English & Bengali)

JioSaavn

Mumbai, Maharashtra, India (On-Site)
4 Months ago
Extreme Network - Customer Lifecycle Manager – India (German and English Speaker)

Extreme Network

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
Rock My Sales (RMS) - Creative Art Director

Rock My Sales (RMS)

Noida, Uttar Pradesh, India (On-Site)
4 Months ago
Paytm - Sales - Team Lead - Khammam

Paytm

Khammam, Telangana, India (On-Site)
2 Months ago
CloudHire - Sr. Developer - Angular & NestJS

CloudHire

India (Remote)
3 Months ago
Luxoft - Senior Murex Front Office BA

Luxoft

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Maestro Realtek - Manager - Offline Marketing

Maestro Realtek

Pune, Maharashtra, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Assystems - Lead Electrical Engineer (LV/HT/ELV)

Assystems

Gurugram, Haryana, India (On-Site)
3 Months ago
ByteDance - Software Development Engineer - Machine Learning System

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Spacelabs Healthcare - Software Engineer II

Spacelabs Healthcare

Hyderabad, Telangana, India (On-Site)
3 Months ago
NXP - 2025Y Campus - MCU System Engineer Intern

NXP

Suzhou, Jiangsu, China (On-Site)
4 Months ago
Activate Games - Electronics Assembler (Night Shift)

Activate Games

Winnipeg, Manitoba, Canada (On-Site)
2 Months ago
Intel Corporation - Client SoC Performance Architect

Intel Corporation

Hillsboro, Oregon, United States (Hybrid)
1 Month ago
Intel Corporation - Formal Verification Engineer

Intel Corporation

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
ByteDance - Video Codec Firmware Engineer - Multimedia Lab

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Student Researcher (Doubao (Seed) - LLM Post-training) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Tesla - Jr. PLC Programmer

Tesla

Neutraubling, Bavaria, Germany (On-Site)
3 Hours ago

Get notifed when new similar jobs are uploaded

About The Company

Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by CynLr

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug