Community

Home >

Jobs >

Research Scientist, Multilingual Large Language Models

Research Scientist, Multilingual Large Language Models

1 Month ago • All levels • Research & Development • Undisclosed

About the job

6 skills required for this role

Add these skills to join the top 1% applicants for this job

java

javascript

cpp

prototyping

python

innovation

Job Description

As a Research Scientist, Multilingual Large Language Models at Google, you'll contribute to developing advanced methodologies for multilingual environments. Responsibilities include authoring research papers, researching and developing technology for improving multilingual LLMs (instruction-tuning, pre-training, multilingual reasoning), and pre-training LLMs for languages other than English. Collaboration with other research teams and Google's partner teams to deliver new multilingual technologies to production is crucial. You'll work on real-world problems spanning computer science, including machine learning, natural language processing, and more, applying the latest theories to develop new products and processes. The role involves setting up large-scale tests, deploying ideas quickly, and managing deadlines while contributing to the wider research community by publishing findings.

Must have:

PhD in CS or related field
Coding experience (Python, JS, R, Java, or C++)
Publication(s) in conferences/journals
Research & development of multilingual LLMs
Collaboration with research teams

Good to have:

2+ years Python coding experience
1+ year experience owning research agendas
Experience with LLMs and generative models
Experience with multilingual LLMs
Recent publications in Generative AI

Minimum qualifications:

PhD in Computer Science, a related field, or equivalent practical experience.
Coding experience in Python, JavaScript, R, Java, or C++.
One or more scientific publication submission(s) for conferences, journals, or public repositories.

Preferred qualifications:

2 years of coding experience in Python, JavaScript, R, Java, or C++.
1 year of experience owning and initiating research agendas.
Experience with modern Large Language Models (LLM) and generative models.
Experience with multilingual LLMs.
Recent publication track in related Generative Artificial Intelligence fields.

About the job

As an organization, Google maintains a portfolio of research projects driven by fundamental research, new product innovation, product contribution and infrastructure goals, while providing individuals and teams the freedom to emphasize specific types of work. As a Research Scientist, you'll setup large-scale tests and deploy promising ideas quickly and broadly, managing deadlines and deliverables while applying the latest theories to develop new and improved products, processes, or technologies. From creating experiments and prototyping implementations to designing new architectures, our research scientists work on real-world problems that span the breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software performance analysis, improving compilers for mobile platforms, as well as core search and much more.

As a Research Scientist, you'll also actively contribute to the wider research community by sharing and publishing your findings, with ideas inspired by internal projects as well as from collaborations with research programs at partner universities and technical institutes all over the world.

Our team is committed to developing advanced methodologies tailored specifically for multilingual environments. We focus on pre-training multilingual models, enhancing the quality of multilingual instruction-tuning datasets, refining multilingual evaluation processes, boosting knowledge transfer across languages, and optimizing multilingual tokenization among other initiatives.

The Technology & Society organization connects research, people, and ideas across Google and Alphabet to help shape and advance our most ambitious technology innovations and initiatives and their impact on users and society for the better, and responsibly. In addition, we also aim to share perspectives, engage, and collaborate with others externally on technology related issues and opportunities for society.

Responsibilities

Author research papers to share and generate impact of research results across the team and in the research community.
Research and develop technology for improving multilingual Large Language Models (LLM) such as instruction-tuning, pre-training, multilingual reasoning.
Research and develop technology for pre-training LLMs for languages other than English.
Collaborate with other research teams to expand multilingual LLM technology.
Collaborate with Google first-party partner teams to deliver new multilingual technologies to production.

View Full Job Description

Upload your resume, increase your shortlisting chances by 80%

About The Company

Google

1183 Active Jobs

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.