Research Scientist, Gemini Vision, DeepMind

Google

Mountain View, CA

JOB DETAILS
JOB TYPE
Full-time, Employee
SKILLS
Architectural Design, Artificial Intelligence (AI), Business Strategy, Computer Science, Computer Vision, Cross-Functional, Data Mining, Deep Learning, Delivery Management, Employee Benefits, Equal Employment Opportunity (EEO), Experiment Design, GPU (Graphics Processing Unit), Machine Learning, Market Research, Modeling Languages, Natural Language Processing (NLP), Pattern Matching, Performance Analysis, Performance Management, Process Improvement, Prototyping, Reinforcement Learning, Research Skills, Scientific Research, State Laws and Regulations, Time Management
LOCATION
Mountain View, CA
POSTED
1 day ago
Applicants in the County of Los Angeles: Qualified applications with arrest or conviction records will be considered for employment in accordance with the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act.
In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees. Benefits for this role include:
  • Health, dental, vision, life, disability insurance
  • Retirement Benefits: 401(k) with company match
  • Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
  • Sick Time: 40 hours/year (increased to 69 hours/year for Seattle) including 5 discretionary sick days per instance
  • Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
  • Baby Bonding Leave: 18 weeks
  • Holidays: 13 paid days per year
Note: By applying to this position you will have an opportunity to share your preferred working location from the following: Mountain View, CA, USA; Cambridge, MA, USA; Kirkland, WA, USA; Los Angeles, CA, USA; New York, NY, USA; Pittsburgh, PA, USA; Seattle, WA, USA.

Minimum qualifications:

  • PhD degree, or equivalent practical experience.
  • Experience in one or more of the following areas: machine learning, recommendation systems, natural language processing, computer vision, pattern recognition, or artificial intelligence.
  • Experience in an applied research setting.
  • Experience with large language models, NLP, or generative AI.

Preferred qualifications:

  • Experience in large-scale training of multimodal foundation models.
  • Experience in running AI/ML training or inference workloads on Graphics Processing Units (GPUs) or Tensor Processing Units (TPUs).
  • A proven track record of designing and implementing agentic workflows and autonomous AI systems.

About the job

As an organization, Google maintains a portfolio of research projects driven by fundamental research, new product innovation, product contribution and infrastructure goals, while providing individuals and teams the freedom to emphasize specific types of work. As a Research Scientist, you'll set up large-scale tests and deploy promising ideas quickly and broadly, managing deadlines and deliverables while applying the latest theories to develop new and improved products, processes, or technologies. From creating experiments and prototyping implementations to designing new architectures, our research scientists work on real-world problems that span the breadth of computer science, such as machine (and deep) learning, data mining, natural language processing, hardware and software performance analysis, improving compilers for mobile platforms, as well as core search and much more.

As a Research Scientist, you'll also actively contribute to the wider research community by sharing and publishing your findings, with ideas inspired by internal projects as well as from collaborations with research programs at partner universities and technical institutes all over the world.

Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority.

We are pushing the boundaries across multiple domains. Our global teams offer different learning opportunities and varied career pathways for those driven to achieve exceptional results through collective effort.

The US base salary range for this full-time position is $147,000-$211,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.

Responsibilities

  • Conduct original research in multimodal AI (Gemini), including vision-language models (VLMs), image understanding, OCR and document intelligence, spatial reasoning and embodied perception, image-text alignment and retrieval, agentic multimodal systems, scaling laws, data infrastructure and pipelines, training data attribution, and data mixture optimization.
  • Design, train, and evaluate large-scale transformer-based architectures for image and video understanding.
  • Develop novel methods for multimodal pretraining, instruction tuning, alignment, and reinforcement learning.
  • Collaborate with cross-functional teams to transition research ideas into production-grade Gemini capabilities.
  • Contribute to research direction, experimental design, and scientific strategy within the Gemini organization.

Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form.

About the Company

Google