Principal Data Architect - Office of the CTO

Allen Institute

Seattle, WA

JOB DETAILS
SALARY
$167,850–$209,750 Per Year
SKILLS
Academic Research, Best Practices, Biology, Biomedical Research, Budget Management, Cataloguing, Cloud Storage, Communication Skills, Computer Science, Conferences, Cost Control, Cross-Functional, Data Management, Data Sets, Data Storage, DataArchitect Data Modeling Tool, Database Extract Transform and Load (ETL), Ecosystems, Equal Employment Opportunity (EEO), Establish Priorities, Genetics, Genomics, HIPAA (Health Insurance Portability and Accountability Act), Human Health, Immunology, Information Science, Information/Data Security (InfoSec), Legal, Library and Information Science, Metadata, Microscopy, Neuroscience, Office Equipment, Organizational Skills, Physical Demands, Physiology, Regulations, Scientific Research, Standards Development, State Laws and Regulations, Stewardship, Sustainability, Taxonomies, Team Player, Technical Strategy, U.S. National Institute of Standards and Technology (NIST), Usability Engineering, Willing to Travel
LOCATION
Seattle, WA
POSTED
2 days ago

Principal Data Architect, Office of the CTO

The mission of the Allen Institute is to unlock the complexities of bioscience and advance our knowledge to improve human health. Using an open science, multi-scale, team-oriented approach, the Allen Institute focuses on accelerating foundational research, developing standards and models, and cultivating new ideas to make a broad, transformational impact on science. The Office of the Chief Technology Officer (OCTO) supports this mission by providing engineering infrastructure, as well as developing and implementing high-impact, data-driven tools for foundational bioscience.

We are seeking a highly motivated Principal Data Architect to lead the organization and curation of the Allen Institute’s extensive, multimodal biological data assets, spanning over 20 years and multiple petabytes of data across neuroscience, cell science, immunology, and more. This role operates as the primary owner of data governance and stewardship practices across the Institute, establishing standards, driving adoption, and advancing a centralized approach to data cataloging and lifecycle management. Working across scientific programs and technology teams, the Principal Data Curator will improve data discoverability and responsible data use, ensuring alignment with institutional and regulatory standards. Through this work, the role will help shape a sustainable data ecosystem that supports open science while balancing governance, security, and operational efficiency.

The mission of the Office of the Chief Technology Officer (CTO) is not only to provide state of the art engineering infrastructure to the Allen Institute as a whole, but to also address high-risk, high-reward questions in biology through AI.

At the Allen Institute, we believe that science is for everyone – and should be open to everyone. We are dedicated to combating biases and reducing barriers to STEM careers more broadly.

We also believe that science is better when it includes different perspectives and voices. We strive to make the Allen Institute a place where everyone feels like they belong and are empowered to do their best work in a supportive environment.

We are an equal-opportunity employer and strongly encourage people from all backgrounds to apply for our open positions.

Essential Functions

  • Establish and implement Institute-wide standards and frameworks to improve data discoverability, usability, and consistency across scientific and technology teams, influencing technical direction and decision-making across domains
  • Own and advance the Institute’s data governance and stewardship program, including data cataloging, and lifecycle management practices, operating with a high degree of autonomy as a recognized subject matter expert
  • Develop guidance and best practices for data ETL, lifecycle management, including creation, storage, archiving, and retirement of datasets aligned with scientific mission
  • Drive adoption of data governance practices across programs, ensuring alignment and sustained use of standards and tools, with impact measured through uptake and effectiveness
  • Ensure data governance practices align with relevant NIST, HIPAA, and institutional data security standards where applicable. Work with technology, security, and legal teams to help define processes for managing sensitive or restricted datasets and implement appropriate safeguards and compliance practices
  • Lead the development and evolution of a centralized data registry and associated metadata frameworks for large-scale, multimodal datasets
  • Influence decisions related to data storage, infrastructure, and cost optimization, helping balance scientific priorities with resource constraints
  • Serve as a technical and strategic advisor to scientific, engineering, and operational leaders on data management practices, tradeoffs, and long-term sustainability
  • Lead high-impact, cross-functional initiatives to improve data accessibility, reuse, and long-term sustainability across the Institute

*Note: Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions. This description reflects management’s assignment of essential functions; it does not proscribe or restrict the tasks that may be assigned.*

Required Education and Experience

  • Bachelor’s Degree in Information and Library Science, Data Management, Computer Science, or a related field; or equivalent combination of degree and relevant experience
  • Minimum of 7 years of relevant experience in data governance, data stewardship, or large-scale data management environments
  • Deep expertise in metadata standards, data lifecycle management, and governance frameworks
  • Experience working across complex, multi-stakeholder environments, ideally in scientific or research settings
  • Strong ability to influence without authority and translate complex concepts across diverse audiences
  • Familiarity with scientific datasets, ideally in biology or biomedical research environments, including high resolution microscopy, genomics, physiology, and video
  • Deep knowledge of data governance frameworks and regulatory standards, including NIST and/or HIPAA compliance
  • Understanding of cloud storage environments, data infrastructure, or research computing ecosystems

Preferred Education and Experience

  • Master’s degree or Ph.D. in Information Science, Data Management, or a related field, with over 5 years of experience in large data-intensive environments, ideally in a scientific or academic research setting
  • Advanced skills in developing and applying data valuation frameworks to prioritize data retention and decommissioning, particularly within budget-constrained environments
  • A track record of success aligning data management practices to scientific and technical goals in a complex, multi-stakeholder environment
  • Skilled in navigating and influencing at the intersection of technical and scientific teams, with the ability to communicate complex data management principles clearly to diverse audiences
  • Familiarity with information management principles, such as metadata standards (e.g., FAIR data principles) and taxonomy development for data cataloging
  • Experience with archival standards, data preservation techniques, and regulatory frameworks related to scientific data
  • Ability to work collaboratively across scientific, technical, and operational teams
  • Strong organizational and communication skills, with the ability to translate technical and policy concepts across diverse audiences

Physical Demands

  • Fine motor movements in fingers/hands to operate computers and other office equipment

Position Type/Expected Hours of Work

  • This role is currently working onsite and is expected to work onsite for the majority of working hours. The primary work location for this role is 615 Westlake Ave N., with flexibility to work remotely on a limited basis.

Travel

  • Occasional travel to national or international conferences

Additional Comments

  • **Please note, this opportunity offersrelocation assistance**
  • **Please note, this opportunity offers work visa sponsorship**

Annualized Salary Range

$167,850 - $209,750*

* Final salary depends on required education for the role, experience, and level of skills relevant to the role, along with work location, where applicable.

Benefits

Employees (and their families) are eligible to enroll in benefits per eligibility rules outline in the Allen Institute’s Benefits Guide. These benefits include medical, dental, vision, and basic life insurance. Employees are also eligible to enroll in the Allen Institute’s 401k plan. Paid time off is also available as outlined in the Allen Institutes Benefits Guide. Details on the Allen Institute’s benefits offering are located at the following link to the Benefits Guide: https://alleninstitute.org/careers/benefits.

It is the policy of the Allen Institute to provide equal employment opportunity (EEO) to all persons regardless of age, color, national origin, citizenship status, physical or mental disability, race, religion, creed, gender, sex, sexual orientation, gender identity and/or expression, genetic information, marital status, status with regard to public assistance, veteran status, or any other characteristic protected by federal, state or local law. In addition, the Allen Institute will provide reasonable accommodations for qualified individuals with disabilities.

About the Company

A

Allen Institute