Senior AI/ML Engineer - Engineering Excellence (Full Stack Developer) Vice President

Citi

Jacksonville, FL

JOB DETAILS
SKILLS
Amazon Web Services (AWS), Analysis Skills, Apache Cassandra, Application Integration, Application Programming Interface (API), Architectural Services, Artificial Intelligence (AI), Automation, Banking Services, Best Practices, Brokerage, Cloud Computing, Communication Skills, Computer Science, Computer Services, Configuration Management, Consumer Loans, Continuous Deployment/Delivery, Continuous Improvement, Continuous Integration, Corporate Banking, Cross-Domain Solutions (CDS), Cross-Functional, Data Quality, Data Recovery, Data Science, Data Storage, Database Design, Database Technology, DevOps, Distributed Computing, Diversity, Docker, Enterprise Applications, Enterprise Protection, Financial Services, Google Cloud Platform (GCP), High Availability, Investment Services, Java, Java Platform Enterprise Edition (Java EE/J2EE), Large-Scale Systems, Machine Learning, Machine Tool, Maintain Compliance, Management Strategy, Mentoring, Microservices, Microsoft Azure, Modeling Languages, MongoDB, MySQL, Natural Language Parsing, Natural Language Processing (NLP), NoSQL, Operational Improvement, Oracle, PostgreSQL, Privacy Regulations, Problem Solving Skills, Process Improvement, Production Systems, Python Programming/Scripting Language, Query Optimization, REST (Representational State Transfer), Regulatory Compliance, SQL (Structured Query Language), Scalable System Development, Securities Investments, Software Engineering, System Architecture, Systems Administration/Management, Team Player, Technical/Engineering Design, Test Automation, Wealth Management
LOCATION
Jacksonville, FL
POSTED
2 days ago
Senior AI/ML Engineer - Engineering Excellence (Full Stack Developer)

Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you'll have the opportunity to grow your career, give back to your community and make a real impact.

Job Overview

Citi, the leading global bank, has approximately 200 million customer accounts and does business in more than 160 countries and jurisdictions. Citi provides consumers, corporations, governments, and institutions with a broad range of financial products and services, including consumer banking and credit, corporate and investment banking, securities brokerage, transaction services, and wealth management.

Our commitment to diversity includes a workforce that represents the clients we serve from all walks of life, backgrounds, and origins. We foster an environment where the best people want to work. We value and demand respect for others, promote individuals based on merit, and ensure opportunities for personal development are widely available to all. Ideal candidates are innovators with well-rounded backgrounds who bring their authentic selves to work and complement our culture of delivering results with pride. If you are a problem solver who seeks passion in your work, come join us. We'll enable growth and progress together.

We are seeking a highly skilled and experienced Senior AI/ML & Agentic AI Engineer with a robust background in Python, Java, database technologies, cloud platforms, and CI/CD DevOps practices to join our Engineering Excellence and Transformation organization. In this critical role, you will be at the forefront of designing, developing, and integrating cutting-edge AI/ML and Agentic AI solutions, including advanced AI assistant tools, to revolutionize engineering processes and drive significant operational improvements across our enterprise. You will leverage your comprehensive technical expertise to build scalable, resilient, and high-performance AI systems from conception to deployment, fostering a culture of innovation and continuous delivery.

Key Responsibilities:

  • Lead the design, development, and implementation of sophisticated AI/ML, Generative AI, and Agentic AI solutions, including AI assistant tools, in close collaboration with AI architects, product owners, and cross-functional engineering teams within the Engineering Excellence framework.
  • Architect and develop end-to-end AI systems, leveraging both Python for machine learning workflows and Java for scalable enterprise-grade backend services, ensuring seamless integration with existing and new applications.
  • Design and implement intelligent agentic systems that exhibit autonomous decision-making capabilities, enabling advanced automation and self-optimizing processes.
  • Develop, fine-tune, and optimize Large Language Models (LLMs) using both parameter-efficient techniques and full fine-tuning, focusing on their integration into robust, production-ready systems supported by both Python and Java components.
  • Drive the implementation of, and experimentation with, advanced generative AI methods such as prompt engineering and Retrieval-Augmented Generation (RAG), ensuring their effective deployment and performance in enterprise environments.
  • Own the deployment pipeline for AI models and associated services, leveraging CI/CD methodologies, containerization (Docker), and orchestration (Kubernetes) on leading cloud platforms (AWS, Azure, GCP) to ensure secure, scalable, and automated releases.
  • Apply deep knowledge of database technologies (SQL and NoSQL) to design efficient data storage, retrieval, and management strategies for AI applications, ensuring data integrity and performance.
  • Contribute to the establishment and enforcement of engineering best practices, architectural patterns, and tooling standards for full-stack AI development, advocating for maintainability, observability, and cost-efficiency.
  • Stay abreast of the latest advancements in AI/ML, Agentic AI, cloud technologies, and DevOps, proactively sharing knowledge and driving adoption of innovative solutions.
  • Ensure strict adherence to ethical AI guidelines, data privacy regulations, and compliance standards throughout the entire AI solution lifecycle.
  • Act as a mentor for junior engineers, providing expert guidance on Python, Java, cloud technologies, and CI/CD best practices, fostering a culture of technical excellence and continuous improvement.

Required Technical Skills:

  • Polyglot Programming Expertise:

    • Python: Expert proficiency in Python for AI/ML development, including data manipulation (Pandas), scientific computing (NumPy), and machine learning frameworks.
    • Java: Strong proficiency in Java (e.g., Spring Boot, Microservices, Enterprise Integration Patterns, RESTful APIs) for building scalable, high-performance, and resilient enterprise applications.
  • AI/ML & Agentic AI:

    • Extensive experience with Generative AI, Agentic AI principles, and the development of AI assistant tools.
    • Hands-on experience with LLMs and fine-tuning methods (e.g., LoRA, QLoRA, Adapter/Prefix Tuning, instruction tuning).
    • Practical knowledge of model optimization techniques (e.g., compression, quantization) and familiarity with tools such as DeepSpeed, vLLM, GPTQ, or similar.
    • Proficient in prompt engineering, prompt design tools/frameworks, and building robust RAG systems (hybrid search, multi-vector retrieval).
    • Proficient with machine learning frameworks (PyTorch, TensorFlow, Keras) and distributed training.
    • Strong skills in Natural Language Processing (NLP) techniques (NER, Dependency Parsing, Text Classification, Topic Modeling), transfer learning, and advanced learning paradigms.
    • Familiarity with generative AI tools and libraries like LangChain, LlamaIndex, Hugging Face, and major GenAI APIs (e.g., OpenAI, Gemini, Claude, AWS Bedrock).
  • Database Technologies:

    • Solid experience with both SQL (e.g., PostgreSQL, Oracle, MySQL) and NoSQL (e.g., MongoDB, Cassandra, DynamoDB) databases, including schema design, query optimization, and integration with applications.
  • Cloud Platforms:

    • Extensive hands-on experience with at least one major cloud provider (AWS, Azure, or GCP), including services for compute, storage, networking, AI/ML, and data.
  • CI/CD & DevOps:

    • Strong understanding and practical experience with CI/CD pipelines, automated testing, infrastructure as code (IaC), and configuration management.
    • Expertise with containerization (Docker) and orchestration (Kubernetes, OpenShift).
    • Familiarity with monitoring, logging, and alerting tools for production systems.
  • Security & Compliance:

    • Solid understanding of AI compliance, guardrails, Responsible AI practices, and enterprise security standards within a highly regulated environment.

Required Soft Skills:

  • Exceptional collaboration and communication skills, capable of effectively bridging the gap between diverse technical teams (AI/ML, Java, DevOps, Cloud) and non-technical stakeholders.
  • Proactive and analytical problem-solver, adept at navigating complex technical challenges and driving innovative, cross-domain solutions in a dynamic environment.
  • Ability to clearly articulate complex technical concepts, designs, and solutions to diverse audiences, both technical and non-technical.
  • Strong passion for continuous learning, innovation, and mentoring, contributing significantly to a culture of technical excellence and organizational transformation.

Qualifications:

  • At least 6 years of progressive experience in software engineering and AI/ML development, with a minimum of 5 years specifically focused on Generative AI, Agentic AI, and full-stack AI solutions.
  • Demonstrated portfolio of successful, impactful projects leveraging Python, Java, cloud services, and CI/CD/DevOps practices in an enterprise setting.
  • Extensive experience working with large-scale distributed systems and architecting solutions for high availability and performance.

Education:

  • Bachelor's or Master's degree in Computer Science, Data Science, Artificial Intelligence, Software Engineering, or a related quantitative field.

About the Company

Citi