Looking for an AI Application Engineer || Santa Clara, CA (3 days onsite per week)

TechnoGen Inc

(remote)


TECHNOGEN, Inc. has been a proven leader in providing full IT services, software development, and solutions for 15 years.

TECHNOGEN is a small, woman-owned minority business with GSA Advantage certification. We have offices in VA and MD and offshore development centers in India. We have successfully executed 100 projects for clients ranging from small businesses and non-profits to Fortune 50 companies and federal, state, and local agencies.


Hi,

Greetings of the day!

We are looking to hire a talented professional for the job opportunity below with one of our clients.

If you're interested, please share your updated resume at your earliest convenience, and I'll be happy to provide more details about the role.

Position: AI Application Engineer

Location: Santa Clara, CA (3 days onsite per week)

Duration: Long Term Contract

Job Description:

We are seeking an AI Application Engineer to support the development and delivery of next-generation AI-powered applications built on infrastructure. This role will focus on production-grade LLM application engineering, RAG quality, prompt engineering, AI safety, and orchestration of complex multi-step AI pipelines.

Day-to-Day Responsibilities

Design, develop, and optimize production-grade LLM-powered applications

Own AI quality, RAG accuracy, prompt engineering, and AI safety across multiple applications

Develop and maintain multi-step LLM orchestration pipelines using LangChain, LlamaIndex, or custom frameworks

Implement and optimize RAG pipelines including chunking strategies, embedding selection, reranking, and hybrid search

Design multi-turn conversational AI experiences with context management and session memory

Integrate technologies including NIM, NeMo, NeMo Guardrails, and Riva into enterprise AI applications

Build automated evaluation pipelines for model quality, hallucination detection, regression testing, and release gating

Perform latency profiling and optimization across multi-step LLM call chains

Implement AI safety guardrails including prompt injection prevention, jailbreak mitigation, and topical control

Collaborate with globally distributed engineering and product teams to deliver scalable AI solutions

Support deployment, monitoring, and continuous improvement of AI applications in production environments

Basic Qualifications:

4 to 7 years of software engineering experience, with at least 2 years focused on production LLM application development

Expert-level experience with Python for AI/ML application development and async programming

Strong expertise in prompt engineering including system prompts, few-shot prompting, and instruction tuning

3 years of hands-on experience with multi-step LLM orchestration frameworks such as LangChain or LlamaIndex

3 years of experience designing and optimizing RAG pipelines and retrieval systems

3 years of experience with vector databases, similarity search tuning, and reranking techniques

3 years of hands-on experience with NeMo, NeMo Guardrails, and Riva

3 years of experience implementing AI safety and guardrails for customer-facing applications

Strong knowledge of automated AI evaluation frameworks such as RAGAS or TruLens

3 years of experience profiling and optimizing latency in multi-step AI pipelines

Ability to work onsite in Santa Clara, CA

Preferred Qualifications

Experience with adaptive learning systems or recommendation engines

Knowledge graph integration experience with RAG architectures

Experience with multi-agent orchestration patterns

ServiceNow API integration experience

Prior experience building AI products on infrastructure

Experience with streaming LLM response handling and real-time AI applications

Technology Stack

Python

LangChain

LlamaIndex

NeMo

NeMo Guardrails

Vector Databases

RAGAS / TruLens

LLM APIs and orchestration frameworks

Education

Bachelor's degree in Computer Science, Engineering, Artificial Intelligence, or equivalent work experience.

Ranjitha P | Sr. IT Recruiter

ranjitha.p@technogeninc.com
