Systems Architect 4 - AI / Distributed Systems Location: Dallas | Charlotte | San Francisco Bay Area Must have skills Experience in designing and implementing high-performance, large-scale distributed systems Implementing and Deploying AI/ML platforms at scale. Building and Creating agent architectures, evaluation frameworks, know-hows of prompt/context engg, MCP servers. Hands on experience in LLM inference optimization, batching and caching strategies Experience in Kubernetes and Cloud Infrastructure Strong in any programming Language Expertise in designing agent data stack & retrieval systems (vector dbs, hybrid search ,data freshness , memory , graph reasoning , BM25 etc ) Key Responsibilities Architect and deliver scalable, high‑performance distributed systems. Design and deploy AI/ML and GenAI platforms at enterprise scale. Build agent‑based architectures, including prompt/context engineering, MCP servers, and evaluation frameworks. Optimize LLM inference pipelines (batching, caching, latency, throughput). Design agent data & retrieval stacks (vector DBs, hybrid search, memory, graphs). Lead Kubernetes‑based, cloud‑native deployments. Provide technical leadership, architecture governance, and hands‑on guidance.