Observability Lead / Architect role based in Warren, NJ (onsite only), requiring 10-12 years of experience. Must have expertise in the Grafana stack (metrics, traces, logs, Alloy agent, OTel Collector, exporters, application metrics), Linux troubleshooting, Python or a similar programming language, shell scripting, AWS, and Kubernetes (Helm charts).
Preferred skills include experience with other observability platforms (Splunk), DevOps, cloud operations, Terraform, and Jenkins CI/CD. The candidate should be proficient in architecture and solution design across AWS and Google Cloud, with a focus on platform services and web application monitoring, and should have expert-level DevOps skills in Terraform, Ansible, Jenkins, and Kubernetes.
Responsibilities include leading technical discussions, defining application architecture, analyzing current systems, creating roadmaps, designing infrastructure components, producing design documentation, contributing to enterprise architecture, researching emerging technologies, serving as a subject matter expert (SME), and promoting reuse of patterns and components.