Sr. Systems Engineer - Observability

MROADS

Callao, VA

JOB DETAILS
JOB TYPE
Full-time
SKILLS
Administrative Skills, Amazon Elastic Compute Cloud (EC2), Amazon Web Services (AWS), Analysis Skills, Artificial Intelligence (AI), Automation, Best Practices, Business Strategy, Business Support, Change Management, Cloud Computing, Communication Skills, Computer Science, Configuration Management, Cross-Functional, Database Programming Languages, Detail Oriented, Establish Priorities, Git, GitHub, High Availability, Information Technology & Information Systems, Infrastructure as a Service (IaaS), Java, JavaScript, Leadership, Management of Information Systems/Technology (MIS), Metrics, Microsoft Windows Azure, Needs Assessment, Operations, Operations Management, Operations Processes, Organizational Skills, Persuasion Skills, Platform as a Service (PaaS), Presentation/Verbal Skills, Problem Solving Skills, Process Development, Process Management, Production Support, Production Systems, Project Schedule, Python Programming/Scripting Language, Quality Assurance Methodology, Reporting Dashboards, Requirements Management, Resource Management, Sarbanes-Oxley Act (SOX), Scalable System Development, Scripting (Scripting Languages), Service Delivery, Service Level Agreement (SLA), ServiceNow, Set Goals, Software Administration, Software Development, Software Engineering, Software as a Service (SaaS), Splunk, Status Reports, Systems Engineering, Team Player, Technical Delivery, Technical Leadership, Test Requirements, Time Management, Windows PowerShell, Writing Skills
LOCATION
Callao, VA
POSTED
Today
Description

JOB SUMMARY
The Observability Sr. Systems Engineer role will define, implement, govern, optimize, and monitor solutions to enhance the observability platform. The role will collaborate with architects from engineering, application, and enterprise/solution teams to develop, implement, and support logging, monitoring, reporting, and automation for infrastructure and application services. This role serves as a subject-matter expert performing research, analysis, design, creation, and implementation of observability systems/solutions to meet current and future requirements across the enterprise.

CANDIDATE PROFILE
Education and Experience Required:
• Undergraduate degree in an engineering or computer science discipline and/or equivalent experience/certification
• 7+ years' experience in information technology with hands-on technical/engineering roles, including:
  o 5+ years' experience using at least one of the following: JavaScript, TypeScript
  o 5+ years' experience developing applications in AWS
  o 5+ years' experience developing and supporting Java applications
  o 3+ years' experience using application deployment tools, including at least two of the following: Git, Harness, Terraform, and NPM (Node Package Manager)
  o 3+ years' admin experience with observability tools: Dynatrace, Splunk Cloud, Cribl
• Experience using Dynatrace Query Language (DQL) and/or Splunk Processing Language (SPL) to build dashboards, reports, and alerts to meet customer requirements.
• Experience in integrating observability tools with other ITOps solutions (ServiceNow, BigPanda, ReadyAPI, etc.)

Additional Preferred Experiences:
• Dynatrace, Splunk, Cribl, HashiCorp, or other application certifications
• Strong scripting experience in at least one of the following: PowerShell, Python
• Strong knowledge of emerging tools, software, applications, and AI solutions for attaining best-in-class IT technology across the enterprise.
• Experience in building scalable pipelines for collecting, processing, and analyzing metrics, logs, and traces.
• Experience in establishing and implementing observability best practices to standardize, monitor, and control usage/performance of solutions.
• Excellent verbal and written communication skills for a wide range of audiences, including executives, business stakeholders, and IT teams.
• Demonstrated experience delivering technology solutions in a fast-paced, deadline-driven enterprise environment.
• Excellent problem-solving skills.
• Ability to work independently and as part of a cross-functional team.
• Excellent understanding of change management, testing requirements, techniques, and tools to ensure quality and high availability of systems.
• Strong attention to detail with the ability to operate effectively across multiple priorities.

CORE WORK ACTIVITIES
• Build and maintain the DTT SOX Compliance solution in Dynatrace and GitHub.
• Design, implement, and maintain high-performance and scalable observability solutions for Kubernetes (EKS/ACK), DocumentDB, EC2, and other data sources in a complex enterprise environment.
• Collaborate with cross-functional teams to gather requirements, architect solutions, and deploy logging and monitoring solutions that align with business needs (including DTT SOX Compliance).
• Leverage in-depth knowledge of AWS, Azure, and Alibaba Cloud technologies, including IaaS, PaaS, and SaaS, to architect and manage deployments of logging and monitoring tools.
• Enable streamlined operational processes and efficient management of the Dynatrace infrastructure using scripting and automation.
• Take responsibility for infrastructure-as-code development and configuration management.
• Lead optimization efforts for the observability platform and explore alternative solutions using other automation technologies such as Cribl.
• Onboard data sources from various IT infrastructure and application components into observability tools (Dynatrace/Grail, Cribl).
• Provide technical leadership, oversight, governance, and direction for services related to solution delivery.
• Determine customer requirements and work with sourced resources to develop solutions.
• Provide and present status, analysis, and reporting to internal stakeholders, Senior Leadership, and Executive Management.
• Lead analysis of the current environment for deficiencies and provide solutions.
• Identify opportunities to enhance the service delivery, operations, and continual service improvement processes.

Delivering Technology
• Creates and enhances administrative, operational, and technical policies and procedures, adopting best practice guidelines, standards, and procedures for employees, contractors, and vendor engagements.
• Manages daily infrastructure operations to ensure the availability SLA is met for storage services.
• Interfaces with stakeholders to establish requirements and formulate priorities for infrastructure projects.
• Leads/assists in configuration management.
• Works in a concerted effort with application development and engineering teams to resolve complex issues.
• Provides oversight, collaboration, provisioning, management, and maintenance of technology products and service alternatives that improve the production services environment.
• Responsible for the establishment and continuous development of monitoring and alerting for all production environments.
• Develops internal processes and training to ensure team members have the skills and tools needed to support the production environment and deliver on project commitments.
• Provides consultation for routine and complex systems development.
• Facilitates achievement of expected deliverables and obligations of Services Providers.
• Coordinates with Product and Architecture & Development teams for deployment and production support activities.

Managing Work, Projects, and Policies
• Manages and implements work and projects as assigned.
• Generates and provides accurate and timely results in the form of reports, presentations, etc.
• Analyzes information and evaluates results to choose the best solution and solve problems.
• Provides timely, accurate, and detailed status reports as requested.

Delivering on the Needs of Key Stakeholders
• Understands and meets the needs of key stakeholders.
• Develops specific goals and plans to prioritize, organize, and accomplish work.
• Determines priorities, schedules, plans, and necessary resources to ensure completion of any projects on schedule.
• Collaborates with internal partners and stakeholders to support business/initiative strategies.
• Communicates concepts in a clear and persuasive manner that is easy to understand.
• Generates and provides accurate and timely results in the form of reports, presentations, etc.
• Demonstrates an understanding of business priorities.

Additional Responsibilities
• Manages time effectively and conducts activities in an organized manner.
• Presents ideas, expectations, and information in a concise, organized manner.
• Performs other reasonable duties as assigned by manager.

About the Company

MROADS