Write and maintain automated scripts (using tools like Terraform or Ansible) to deploy cloud resources Assist DevOps teams by supporting developers that build pipelines that enable rapid, reliable, and automated application delivery Maintain consistency across environments by using immutable infrastructure principles, reducing deployment drift Monitor cloud environments for performance bottlenecks and execute capacity planning Diagnose and resolve complex, cloud-related issues during outages with minimal disruption to services Set up proactive monitoring, logging, and alerting using tools like Datadog, CloudWatch, or Prometheus Participate in on-call rotations to support production services, perform root cause analysis, and handle routine maintenance Skills. They will collaborate across NYPA IT daily to implement Infrastructure as Code (IaC), manage and provision systems and services, and optimize costs and security across major platforms like Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP).They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. Develop end‑to‑end solution designs, determining appropriate technologies, defining interface patterns, managing upstream and downstream integrations, architecting data models to support use cases, and producing required documentation and best‑practice guidance. This role requires a solid understanding of infrastructure technologies combined with rigorous project management discipline, strong documentation skills, and a proven ability to hold teams accountable within SAFe, Agile, and Hybrid delivery environments. Position Overview:Experienced IT Infrastructure Project Manager to lead the delivery of infrastructure initiatives across cloud, network, compute, storage, and security domains. p>In this role, you will utilize geotechnical engineering knowledge to complete technical engineering assignments in support of Kiewit Corporation business, including: The Citrix Infrastructure Engineer is responsible for the implementation and lifecycle management of a large scale Citrix estate, including Citrix Cloud–managed services, hybrid cloud integrations, and associated application delivery technologies. The role requires strong cross team and cross region collaboration, close engagement with Architecture and Security teams, and effective management of strategic vendor relationships. The platform provides instant access to digital assets without the friction of external wallets or bridges, while offering advanced social trading features and institutional-grade execution data. Ecosystem Knowledge: Strong experience with Solana is highly preferred, though exceptional candidates with deep expertise in other major ecosystems will be considered. Strong team or technical experience strong verbal, written interpersonal communication skills.2+ years of Technology Infrastructure Engineering and Solutions experience, or equivalent 1+ years of Strong Technical Troubleshooting skills1+ years Working knowledge of application monitoring tool like Splunk, AppDynamics, Grafana, Prometheus.1+ years of Root Cause Analysis of problem issues. Experience communicating technical issues clearly, demonstrated through support interactions, group projects, or ticket documentationAbility to produce clear written documentation, such as incident summaries, operational notes, or procedural updates. Engineers on this team wear multiple hats: infra engineering, application-layer debugging, and close collaboration with product and application teams to minimize overhead so those teams can stay focused on building. This team owns the full lifecycle of Abnormal's cell-based deployment architecture-bootstrapping new cells, deploying our entire application and infrastructure stack onto them, and keeping every cell healthy, isolated, cost-efficient, and compliant. White Plains, NY4 days ago They will collaborate across IT daily to implement Infrastructure as Code (IaC), manage and provision systems and services, and optimize costs and security across major platforms like Amazon Web Services (AWS), Microsoft Azure, or Google Cloud Platform (GCP). Job Functions & Responsibilities- Write and maintain automated scripts (using tools like Terraform or Ansible) to deploy cloud resources .
Lightning AI operates globally with offices in New York City, San Francisco, Seattle, and London, and is backed by Coatue, Index Ventures, Bain Capital Ventures, and Firstminute. You'll operate at the intersection of infrastructure, data, and product, building platforms for metrics, logs, traces, and alerting that power both internal operations and customer-facing visibility. New York City, NY30+ days ago p>As a Senior ML Infrastructure Engineer, you will own the data infrastructure that powers our underwriting, claims, and operational workflows, translating complex business logic into reliable, scalable pipelines that the entire company depends on. A strong track record of building scalable data systems and pipelines in production, with deep proficiency in Spark, Databricks, and modern data processing infrastructure (AWS or equivalent).
Backed by premier investors such as Paradigm and Dragonfly, the organization is building a full-service platform designed to make issuing, managing, and integrating digital dollars seamless for developers, fintechs, and institutional partners. This individual will bring a software engineering mindset to infrastructure by building reusable abstractions, internal services, and developer tooling that reduce cognitive load across the engineering organization. New York City, New York30+ days ago style="min-height:1.5em">As a Senior ML Infrastructure Engineer, you will own the data infrastructure that powers our underwriting, claims, and operational workflows, translating complex business logic into reliable, scalable pipelines that the entire company depends on. style="min-height:1.5em">A strong track record of building scalable data systems and pipelines in production, with deep proficiency in Spark, Databricks, and modern data processing infrastructure (AWS or equivalent). New York City, New York30+ days ago All legitimate communication comes from brellium.com, or no-reply@ashbyhq.com, and we will never ask for money or sensitive personal information as part of our hiring process. We’ve built AI-powered technology that helps healthcare providers deliver safer, higher-quality care - starting with the first real-time medical review platform built to fix clinical and compliance risks before they impact patients. White Plains, NY30+ days ago li style="text-align:justify">This role is a critical part of a larger cross-functional Client Information Technology team of Architects, Network Support, Incident Management, Cyber Security and others to ensure resiliency and availability of key Client digital IT assets and services. - Strong knowledge designing and implementing resilient solutions utilizing virtualization, Microsoft clustering technologies, enterprise storage and SAN solutions, Microsoft Server technologies, and software defined networking (SDNs).
Learn more: Interviewing at TRM: How We Hire and What Success Looks LikeAI Fluency at TRMAI fluency is a baseline expectation at TRM.We believe AI meaningfully changes how top performers operate. TRM Labs provides blockchain analytics and AI solutions to help law enforcement and national security agencies, financial institutions, and cryptocurrency businesses detect, investigate, and disrupt crypto-related fraud and financial crime. This role will assess acquired companies' infrastructure environments, develop integration strategies, and help drive successful transitions across enterprise IT systems, collaboration platforms, identity management, networking, security, and workplace technologies. Experience working across a broad range of infrastructure technologies, including collaboration platforms, identity and access management, networking, endpoint technologies, and cloud services. Built on open-source innovations and fueled by industry leading agentic AI technology, Corelight helps teams to detect advanced threats and close cases with unprecedented clarity and precision. Fueled by investments from top-tier venture capital organizations such as Crowdstrike, Accel and Insight, Corelight is one of the fastest growing network detection and response platforms in the industry. p>You will play a key role in owning and evolving our image pipeline, running validation environments and test clusters, and supporting both system-level and GPU hardware qualification. Lightning AI operates globally with offices in New York City, San Francisco, Seattle, and London, and is backed by Coatue, Index Ventures, Bain Capital Ventures, and Firstminute. Lightning AI operates globally with offices in New York City, San Francisco, Seattle, and London, and is backed by Coatue, Index Ventures, Bain Capital Ventures, and Firstminute. You will work at the intersection of software, hardware, and operations-developing automation, improving reliability, and scaling distributed storage systems across our bare-metal infrastructure. li>Own CI/CD and deployments: maintain and extend our GitHub Actions workflows and help migrate toward a dedicated CD tool with proper permissioning - the goal is fully automated, locked-down deploys via service accounts, no direct engineer access to production. Unify observability (likely first project): consolidate today's per-team alerting into a single view - system-to-system dashboards plus incident alerting that routes upstream service/vendor failures to the right impacted teams and on-call rotations. Top Requirements: Bachelor's degree in Computer Science, Engineering, or a related field, plus 4+ years of experience in IT infrastructure or data center operations; or 8+ years of relevant experience without a degree. With offices across the U.S. and clients ranging from Fortune 500 companies to government organizations, we provide opportunities that help professionals grow their careers while making an impact. li>Build, operate, and troubleshoot workloads on Kubernetes, using Kustomize and Helm, and platform tooling such as Argo CD, Argo Workflows, Argo Events, Argo Rollouts, Crossplane, and related controllers (External Secrets, Sealed Secrets, cert-manager, ingress such as Traefik, AWS Load Balancer Controller, External DNS) as appropriate to the environment. Develop and maintain scalable, reliable automation and integrations across AWS, GCP, and Azure, SaaS platforms, and custom services (including APIs and event-driven workflows).
Jersey City, New Jersey30+ days ago p style="text-align:inherit"/>US - NJ - Jersey City - 101 Hudson St - 101 Hudson (NJ2101), US - NJ - Pennington - 1300 American Blvd - Hopewell Bldg 3 (NJ2130). This includes our commitment to being an inclusive workplace, attracting and developing exceptional talent, supporting our teammates’ physical, emotional, and financial wellness, recognizing and rewarding performance, and how we make an impact in the communities we serve. We want to continue building on our Kubernetes-based platform by extending its capabilities to support more modern architectures and workflows, all in service of providing the best experience possible for our customers. For that reason, unless explicitly requested and/or approved by the recruiting team, AI tools (including but not limited to generative AI assistants such as ChatGPT, Claude, Copilot, etc.) may not be used during any interview sessions, including but not limited to: -Recruiting phone interviews. Jersey City, NJ30+ days ago p>JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses, and many of the worlds most prominent corporate, institutional, and government clients under the J.P. Deep knowledge of one or more areas of infrastructure engineering such as hardware, networking terminology, databases, storage engineering, deployment practices, integration, automation, scaling, resilience, or performance assessments. JERSEY CITY, NJ30+ days ago Primary Skill: Windows / Linux Administration, VMware / Hyper V, Azure / AWS Cloud, Networking (TCP/IP, DNS, DHCP, VPN), PowerShell / Bash / Python, Terraform / Ansible, CI/CD Tools, Monitoring Tools (Splunk, Grafana, Prometheus), Backup & Disaster Recovery, Active Directory / IAM. " Automate infrastructure tasks using scripts or Infrastructure-as-Code tools (Terraform, Ansible, PowerShell, Python). White Plains, NY2 days ago div>If you are interested, please share your updated resume ASAP. Job Summary:. This role collaborates with IT and DevOps teams to implement Infrastructure as Code (IaC), provision and manage cloud resources, and optimize performance, security, and cost across cloud platforms such as AWS, Microsoft Azure, or Google Cloud Platform (GCP). p>RESPONSIBILITIES: - Design, build, and scale decentralized matching engine, risk system, and limit order book for blockchain protocol using CosmosSDK along with its surrounding backend systems.
- Experience with trading system development at a mid/high frequency firm writing code to communicate with exchanges (i.e. order entry and low latency feed handlers).
We believe that having people across different backgrounds, experiences, abilities, and perspectives enables us not only to build the best financial products, but to help us realize the best versions of ourselves. We're building the next generation of financial products alongside some of the biggest names in the market including Robinhood, DoorDash, Credit Karma, Amex, Discover, Intuit, Acorns, Visa and more. New York City, New York30+ days ago Backed by strong investor support and early customer traction, our team is composed of experts from OpenAI, Meta, Mandiant, Palantir, Cruise, Trail of Bits, and Aptiv. Founded in 2023 by Ari Herbert-Voss and Vlad Ionescu, RunSybil is on a mission to automate hacker intuition. em> Experience with highly scalable networked APIs, Postgres, data pipelines, and async processing Experience and opinions with implementing effective observability instrumentation and ensuring alerts are timely and actionable Experience building tools and systems that empower internal developers to deliver business value rapidly Experience building systems from scratch, making trade-offs, and executing autonomously in early-stage environments A product mindset and strong problem solving skills, with the ability to navigate ambiguity and focus on delivering customer value, for internal developers and the customers they build for A collaborative mindset, fostering a culture of mentorship, shared success, and continuous improvement Based in NYC Nice-to-have: Experience with building and supporting infrastructure in multiple AWS regions Experience with Python and Django Experience with Open Telemetry Experience with transaction, merchant, and/or location data Experience and/or interest in fintech and/or data products Experience with agentic development infrastructure and workflows Familiarity data science and analytics tools such as Pyspark, Databricks and Delta Lake, and Hex Why join Spade? Customers such as, FIS, Bilt, Mercury, Stripe, alongside many other leaders in fintech and financial services, trust Spade's data to enable personalized rewards programs, accurate applied spending rules, precise analytics requests, and innovative AI-powered features. Jersey City, NJ30+ days ago MongoDB Atlas: Operate Atlas clusters with right‑sizing and autoscaling policies, multi‑region/global configurations, and maintenance within change controls; configure security and networking (private endpoints/peering, RBAC, API governance, encryption); implement cost/capacity guardrails; manage snapshots and point‑in‑time restore with periodic DR testing. MongoDB: Administer replica sets and sharded clusters on‑prem with scaling and version upgrades through change governance; performance and resiliency via index/schema design, query plan analysis, server parameter tuning; robust backup/restore with PITR and DR runbooks; enterprise security hardening and integrated observability. New York, New York30+ days ago In this role you will build and maintain the platform layer supporting large-scale ML training, inference, and deployment. Familiarity with ML infrastructure a strong plus — training pipelines, inference serving, GPU workloads. |