Senior Engineering Manager, Cloud Platform
full-time
senior
Posted 1 day ago
About this role
Get to Know Us
Horizon3.ai is a fast-growing, remote cybersecurity company dedicated to the mission of enabling organizations to proactively find, fix and verify exploitable attack vectors before criminals exploit them. Our flagship product, the NodeZero™ platform, delivers production-safe autonomous pentests and other key assessment operations that scale across the largest internal, external, cloud, and hybrid cloud environments. NodeZero has been adopted by organizations of all sizes, from small educational institutions to government agencies and Global 100 enterprises. It is used by IT Ops/SecOps teams, consulting pentesters, and MSSPs and MSPs.
We are a fusion of former U.S. Special Operations cyber operators, startup engineers & operators, and formerly frustrated cybersecurity practitioners. We're committed to helping solve our common security problems: ineffective security tools and false positives that result in alert fatigue, blind spots, “checkbox” security culture, the cybersecurity skills shortage, and the long lead time and expense of hiring outside consultants. Collectively, we are a team of learn-it-alls, committed to a culture of respect, collaboration, ownership, and results.
What You’ll Do
We are looking for an engineering leader to lead Horizon3’s infrastructure platform team. The platform team builds capabilities that let product development teams rapidly launch new product features by providing self-service environments and infrastructure. This leader will also be chartered to establish a Site Reliability function that defines and drives investments in operational excellence, ensuring Horizon3’s product and service offerings meet customer and business expectations.
You will:
Enable Feature Development Teams
- Lead software engineering teams providing infrastructure-as-code to manage cloud infrastructure. Provide high-quality IaC components and frameworks that application development teams can leverage and extend to self-serve their infrastructure provisioning.
- Establish governance and mechanisms that let application development teams self-serve infrastructure provisioning while adhering to best practices and controls.
- Provide documentation, training, and support to ensure feature dev teams are leveraging self-service capabilities.
Establish an SRE Function
- Hire experienced site reliability staff, and a line manager to grow and oversee the SRE team.
- Professionalize incident management. Define and document incident processes and practices for your SRE team and for the application feature teams. Make tool and vendor decisions to support processes.
- Drive incident professionalism across the engineering organization through training and process adoption.
Drive Engineering Excellence in Platform Engineering
- Establish design-before-build discipline. Facilitate lightweight design documents, architectural decision records, and working group reviews. Outline operational lifecycles, “Day 2” concerns, and developer experience as part of infrastructure architecture decisions.
- Use design reviews, code reviews, and blameless retrospectives to drive a culture of quality and excellence in engineering.
People Leadership and Project Management
- Balance developer support with execution on a roadmap of infrastructure engineering initiatives. Establish intake, allocate resources, provide visibility into backlogs to stakeholders, and manage prioritization against capacity.
- Directly manage a growing team of infrastructure engineers. Hire and develop line managers and staff / principal engineers. Ensure a strong bench of technical and leadership talent in your group.
As a Manager, you will be responsible for:
- Recruiting and onboarding talented individuals to support our organizational goals
- Mentoring, coaching, equipping, and developing your team
- Recognizing and retaining high performers
- Leading horizontally with peer Management & Senior Leaders
What You’ll Bring (Qualifications)
Technical Leadership
- Demonstrated experience leading teams operating SaaS service infrastructure.
- Deep hands-on experience deploying and operating production infrastructure on public cloud platforms (AWS strongly preferred; Azure and GCP familiarity a plus).
- Strong command of Infrastructure as Code, including Terraform; experience with Crossplane and GitOps patterns strongly preferred.
- Experience managing production Kubernetes environments at scale.
- Solid understanding of security best practices including zero trust architecture, secrets management, identity and access management, and software supply chain security.
- Experience building and operating self-service infrastructure platforms that enable application development teams, while balancing self-service and developer productivity with maintainability and security.
Site Reliability and Operations
- Experience leading or building