Job Description
About Hashgraph:
Hashgraph is a fast-growing software company committed to supporting, developing and servicing Hedera, an open source, proof-of-stake platform. Hedera is EVM-compatible and has been specifically built to meet the needs of enterprise and web3 applications, which require speed, security, stability and sustainability. Hedera's public network is governed by industry-leading organizations, spanning 11 sectors and 14 regions who oversee the development and direction of the decentralized platform.
About the role:
Hashgraph is seeking an experienced DevOps Manager to lead our DevOps team in supporting the operations of consensus nodes across Hedera testnet, previewnet, and preproduction environments. This role requires a hands-on technical leader who can balance strategic planning with day-to-day operational excellence in our web3 infrastructure.
As the DevOps Manager, you will lead a team of operations engineers while remaining technically engaged in building automation, improving infrastructure as code, and coordinating with Hedera Governing Council members. You'll be responsible for team development, process optimization, and ensuring 24/7 operational readiness of critical Hedera network infrastructure.
This role requires strong technical expertise in cloud infrastructure (particularly GCP), infrastructure as code tools (Terraform, Ansible), and container orchestration (Kubernetes), combined with proven people management skills to mentor, grow, and retain top engineering talent.
You may find yourself doing all of the following:
Lead and mentor a team of DevOps engineers, providing technical guidance and career development
Manage day-to-day operations of Hedera production and preproduction infrastructure
Coordinate with Hedera Governing Council members on operational matters and infrastructure requirements
Design and implement automation solutions to reduce operational toil and improve efficiency
Own and evolve infrastructure as code practices using Terraform and Ansible
Establish and maintain incident management processes, including on-call rotations and post-mortem reviews
Drive continuous improvement initiatives for monitoring, observability, and alerting systems
Manage capacity planning and scaling strategies for cloud and bare metal infrastructure
Ensure 24/7 operational readiness and lead response to critical incidents
Lead hiring efforts to grow the DevOps team, including defining role requirements, interviewing candidates, and making hiring decisions
Collaborate with development teams to improve CI/CD pipelines and deployment processes
Define and track team KPIs, SLOs, and operational metrics
Manage team budget and resource allocation
Interface with senior leadership on strategic planning and technical roadmap
Qualification Requirements:
B.S in Computer Science or a similar study
3+ years of people management experience leading DevOps or infrastructure engineering teams
7+ years of DevOps or software development experience
5+ years of experience running AWS / GCP / Azure cloud workloads at scale
Strong hands-on experience with Terraform, Kubernetes, and Ansible
Deeply familiar with operating and troubleshooting issues in a Linux environment
Proven track record of building high-performing teams and developing engineering talent
Experience with incident management, on-call rotations, and post-mortem processes
Deeply familiar with DevOps and software development lifecycle best practices
Strong written and verbal communication skills, including the ability to interface with senior leadership
Comfortable leading a fully remote, distributed team across multiple time zones
Other skills that are great to bring with you but that we can help you develop:
Experience in blockchain, web3, or distributed systems operations
Familiarity with the LGTM stack and observability best practices
Programming experience in Golang, Python, Bash, Java, or JavaScript
Experience with Jenkins Pipelines, Github, and Github Actions
Background in SRE principles and practices
Hashgraph is a fast-growing software company committed to supporting, developing and servicing Hedera, an open source, proof-of-stake platform. Hedera is EVM-compatible and has been specifically built to meet the needs of enterprise and web3 applications, which require speed, security, stability and sustainability. Hedera's public network is governed by industry-leading organizations, spanning 11 sectors and 14 regions who oversee the development and direction of the decentralized platform.
About the role:
Hashgraph is seeking an experienced DevOps Manager to lead our DevOps team in supporting the operations of consensus nodes across Hedera testnet, previewnet, and preproduction environments. This role requires a hands-on technical leader who can balance strategic planning with day-to-day operational excellence in our web3 infrastructure.
As the DevOps Manager, you will lead a team of operations engineers while remaining technically engaged in building automation, improving infrastructure as code, and coordinating with Hedera Governing Council members. You'll be responsible for team development, process optimization, and ensuring 24/7 operational readiness of critical Hedera network infrastructure.
This role requires strong technical expertise in cloud infrastructure (particularly GCP), infrastructure as code tools (Terraform, Ansible), and container orchestration (Kubernetes), combined with proven people management skills to mentor, grow, and retain top engineering talent.
You may find yourself doing all of the following:
Lead and mentor a team of DevOps engineers, providing technical guidance and career development
Manage day-to-day operations of Hedera production and preproduction infrastructure
Coordinate with Hedera Governing Council members on operational matters and infrastructure requirements
Design and implement automation solutions to reduce operational toil and improve efficiency
Own and evolve infrastructure as code practices using Terraform and Ansible
Establish and maintain incident management processes, including on-call rotations and post-mortem reviews
Drive continuous improvement initiatives for monitoring, observability, and alerting systems
Manage capacity planning and scaling strategies for cloud and bare metal infrastructure
Ensure 24/7 operational readiness and lead response to critical incidents
Lead hiring efforts to grow the DevOps team, including defining role requirements, interviewing candidates, and making hiring decisions
Collaborate with development teams to improve CI/CD pipelines and deployment processes
Define and track team KPIs, SLOs, and operational metrics
Manage team budget and resource allocation
Interface with senior leadership on strategic planning and technical roadmap
Qualification Requirements:
B.S in Computer Science or a similar study
3+ years of people management experience leading DevOps or infrastructure engineering teams
7+ years of DevOps or software development experience
5+ years of experience running AWS / GCP / Azure cloud workloads at scale
Strong hands-on experience with Terraform, Kubernetes, and Ansible
Deeply familiar with operating and troubleshooting issues in a Linux environment
Proven track record of building high-performing teams and developing engineering talent
Experience with incident management, on-call rotations, and post-mortem processes
Deeply familiar with DevOps and software development lifecycle best practices
Strong written and verbal communication skills, including the ability to interface with senior leadership
Comfortable leading a fully remote, distributed team across multiple time zones
Other skills that are great to bring with you but that we can help you develop:
Experience in blockchain, web3, or distributed systems operations
Familiarity with the LGTM stack and observability best practices
Programming experience in Golang, Python, Bash, Java, or JavaScript
Experience with Jenkins Pipelines, Github, and Github Actions
Background in SRE principles and practices