offergenie_white
Ascendion

Lead Cloud Engineer

Ascendion

Dallas, TXRemote$60 - $65 an hour
Senior LevelCloud EngineerRemote
Apply with AI Cover Letter

Job Description

About Ascendion

Ascendion is a full-service digital engineering solutions company. We make and manage software platforms and products that power growth and deliver captivating experiences to consumers and employees. Our engineering, cloud, data, experience design, and talent solution capabilities accelerate transformation and impact for enterprise clients. Headquartered in New Jersey, our workforce of 6,000+ Ascenders delivers solutions from around the globe. Ascendion is built differently to engineer the next.

Ascendion | Engineering to elevate life

We have a culture built on opportunity, inclusion, and a spirit of partnership. Come, change the world with us:

Build the coolest tech for the world’s leading brands
Solve complex problems – and learn new skills
Experience the power of transforming digital engineering for Fortune 500 clients
Master your craft with leading training programs and hands-on experience

Experience a community of change makers!

Join a culture of high-performing innovators with endless ideas and a passion for tech. Our culture is the fabric of our company, and it is what makes us unique and diverse. The way we share ideas, learning, experiences, successes, and joy allows everyone to be their best at Ascendion.

About the Role
We are seeking an experienced Lead Cloud Engineer with strong expertise in monitoring, observability, and cloud platform reliability. The ideal candidate will design, implement, and manage end-to-end observability solutions using Datadog, Grafana, and related monitoring stacks. This role will partner closely with DevOps, SRE, Application, and Infrastructure teams to ensure the health, performance, and availability of mission-critical systems across multi-cloud environments.
Job Title: Lead Cloud Engineer Key Responsibilities
Design and Implement Monitoring Solutions: Architect and deploy scalable monitoring and observability frameworks using Datadog, Grafana, and Prometheus across AWS, Azure, or GCP environments.
Build Dashboards & Alerts: Develop comprehensive dashboards, alerts, and performance indicators to enable proactive monitoring of applications, microservices, APIs, and infrastructure.
Integrate Observability Tools: Integrate Datadog and Grafana with CI/CD pipelines, logging (ELK/CloudWatch), APM, and tracing systems for unified observability.
Define KPIs and SLOs: Work with application and platform teams to define Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets.
Automation & Infrastructure as Code (IaC): Automate provisioning, configuration, and management of monitoring tools using Terraform, CloudFormation, or Ansible.
Performance Analysis & Incident Management: Analyze performance bottlenecks, conduct root cause analysis, and drive post-incident reviews with actionable insights.
Governance & Best Practices: Establish observability standards, documentation, and best practices across development and operations teams.
Mentoring & Leadership: Lead a small team of cloud engineers, providing technical guidance, code reviews, and skill development support.
Minimum Qualifications
8+ years of experience in Cloud Engineering, DevOps, or Site Reliability Engineering roles.
Proven expertise with Datadog (metrics, logs, APM, RUM, synthetic monitoring, custom dashboards, alerting).
Strong hands-on experience with Grafana, Prometheus, and OpenTelemetry.
Solid understanding of cloud platforms – AWS, Azure, or GCP (certification preferred).
Proficiency in Infrastructure as Code (Terraform/CloudFormation) and automation scripting (Python, Bash, PowerShell).
Experience implementing observability pipelines, log forwarding, and distributed tracing.
Familiarity with Kubernetes, Docker, and microservices architecture.
Strong problem-solving, analytical, and communication skills.
Desired Qualifications
Experience implementing monitoring for Kubernetes clusters and serverless workloads.
Knowledge of incident response automation and AIOps capabilities in Datadog.
Exposure to security monitoring and compliance dashboards.
AWS Certified DevOps Engineer, Azure DevOps Expert, or Datadog Certified Professional preferred.
Location
Remote

Salary and Other Compensation: The hourly rate for this position is between [$60.00-65.00 per hour]. Factors which may affect pay within this range may include geography/market, skills, education, experience and other qualifications of the successful candidate.

Benefits: The Company offers the following benefits for this position, subject to applicable eligibility requirements: [medical insurance] [dental insurance] [vision insurance] [401(k) retirement plan] [long-term disability insurance] [short-term disability insurance] [personal days accrued each calendar year. The Paid time off benefits meet the paid sick and safe time laws that pertains to the City/ State] [12-15 days of paid vacation time] [6-8 weeks of paid parental leave after a year of service] [9 paid holidays and 2 floating holidays per calendar year] [Ascendion Learning Management System] [Tuition Reimbursement Program]

Want to change the world? Let us know.

Tell us about your experiences, education, and ambitions. Bring your knowledge, unique viewpoint, and creativity to the table. Let’s talk!
Preferred Skills
AWS Certified
Python AWS Certified
Python
Job details
Job ID

331769

Job Requirements

Lead Cloud Engineer

Location

Dallas, Texas, US

Recruiter

Binal

Email

[email protected]

About Ascendion
Ascendion is a full-service digital engineering solutions company. We make and manage software platforms and products that power growth and deliver captivating experiences to consumers and employees.

Our engineering, cloud, data, experience design, and talent solution capabilities accelerate transformation and impact for enterprise clients. Headquartered in New Jersey, our workforce of 6,000+ Ascenders delivers solutions from around the globe. Ascendion is built differently to engineer the next.