Description
At Webflow, we're building the world's leading AI-native Digital Experience Platform. As a Senior Infrastructure Engineer, you will own and evolve the cloud substrate that Webflow's product and engineering teams depend on. This includes our compute layer, EKS fleet, networking, and cloud operations across AWS and GCP.
You will design and maintain the networking fabric that connects Webflow's services, ensuring reliability, security, and scalability across our cloud environments. You will also build and enforce guardrails around IAM, SCPs, and scoping permissions that keep infrastructure secure and auditable without slowing engineers down.
You will drive FinOps across Webflow's cloud footprint, owning cost attribution, right-sizing recommendations, and surfacing waste before it becomes a problem. You will partner with Infra teams on instance type selection, capacity planning, and right-sizing decisions that impact application performance.
You will build and maintain AI-powered automation that improves how we manage cloud infrastructure, from policy-as-code and drift detection to LLM-assisted runbook generation. You will help define the culture of this growing team as it expands its international presence.
To succeed in this role, you will have a background as an infrastructure or cloud engineer with an enthusiasm for automation and code, or a background as a software engineer with deep enthusiasm for cloud infrastructure and distributed systems. You will have 5+ years of experience owning and operating cloud infrastructure in a customer-facing environment that allows for little to no downtime.
You will have deep hands-on experience with AWS and a strong opinion on what good cloud operations look like. You will have experience managing Kubernetes clusters at scale, including upgrades, node group management, autoscaling, and cluster add-on lifecycle. You will have experience with infrastructure-as-code tools like Pulumi or Terraform, and a strong preference for changes made through code, not consoles.
You will have experience navigating multi-region or multi-cloud environments on AWS or GCP. You will stay curious and open to growth, demonstrating a proactive embrace of AI, and actively building and applying fluency in emerging technologies to elevate how we work, drive faster outcomes, and expand collective impact.
Preferred skills include experience with Karpenter, cluster autoscaler, or other Kubernetes-native scaling tooling, experience with GCP infrastructure alongside AWS in a multi-cloud environment, experience building AI-assisted infrastructure tooling, including cost optimization loops, anomaly detection, or policy-as-code with LLM assistance, and experience contributing to multi-region architecture including data residency, regional failover, or latency-based routing.
As a collaborative and strategic thinker, you will thrive in a fast-paced environment where you will navigate ambiguous situations with ease, gathering data and making progress even with incomplete information or unclear requirements. You will be comfortable with ambiguity and will be able to communicate effectively with both technical and non-technical stakeholders.
If you don't meet 100% of the qualifications, you should still seriously consider applying. Studies show that you can still be considered for a role if you meet just 50% of the role's requirements.
Webflow is committed to building lasting customer trust, winning together, reinventing ourselves, and delivering with speed, quality, and craft. We offer a range of benefits, including ownership in what you help build, comprehensive medical, dental, and vision plans, flexible vacation, paid holidays, and a sabbatical program to help you recharge and come back inspired.
We also offer access to mental health resources, therapy and coaching, a 401(k) with 100% employer match, and monthly stipends that flex with your life. All full-time, permanent, non-commission employees are eligible for our annual WIN bonus program.