ABOUT SQUAREOPS
SquareOps is a managed DevOps and SRE company. We run production infrastructure for cloud-native product teams — startups, scale-ups, and enterprises — across AWS and GCP. Our clients build on modern stacks: Kubernetes, ECS, serverless, and AI-powered applications. Several of our clients operate in fintech and healthcare, where uptime, security, and information discipline are non-negotiable.
This is not a legacy IT shop. You will operate infrastructure that is genuinely cutting edge — the kind of stack that most engineers only read about.
We are scaling our 24x7 shared SRE operations team and looking for 3 L1 SRE Engineers to join us.
WHAT YOU WILL DO
Monitor cloud infrastructure (AWS primary, GCP secondary) across multiple client environments simultaneously
Respond to alerts across Slack, Google Chat, and MS Teams within defined SLAs
Triage incidents following ITSM processes: log, classify, escalate, and close with documentation
Execute runbooks for standard operational tasks and known issue patterns
Perform clean shift handoffs with accurate status updates
Coordinate with L2 engineers and client stakeholders during escalations
Participate in change windows as an executor — implementing pre-approved changes safely
Handle client credentials, access, and operational data with strict infosec discipline
WHAT WE ARE LOOKING FOR
Must Have
3–5 years in cloud operations, infrastructure support, or managed services
Linux fundamentals — process management, log navigation, basic troubleshooting
Networking basics — DNS, TCP/IP, HTTP/S, load balancers, VPCs
Scripting in Bash and/or Python for operational tasks; application development experience is not required
Hands-on AWS — EC2, ECS, RDS, CloudWatch, IAM, S3 at minimum
Experience in an ITSM-governed environment — incident management, SLA adherence, escalation paths
Experience working in or alongside security-conscious environments, with credential hygiene, least-privilege access, and secure information handling as second nature
Strong written and verbal English — direct client communication is part of the role
Comfortable with rotational shifts including nights and weekends
Good to Have
Hands-on Kubernetes or ECS exposure: familiarity with pods, deployments, and services (Kubernetes) or tasks and services (ECS)
GCP experience
AWS Certified Cloud Practitioner or SysOps Administrator
Observability tools — Grafana, Prometheus, Loki, CloudWatch dashboards
Prior experience in a multi-client MSP or shared services environment
Basic CI/CD and containerisation awareness (Docker, EKS, ECS)
Exposure to fintech or healthcare client environments
WHO THRIVES HERE
WHY JOIN SQUAREOPS
Operate infrastructure that is actually cutting edge
Our clients run Kubernetes, ECS, serverless, and AI-powered applications on AWS and GCP. You will operate modern, fast-moving production systems — including AI workloads — and build fluency with a tech stack that is shaping where the industry is headed. This is a meaningful upgrade from traditional cloud ops or legacy ITSM environments.
Night and weekend shifts are from home
Rotational work should be sustainable. Night shifts and weekend shifts are fully remote — no commute, no compromise on your time.
Shift allowances, separately paid
Night and weekend shifts come with dedicated allowances on top of your fixed salary. If you work the hard shifts, you are paid for them explicitly.
Grow fast across clients, not slow within one
Clear L1 → L2 → L3 progression. You work across multiple client environments from day one, and that breadth of exposure compresses years of learning into months.
AWS certifications, fully sponsored
Cloud Practitioner and SysOps Administrator — we sponsor both. We want our engineers certified and we invest in making that happen.
Small team, high visibility
You are not employee #4000. Your work is seen, your growth is a direct function of your capability, and you will have real ownership of your shift and your accounts from early on.