What you’ll be doing • Ensure that our applications and environments are stable, scalable, secure and performing as expected. • Proactively engage and work in alignment with cross-functional colleagues to understand their
requirements, contributing to and providing suitable supporting solutions. • Develop and introduce systems to aid and facilitate rapid growth including implementation of deployment policies, designing and implementing new procedures, configuration management and planning of patches and for capacity upgrades • Observability: ensure suitable levels of monitoring and alerting are in place to keep engineers aware of issues. • Establish runbooks and procedures to keep outages to a minimum. Jump in before users notice that things are off track, then automate it for the future. • Automate everything so that nothing is ever done manually in production. • Identify and mitigate reliability and security risks. Make sure we are prepared for peak times, DDoS attacks and fat fingers. • Troubleshoot issues across the whole stack - software, applications and network. • Manage individual project priorities, deadlines, and deliverables as part of a self-organizing team. • Learn and unlearn every day by exchanging knowledge and new insights, conducting constructive code reviews, and participating in retrospectives. You Must Have • 2+ years extensive experience of Linux server administration include patching, packaging (rpm), performance tuning, networking, user management, and security. • 2+ years of implementing systems that are highly available, secure, scalable, and self-healing on Azure cloud platform • Strong understanding of networking, especially in cloud environments along with a good understanding of CICD. • Prior experience implementing industry standard security best practices, including those recommended by Azure • Proficiency with Bash, and any high-level scripting language. • Basic working knowledge of observability stacks like ELK, prometheus, grafana, Signoz etc • Proficiency with Infrastructure as Code and Infrastructure Testing, preferably using Pulumi/Terraform. • Hands-on experience in building and administering VMs and Containers using tools such as Docker/Kubernetes. • Excellent communication skills, spoken as well as written, with a demonstrated ability to articulate technical problems and projects to all stakeholders. Extra credits for ● Experience with these technologies: ● Pulumi with TypeScript or Golang ● Node.js ● Kubernetes ● Serverless infrastructure. ● Azure cloud ● Experience in governance processes and compliance validation, especially for financial services such as ISOm, SOC2, PCI etc. ● Experience working in product startups. ● Experience in administering and scaling PostgreSQL
Corporate card & spend management platform for Indian startups
Location
Mumbai, MH, IN / Mumbai, Maharashtra, IN
Experience
3+ years
Total raised
$8.7M
Last stage
Seed
Investors
No applications, no recruiter spam. Just the intro.
A few questions to make sure this role is the right shape for you. Two minutes.
I write the intro, send it to the founder, and handle the back-and-forth.
If they’re a yes, I book the chat. You show up — that’s the whole job-hunt.