SRE Architect

Bengaluru, Karnataka, India | SRE | Full-time


About MoEngage

MoEngage is an intelligent customer engagement platform, built for customer-obsessed marketers and product owners. We enable hyper-personalization at scale across multiple channels such as mobile push, email, in-app, web push, on-site messages, and SMS. With AI-powered automation and optimization, brands can analyze audience behavior and engage consumers with personalized communication at every touchpoint across their lifecycle.

Fortune 500 brands and enterprises across 35 countries, such as Deutsche Telekom, Samsung, Ally Financial, Vodafone, and McAfee, along with internet-first brands such as Flipkart, Ola, OYO, Bigbasket, and ShareChat, use MoEngage to orchestrate their cross-channel campaigns and engage efficiently with their customers, sending 50 billion messages to 500 million consumers every month!

Our vision is to build the world’s most trusted customer engagement platform for the mobile-first world.

We promise to care about your customers as much as you do. That is what earns us top ratings for service and support in the Gartner Magic Quadrant, Gartner Peer Insights, and G2 Summer Reports. We have also been recognized as one of the 25 Highest Rated Private Cloud Computing Companies To Work For, in a list released by Battery Ventures, a global investment firm. The list is based on employee feedback on Glassdoor, where employees reported the highest levels of satisfaction at work during the first six months of the pandemic.

Vital stats about our scale

600M+ Monthly Active Users

70B+ Push Notifications per month

5 Data Centers

About the SRE team

The SRE team is the life of our operations at MoEngage. Managing a daunting fleet of 5000+ servers in over 5 regional data centers across multiple clouds, the SRE team helps teams at MoEngage build, release, monitor, and run the services that serve our customers reliably. It also takes care of big data operations, resource optimization, micro-services adoption, and security. At the scale at which MoEngage operates, every aspect of SRE's ingenuity is needed to run our systems smoothly. Everyone who joins the SRE team at MoEngage is expected to own the infrastructure and work with developers to improve the delivery pipeline wherever possible.

We handle more than a billion messages every day. Rest assured, you will be surrounded by really smart and passionate people as we continue to scale and build a world-class technology team.

Responsibilities:

 "In short, you will be responsible for all the tech built by the SRE team, and have the freedom to execute your ideas. The only restriction is that we want you to bring high standards and breathe them every day."

  1. Be accountable for infrastructure design, automation, stability, resilience, performance, monitoring, security, and implementation of the right practices.
  2. Design, architect, and implement best-in-class CI/CD pipelines and strive for continuous improvement.
  3. Collaborate with other engineering teams and create opportunities to improve the development/production environment.
  4. Containerize and orchestrate with K8s and drive micro-services adoption across multiple engineering functions.
  5. Own and build functional KPIs for service, incident, and infrastructure metrics.
  6. Identify and track metrics such as MTTR (mean time to recovery, repair, respond, or resolve) in order to exceed SLA expectations.
  7. Build services and maintain them once they are online by measuring and monitoring availability, latency, and overall system reliability.
  8. Build solutions and mentor the team for monitoring at scale with Prometheus and the TICK stack.
  9. Build automation for, and help developers with, one of the largest Elasticsearch / MongoDB / Kafka cluster deployments at MoEngage.
  10. Drive and participate in AWS cost-efficiency efforts.
  11. Work closely with team members to ensure best practices and strategic goals are incorporated into development work.
  12. Implement best practices, challenge the status quo, and keep tabs on industry and technical trends, changes, and developments to ensure the team is always striving for the best.
  13. Participate in 24/7 on-call rosters and guide team members through them.
  14. Customer-Obsession - Build a great customer experience internally for engineers using our products/services.

Skill Requirements:

  • Holistic understanding of high-availability, fault-tolerant, scalable, resilient and distributed systems.
  • Strong experience with AWS cloud computing infrastructure and its components.
  • Proven, hands-on experience handling large-scale infrastructure such as package management, EC2, SQS, S3, and MongoDB, and distributed systems such as Kafka, YARN, and Elasticsearch.
  • Familiarity with container orchestration tools (K8s, ECS, Swarm) and with build, artifact, packaging, and service discovery management tools.
  • Hands-on experience with containerization, container management, and cluster management - Kubernetes, Docker, EKS, Apache Mesos/Marathon, etc.
  • Hands-on experience with AWS public cloud offerings (components like EC2, Athena, SQS, IAM, S3, DynamoDB, cost optimization, etc.).
  • Hands-on experience with configuration management tools (Ansible, Terraform, etc.).
  • Experience with software development and a good understanding of any of the following languages - Python, Java, Go.
  • Strong experience with systems internals and administration (e.g. filesystems, inodes, system calls) and networking (e.g. TCP/IP, routing, network topologies and hardware, SDN).
  • Familiarity with task queue frameworks like Celery or Pika is a plus.
  • Experience with source code management and implementation of security best practices.
  • Know-how in gathering metrics across distributed systems (instances/containers) and generating automated notifications and reports.
  • Prowess in analyzing application bottlenecks and performance degradation, and in implementing automated processes/tools to detect such anomalies.
  • Ability and willingness to learn new programming languages, frameworks, and paradigms.
  • Good understanding of, and implementation experience with, 12-factor app principles.

At MoEngage, we are passionate about our team and technology. See below to learn more about us and our technology.

Tech @MoEngage | Scale @MoEngage | Life @MoEngage