Lead/Senior Site Reliability Engineer

Zalo
Ho Chi Minh City
Full-time
4 weeks ago

We provide services that serve millions of customers, such as Zalo, ZMP3, BaoMoi, and Kiki. We are looking for an experienced SRE who brings a unique perspective, a passion for collaborating with cross-functional teams, and the ability to derive real-time insights from massive-scale data to build practical solutions and deliver exceptional user experiences at every touchpoint.
  • Run the production environment by monitoring availability and taking a holistic view of system health;
  • Build software and systems to manage platform infrastructure and applications;
  • Improve reliability, quality, and time-to-market of our suite of software solutions;
  • Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement;
  • Provide primary operational support and engineering for multiple large-scale distributed software applications.

What you will do

  • Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding;
  • Partner with development teams to improve services through rigorous testing and release procedures;
  • Participate in system design consulting, platform management, and capacity planning;
  • Create sustainable systems and services through automation and uplifts;
  • Balance feature development speed and reliability with well-defined service-level objectives.

What you will need

  • Ability to program (structured and OOP) in one or more high-level languages, such as Python or Golang;
  • Experience with dynamic resource management frameworks (Kubernetes, Nomad, YARN);
  • Experience managing infrastructure as code (e.g., Terraform);
  • Experience with version control (Git, SVN, ...) and configuration management (Ansible, Puppet, SaltStack, ...);
  • Experience with distributed storage technologies such as NFS, HDFS, Ceph and Amazon S3;
  • Proactive approach to identifying problems, performance bottlenecks, and areas for improvement.

Preferred skills and qualifications
  • Previous success in technical engineering;
  • Coding experience beyond simple scripts.
