Site Reliability Engineer

15000 人民币~20000 人民币/每月

全职
3~5年
刷新于 6 小时前
258 查看
48 申请
北京
分享
职位职责
Ensuring a stable computing cluster is critical to a quantitative company. You will have the opportunity to work with high-performance computing clusters, designing systems to collect various metrics related to the cluster and ensuring the stability of the platform. Your responsibilities will include researching and evaluating new technologies and solutions, resolving issues in large-scale cluster operations, and effectively supporting the sustainable development of the company.
职位要求
Bachelor's degree or higher in Computer Science, Software Engineering, or a related field. 3+ years of experience with related tools is preferred. Strong background in computer science including Linux operating systems, networks, and storage I/O principles. Experience with at least one programming language under Linux, such as Bash, Python, or Go. Proficient in Kubernetes, with knowledge of its deployment architecture and management. Experience with large-scale cluster troubleshooting, performance optimization, and observability is preferred. Mastery of process-oriented and standardized thinking methods. Excellent learning, logical reasoning and problem solving skills. Ability to design and implement solutions based on actual requirements and to quickly identify and resolve system issues. Responsible, meticulous, and practical, with good communication skills.
搜索你理想的职位
职位类别
城市或国家
职位
人才
博客
我的