Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.
Essential Roles and Responsibilities
· Architect and drive the design of a DevOps platform to facilitate deployment and management of container-native workloads based on Kubernetes, to support Razer's microservice, data, and AI development teams.
· Design and Build proof-of-concepts to validate architecture and solutions leveraging open-sourced CNCF and/or Kubernetes-native projects to meet platform team goals.
· Build and constantly improve our automation and CI/CD pipeline based on Argo Workflows and ArgoCD to achieve smooth and easy-to-operate GitOps workflow.
· Initiate and drive new automation, process, or platform enhancements that improves developers’ or AI engineers’ productivity and efficiency in deploying to and managing their cloud workloads.
· Ensure the platform covers the observability and monitoring needs to manage the differing requirements of service uptime and performance goals for Razer's landscape of microservices and AI workloads.
· Understand and design a platform that can support Razer's incident management processes.
· Collaborate with cloud software and AI development teams to ensure services, AI models, and data pipelines are reliable, scalable, and efficient.
· Participate in on-call rotation to support the team's 24x7 coverage of critical systems.
· Other duties related to the reliability, observability, security, and cost-efficiency of Razer ID and AI services as assigned.
Requirements
· Bachelor's degree in Electrical/Computer Engineering, Computer Science, or related field.
· 5+ years of experience in site-reliability engineering or DevOps role supporting container or cloud-native services, with at least 1 year of being in a technical leadership position.
· Strong understanding of system design, networking, web technologies, and high-traffic systems
· Demonstrates innovative methods of declaratively automating cloud-based IaaS/PaaS deployments and applications using modern DevOps techniques and technologies
· Deep exposure to public cloud (GCP, AWS or Azure), Docker and Kubernetes
· Experience building pipelines and supporting services for mainstream programming language such as node.js, Go, Python.
· Broad experience with modern CI/CD pipelines (GitOps, Jenkins, Argo etc)
· Proficient in maintaining and troubleshooting Linux systems
· A fervent enthusiasm for declarative languages, particularly infrastructure-as-code
· Monitoring tooling such as CloudWatch, Datadog, Prometheus, Grafana
· Good at communication and documenting technical concepts
Are you game?