Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working with a team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you on a path of accelerated growth, both personally and professionally.
Essential Roles and Responsibilities
· Design, deploy, and manage a container-native DevOps platform based on Kubernetes to support Razer's microservice, data, and AI development teams.
· Design and build proofs of concept to validate architecture and solutions, leveraging open-source CNCF and/or Kubernetes-native projects to meet platform team goals.
· Build and continuously improve our automation and CI/CD pipeline based on Argo Workflows and ArgoCD to achieve a smooth, easy-to-operate GitOps workflow.
· Initiate and drive new automation, process, or platform enhancements that improve developers’ and AI engineers’ productivity and efficiency in deploying and managing their cloud workloads.
· Define and operate tools that track site reliability metrics and SLAs, and maintain the service uptime and performance goals required by the various microservices.
· Assist in incident management, escalation and post-mortem processes to promote smooth operations.
· Collaborate with cloud software and AI development teams to ensure services, AI models, and data pipelines are reliable, scalable, and efficient.
· Participate in the on-call rotation to support the team's 24x7 coverage of critical systems.
· Other duties related to the reliability, observability, security, and cost-efficiency of Razer ID and AI services as assigned.
Requirements
· Bachelor's degree in Electrical/Computer Engineering, Computer Science, or related field.
· 3+ years of experience in a site-reliability engineering or DevOps role supporting container or cloud-native services.
· Strong understanding of system design, networking, web technologies, and high-traffic systems
· Demonstrated ability to declaratively automate cloud-based IaaS/PaaS deployments and applications using modern DevOps techniques and technologies
· Deep exposure to public cloud (GCP, AWS or Azure), Docker and Kubernetes
· Experience building pipelines and supporting services for mainstream programming languages and runtimes such as Node.js, Go, and Python.
· Broad experience with modern CI/CD pipelines (GitOps, Jenkins, Argo, etc.)
· Proficient in maintaining and troubleshooting Linux systems
· A fervent enthusiasm for declarative languages, particularly infrastructure-as-code
· Experience with monitoring tooling such as CloudWatch, Datadog, Prometheus, and Grafana
· Strong communication skills and the ability to document technical concepts clearly
Are you game?