Mediabistro logo
job logo

Golang & Kubernetes Engineer

Artech, Sunnyvale, CA, United States


Request ID: 80552-1
Title:

Golang & Kubernetes Engineer
Locations : Sunnyvale CA
Duration:

6 Months
Salary Range: $55.00- $60.00/Hour on W2 (All inclusive)
Applicants must be able to work on W2 without any Visa sponsorship

Job Description:

We are looking for a highly skilled

Golang & Kubernetes Engineer

to design, build, and operate large-scale cloud-native infrastructure supporting distributed machine learning workloads. The role involves managing multi-cluster Kubernetes environments, developing backend platform services in Golang, and optimizing ML model training and inference systems for performance, scalability, and reliability.
Key Responsibilities

Design, deploy, and operate large-scale

Kubernetes (EKS) environments , managing

multi-cluster infrastructure (20-25+ clusters)
Build and maintain

backend platform services using Golang

to support distributed systems and ML workloads
Manage full

Kubernetes cluster lifecycle , including upgrades, scaling, monitoring, and reliability improvements
Develop orchestration frameworks for

ML workloads , including training and inference pipelines
Implement and manage

security controls , including IAM, RBAC, and secure service-to-service communication
Deploy and optimize

distributed ML systems

using frameworks such as

PyTorch and Ray
Improve system performance across

latency, throughput, and availability

for model serving platforms
Collaborate with ML engineers, DevOps, and infrastructure teams to ensure scalable and reliable ML platforms
Build automation tools and infrastructure-as-code solutions to improve operational efficiency
Monitor system health, troubleshoot production issues, and ensure high availability of services
Required Skills & Qualifications
8-10 years of experience in software engineering or cloud infrastructure roles
Strong hands-on experience with

Kubernetes (EKS) at scale
Proficiency in

Golang , or strong backend systems development experience (e.g., Java, C++, Python with system design exposure)
Deep understanding of

distributed systems and cloud infrastructure principles
Experience in designing and operating scalable backend or platform services
Strong knowledge of

cloud security concepts (IAM, RBAC, network policies)
Experience working in production-grade, high-availability environments
Nice to Have (Preferred Skills)
Exposure to

MLOps frameworks

such as

PyTorch, Ray, Kubeflow, or similar
Experience with

machine learning infrastructure or model serving systems
Familiarity with

cloud platforms (AWS/Azure/GCP)

for AI and data solutions

Appreciate your quick response and please feel free to reach me out for any query you may have.

Thanks