Key Responsibilities
Customer Support and Troubleshooting:
Follow company policies to maintain high security and privacy standards while providing customer support.
AI Cluster Maintenance and Optimization:
Customer Training and Enablement:
Proactive Support and Monitoring:
Documentation and Reporting:
Required Qualifications
Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
3+ years in a customer-facing technical support role, preferably in HPC or AI environments.
Hands-on experience with AI clusters, distributed systems, or GPU-based computing.
Knowledge of machine learning frameworks such as TensorFlow, PyTorch, or JAX.
Experience with cluster management tools (e.g., SLURM, Kubernetes).
Familiarity with GPU computing, CUDA, and NVIDIA-specific technologies.
Understanding of networking (e.g., InfiniBand, RoCEv2) and high-performance storage systems.
Proficiency in Linux system administration and scripting (e.g., Bash, Python).
Excellent communication and interpersonal skills, including summarizing and handing off issues to other support staff across time zones.
Strong problem-solving ability and attention to detail.
Location: Remote
Contact
Brook Zhang
Tel & WeChat: 13917947687