Data Center Cluster Architect
Summary
Posted: Jan 22, 2025
Weekly Hours: 40
Role Number:200582452
The Datacenter Systems Architecture team seeks an outstanding Cluster architect to design and optimize computer architectures specifically for high-performance computing (HPC) clusters. This position is a multi-disciplinary and cross-functional lead engineering role encompassing all aspects of computer system design. The candidate will have the skills and experience to create complex system architectures, surprise and delight our customers, and advance our products? performance, size, power, thermal and cost goals.
Description
Description
As a technical specialist, negotiate and document the solution details of the infrastructure from a physical, electrical, and logical perceptive of compute clusters within the datacenter. Collaborate and leverage domain expertise knowledge to provide guidance, and leadership to cross-functional engineering teams to integrate cluster network architectures into overall system architecture to ensure efficient data flow, impact product definitions, and meet scalability requirements.
Define the rack and cluster capabilities, configurations, and scale out requirements to support the deployment of dense compute and specialty compute workloads and applications, including but not limited to the following:
Pathfinding on novel cluster architecture choices with a broad group of architects and system engineers, networking, technical leads, and HW/SW stakeholders.
Creating optimized network designs for large-scale AI/ML clusters considering factors like bandwidth, latency, and scalability.
Influencing networking hardware and software components selection for the cluster, including switches, adapters, and protocols.
Analyzing network traffic patterns and implementing strategies to improve data transfer speeds within the cluster for target topologies and choice configurations.
Collaborate with mechanical, physical, electrical, thermal, power, networking, OS, SW, datacenter infrastructure stakeholders for performant scalable deployments.
Be innovative and curious. Explore and champion new product-level features and workflows.
Define, develop and utilize tools, scripts, automation and methods of system analysis for performance of compute clusters within a DC environment.
Mentor junior engineers to best practices and data-driven processes
The role may require occasional domestic and international travel.
Minimum Qualifications
Minimum Qualifications
? BS/MS in Electrical Engineering or equivalent with 10+ years of relevant industry experience.
? Possesses strong technical breadth across several computer subsystem technologies, e.g., CPU, GPU, TPU, storage, memory, power delivery, power management, high speed networking, I/O, thermal management.
? Has core competence and subject matter expertise architecting complex system architectures for general purpose compute, or specialty compute (GPUs, TPUs) systems running datacenter workloads for AI/ML applications.
? Comprehends the roles of HW/FW/SW layers and how they interact in system design.
? Ability to create, review and approve engineering requirement specification documents
Key Qualifications
Key Qualifications
Preferred Qualifications
Preferred Qualifications
? Possesses functional experience in defining and deploying datacenter cluster networking architectures over highly dense mesh networks and interconnected nodes for AI/ML based workloads.
? Detailed knowledge of network protocols, expertise in Ethernet, Infiniband, RoCE, UE, UAL, or other relevant networking protocols.
? Has strong analytical, verbal, written, and communication skills. Ability to summarize and effectively communicate technical issues and actions to key stakeholders and leadership teams
Education & Experience
Education & Experience
Additional Requirements
Additional Requirements
More
? Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
Similar Remote Jobs
Data Center Cluster Architect
Posted on: 02-02-2025 00:00
Cellular SOC Design Verification Engineer - Entry Level
Posted on: 02-02-2025 00:00
(Online Remote jobs) Southwest Airlines Remote Jobs $24 - No Experience
Posted on: 02-02-2025 00:00
Recruiter Remote Opportunity
Posted on: 02-02-2025 00:00
Technical Infrastructure Project Manager (Open to Remote)
Posted on: 02-02-2025 00:00
Dog Walker / Dog Sitter
Posted on: 02-02-2025 00:00
Remote Consumer Direct Mortgage Banker
Posted on: 02-02-2025 00:00
Customer Service Representative - Full Time (Remote)
Posted on: 02-02-2025 00:00
Join Apple as an At Home Advisor Empower Customers from Your Home
Posted on: 02-02-2025 00:00
Customer Service Representative ? Inbound only, NO SALES ? Permanent Full Time Positions
Posted on: 02-02-2025 00:00
Senior Technical Claims Specialist, Doordash Bodily Injury
Posted on: 10-10-2024 00:00
Sr. Director of Sales - Costco
Posted on: 28-01-2025 06:22
Work From Home Customer Service Coordinator - Specialty Servicing
Posted on: 16-07-2024 18:35
Senior Software Engineer - Oracle Service Cloud ? Remote Eligible
Posted on: 16-07-2024 18:55
Consultant Pharmacist - Long Term Care - Part Time (Corpus Christi, TX)
Posted on: 08-12-2024 17:20
Remote Administrative Support Specialist - No Degree Required
Posted on: 31-10-2024 05:15
Remote Hybrid Registered Dietitian (Full Time/Part Time)
Posted on: 19-02-2025 06:28
Manager, Business Development ? Meetings & Events Solutions
Posted on: 24-01-2025 04:53
Wealth Management: Relationship Manager
Posted on: 23-11-2024 06:30
Part Time Assistant Teacher - opener
Posted on: 17-02-2025 05:59