Bachelor's degree and eight years of related increasingly technical work experience or a combination of education and relevant experience. Strong, demonstrated knowledge of Linux and demonstrated experience managing complex multiuser HPC clusters and large-scale research storage environments are required as well.
Advanced knowledge of Linux is required; experience managing, using, supporting and consulting on research computing cyberinfrastructure in an academic or research environment is strongly preferred. Proven ability to deliver outstanding system and service administration and end-user support in a thorough and timely manner is needed. This position requires that you be able to juggle multiple competing priorities, work quickly and accurately, and demonstrate initiative in conceptualizing and moving technical projects successfully to completion. The position must be able to do independent analysis, troubleshooting and problem resolution, but also must work collaboratively with other team members and across organizational group boundaries.
This position requires hands-on experience building and supporting multi-tenant Linux servers/clusters and their associated networks, file systems and storage devices in production research environments. Specifically, this technical knowledge needed to be successful in this positon includes:
Expert demonstrated knowledge of clustered Linux systems, including securing systems, and day-to-day troubleshooting, monitoring, support, software packaging, and working within industry-wide best practices
Experience administering, configuring, and supporting HPC clusters, including systems with accelerators, and high performance file systems and storage. This includes hardware installation, configuration, upgrades and repairs
Knowledge of and experience utilizing data and system security techniques, practices and standards as they relate to HPC systems, storage and networks
Experience installing and supporting parallel computing environments (e.g. OpenMPI, MVAPICH, etc.)
Hands-on experience installing, configuring and supporting job schedulers and resource managers (e.g., SLURM, OGE, LSF, Torque, Maui, etc.)
Familiarity with deploying virtualization technologies and basic knowledge of container technologies
Exceptional written and verbal communication skills
Experience using shell scripts, programming languages (Python), and programming automated system management tools, both at a general level (e.g. Puppet) and at a cluster-level (e.g. Rocks)
Experience installing, configuring, managing and supporting GPFS parallel file systems is desired but not required
Familiarity with TCP/IP, Internet Routing Protocols, private and public networks, VLANs, Firewalls, Load Balancers, addressing schemes, subnet creation and subnet masking. Proven ability to troubleshoot basic network issues and communicate and work with a team of network engineers to solve possible network design issues in HPC
Familiarity with the intersection of storage and networking disciplines: i.e. transport media, speeds of media, storage networks, IP based storage delivery, other storage delivery technologies
Experience with some the following applications: Git, Apache, Kerberos, LDAP
Software installation and maintenance experience supporting research codes and clients
Exceptional client service and communication, focusing on proactive system administrator actions and interactions to reduce or remove barriers to clients’ efficient use of resources to advance research
Imagine a world without search engines or social platforms. Consider lives saved through first-ever organ transplants and research to cure illnesses. Stanford University has revolutionized the way we live and enrich the world. Supporting this mission is our diverse and dedicated 17,000 staff. We seek talent driven to impact the future of our legacy. Our culture and unique perks empower you with:
Freedom to grow. We offer career development programs, tuition reimbursement, or audit a course. Join a TedTalk, film screening, or listen to a renowned author or global leader speak.
A caring culture. We provide superb retirement plans, generous time-off, and family care resources.
A healthier you. Climb our rock wall or choose from hundreds of health or fitness classes at our world-class exercise facilities. We also provide excellent health care benefits.
Discovery and fun. Stroll through historic sculptures, trails, and museums.
Enviable resources. Enjoy free commuter programs, ridesharing incentives, discounts and more!