We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

HPC Engineer - TOP SECRET

AMERICAN SYSTEMS
401(k)
United States, Virginia, Arlington
1401 South Clark Street (Show on map)
Nov 21, 2024
Job Title / Level
HPC Engineer - TOP SECRET
Clearance Required?
Top Secret/SCI
Location:
Arlington, VA 22204 US (Primary)
% Travel
20 - 30%
Job Description

THIS POSITION COMES WITH A 10K SIGNING BONUS!

Are you an HPC Engineer looking to be part of something that is truly unique - not just a job, but a mission? We are in search of an application engineer with specialties and background in large-scale, high-performance computing, supercomputing and parallel processing design to optimize and build applications and tune them for speed and efficiency in an R&D setting. AMERICAN SYSTEMS is building the next generation of high-speed analytics needed to protect our nation, and the environments needed to support them. We need you to join us on the ground floor.

As an HPC Engineer and member of our select team, you will:



  • Apply comprehensive knowledge of High Performance Computing (HPC) systems, comprised of high-speed, multi-petabyte Lustre file systems, Red Hat Enterprise Linux (RHEL) servers, CPU/GPU compute nodes, and high performance storage arrays, using Ethernet, fiber, Omni-Path, and InfiniBand interconnections.
  • Provide functional and technical expertise in support of user-developed software and technical advice and leadership to other technical staff
  • Join us at an exciting time as we introduce next-generation technologies
  • Be part of a group that provides game-changing capabilities to the nation
  • Receive a robust benefits package that includes our Employee Stock Ownership Plan!



A week in the life of an HPC Engineer:



  • Utilize a wide variety of skills in system and network monitoring; large-scale systems administration; scripting and automation; security compliance; network distributed services; storage and backups; and hardware and software problem diagnosis and resolution.
  • Diagnose and troubleshoot technical problems, often of a complex nature, associated with computer hardware and software interrelationships and dependencies.
  • Conduct needs analysis, planning, and scheduling the installation of a wide variety of new or modified hardware/software.
  • Develop functional and technical IT system requirements and specifications. Configure and optimize system tools and applications, to include job schedulers (Slurm and PBSPro) and system resources (GitLab, LUA/TCL modules, and system support applications).
  • Create and brief technical presentations to technical and non-technical stakeholders. Maintain detailed documentation of system configurations, procedures, and troubleshooting guides. Develop user facing documentation.

Job Requirements

  • DoD Top Secret (TS) clearance with SCI eligibility
  • Bachelor's in Computer Engineering, Computer Science, or related field and ten or more years of job related experience.
  • Thorough knowledge of complex concepts, practices, and troubleshooting associated with HPC cluster systems design, installation, and maintenance.
  • Advanced knowledge in distributed computing theory, parallel processing, applications, and associated infrastructure is required.
  • Extensive experience with Linux/Unix systems including installation, configuration, networking, backups, updates and patching, data archiving, and system security.
  • Functional knowledge of HPC middleware, and platform managers such as Bright Cluster Manager; employing job schedulers such as PBS, Slurm, Torque, etc.; and, optimizing job queues.
  • Experience with HPC or large-scale distributed computing environments and technologies such as high-speed low-latency interconnects (e.g. InifiniBand), parallel file systems (e.g. Lustre), and virtualization environments and tools (e.g. VMWare).
  • Experience developing Python/bash/Perl scripts and employing automation frameworks such as Ansible.
  • General knowledge employing Docker containers and Kubernetes ecosystems.
  • Working knowledge in one or more programming languages (e.g. C/C++, Fortran, etc.)



Founded in 1975, AMERICAN SYSTEMS is one of the largest employee-owned companies in the United States. We are a government services contractor focused on delivering Strategic Solutions to complex national priority programs with 100+ locations worldwide. Through our focus on quality, strong cultural beliefs and innovation we deliver excellence every day.

Company Awards:

* Forbes National Best Midsize Companies

* Energage National Best Workplaces, National

* Washington Post Best Workplaces

Veteran Hiring Awards:

* U.S. Department of Labor Hire Vets Medallion

* BEST FOR VETS by Military Times

* TOP 10 MILITARY FRIENDLY COMPANY by MilitaryFriendly.com

AMERICAN SYSTEMS is committed to pay transparency for our applicants and employee-owners. The salary range for this position is $150,000 - $200,000. Actual compensation will be determined based on several factors permitted by law. AMERICAN SYSTEMS provides for the welfare of its employees and their dependents through a comprehensive benefits program by offering healthcare benefits, paid leave, retirement plans (including ESOP and 401k), insurance programs, and education and training assistance.

#CJPOST AMS1


EOE Minorities/Women/Disabled/Veterans/Gender Identity/Sexual Orientation
Applied = 0

(web-5584d87848-7ccxh)