
System Administrator III

Responsible for ensuring that the company's home-grown test framework (HW & SW) systems are well maintained and operate reliably. Perform regular systems and security monitoring, handle backup procedures, and help with system setup. Develop and maintain relevant procedures and standards. Ensure that hardware and systems work as expected, and maintain archives as needed. Provide systems-related technical support to external manufacturers using this hardware stack. Handle patches and upgrades to keep systems as efficient and effective as possible. Coordinate workflow so that all systems under the role's oversight operate as expected. Provide hands-on system administration, maintenance, and operations support. Recommend software and hardware enhancements. Build, configure, and troubleshoot software and hardware enhancements, application deployments, and infrastructure upgrades. Diagnose and correct computer systems issues.

 

In addition to the above, the candidate will:

  • Administer custom-built server and storage hardware at remote manufacturing sites, from imaging bare-metal systems to debugging installation and operational issues.
  • Develop and fix Python scripts to support manufacturing testing in a Linux environment.
  • Help train remote manufacturing test engineers and factory operators to use and administer the test framework. Document procedures and protocols.
  • Travel up to 20% of the time within the USA and/or Mexico, as required.

Supporting the test framework requires first gaining expertise across the full stack. Ramping up quickly and efficiently depends on strong knowledge of Linux configuration and troubleshooting, system administration, and networking configuration and troubleshooting, along with familiarity with Python (at least enough to read code and apply a hot fix if needed). Familiarity with the software development lifecycle using source code control is a big plus (and expected from a senior-level engineer).

Critical Requirements

  • Linux system administration experience.
  • 3+ years of hands-on experience with Python development for server, storage, and/or network testing.
  • 5+ years of industry experience is preferred.
  • Debugging setup and test failures and finding root causes in a Linux environment.
  • Test execution experience in a Linux environment using open-source tools.
  • Remote test execution and debugging.
  • Setting up automated provisioning pipelines for new test infrastructure:
    - Knowledge of DHCP, TFTP, PXE, and the Linux Kickstart process
  • Troubleshooting network configuration issues:
    - Familiarity with the Linux networking stack and commands
    - Familiarity with VLAN configurations and switch management
  • Troubleshooting system configuration:
    - Strong knowledge of Linux performance troubleshooting
    - Distributed systems knowledge, debugging client/server applications
  • Implementing and debugging hardware tests:
    - Familiarity with Python
    - Familiarity with hardware interfaces (firmware, NIC, PCI, SMART…)

Nice to Have

  • Database Administration (e.g. MySQL/PostgreSQL…)
  • POSIX storage Administration (e.g. NFS/Gluster/Ceph…)
  • Test planning and test case documentation covering issues typically found in large data center (DC) environments is a big plus.
  • Experience debugging hardware (e.g. Electrical Engineering, PCB Troubleshooting)
  • Experience with the manufacturing lifecycle, working with CM/ODM/JDMs

A B.S. is the minimum requirement; a Master's degree is a plus.
