KR183CSA2 - Mid Level Cloud System Administrator - Cleared

Company: NiSUS Technologies Corporation
Job type: Full-time

Senior Linux System Administrator to perform functions in a dynamic and operational environment supporting cloud-based repositories. A successful candidate for this position has experience working with large Hadoop and Accumulo based clusters and interfacing with hardware, software, security, and network teams. Experience with Bash, Python, and configuration management tools such as Puppet preferred. Responsibilities include monitoring systems and resolving or escalating network and data flow alerts for cloud systems hardware, software or network issues. Experience with large Hadoop and Accumulo based clusters sustaining a strong background in troubleshooting operational system issues as they arise required. Performance in a fast-paced mission environment where requirements change on a daily basis due to operational and world events.
Additional responsibilities include the following:
Monitor system health
Troubleshoot system problems
Maintain storage systems
Interface with external teams for hardware, network, and infrastructure support
Provide after-hours on-call/call-in support
Ensure system security requirements are satisfied
Participate in engineering discussions
Create/maintain system scripts
Patch/upgrade system
Administer user accounts
Maintain hardware inventory
Requirements
TS/SCI with poly required
Three (3) years of experience is required.
A Bachelor’s Degree in Engineering, Systems Engineering, Computer Science, or Mathematics is highly desired and will be considered equivalent to two (2) years of experience.
Hadoop/Cloud System Administrator Certification or comparable Cloud System/Service Certification is required.
Shall have at least three (3) years of experience performing system administration and monitoring large distributed systems
Shall have experience diagnosing and troubleshooting large-scale cloud computing systems, including familiarity with distributed systems for storage and retrieval of data, e.g., Hadoop, Cassandra, Scality, Swift, Gluster, Lustre, GPFS, Amazon S3, or other comparable technology for big data management or High-performance computing.
Shall have demonstrated ability to work within a pre-defined focused team structure, follow SOPs, communicate effectively, accept constructive feedback, and receive technical guidance and advice from senior-level technical resources.
Shall have demonstrated a willingness to learn new technologies and leverage senior-level resources to expand the current technical foundation using team structure.
Demonstrated ability to work independently on complex tasks and show a willingness to educate and train more junior technical resources.
Demonstrated ability to plan, communicate, lead, and oversee complex technical tasks requiring interaction with multiple groups.
Shall have five (5) years of experience writing software scripts using scripting languages, including bash, perl, or python.
Shall have seven (7) years of experience demonstrating a fundamental understanding and working knowledge of the Linux operating system's core components, including managing user and group accounts in LDAP configuration of DHCP, DNS, and TFTP.
Shall have demonstrated experience with configuration management tools, including Puppet and SALT.
Expert understanding of the end-to-end Linux PXE/Network provisioning process, including familiarity with Anaconda Kickstart configurations, RAID controller utilities, TFTP images, and disk detect scripts.
Experience accessing and troubleshooting systems via remote utilities to diagnose hardware and repair, including VNC, serial over LAN interfaces, and IPMI, BIOS-level configuration.
Understanding of overall corporate architecture and familiarity with OpenSSL and Java keystore manipulation.
Expert in troubleshooting commodity hardware platforms, including previous experiences with SGI/HP hardware, including SGI’s J series.
Desired:
Advanced knowledge of SSH tunneling and protocols, including implementing dynamic SOCKS proxies and other SSH-based utilities, including rysn, pdsh, pdcp, and WinSCP.
Basic understanding of low-level network concepts including vlans, port channel bonding and layer2/layer 3 switch interactions.
Familiarity with software load balancers for large scale webservice implementations including HAProxy and Nginx
Experience with Kubernetes orchestration services and Docker images.
Experience with log aggregation and search tools including ElasticSearch, logstatsh, filebeats, Grafana, and rsyslog.
Benefits
Health & Life Insurance
Dental Insurance
Disability Insurance
401K Retirement Plan with Matching
Tuition Assistance
Vacation and Sick Leave
Hiring Bonuses
Referral Recruitment Program

Apply for this job