Who we want
The ideal candidate is a Linux system administrator who also codes, with hands-on experience supporting highly available production systems and a passion for writing tools that manage infrastructure reliably.
Responsibilities:
- Architect and set up Linux systems operations
- Define and implement systems monitoring and alerting strategies
- Implement and manage large-scale distributed systems
- Design, implement and maintain our system provisioning framework
- Plan and coordinate systems maintenance to ensure minimal customer impact
- Create operational documentation and troubleshooting guidelines
- Participate in the on-call rotation
- Implement scaling best practices
- Design and write automation scripts
- Conduct design and code reviews
Technologies We Use:
The more of these you know, the better:
- Environments: Linux, JVM, AWS
- Languages: Ruby, Python, Perl, Java, Bash
- Persistence: MySQL, AWS Database Services, Cassandra
- Systems Operations: Puppet, AWS CloudFormation, Bash
- Caching: memcached, ehcache, CDN
- Principles: REST, DRY, TDD, Agile
Skills and experience necessary for the role
Requirements:
- Good scripting skills in at least one of Ruby, Python, Perl, Java, Bash, or JavaScript
- Minimum 3 years administering production Linux systems in a 24x7 environment
- Experience managing distributed systems spanning multiple AWS regions or data centers
- Experience with cloud computing platforms (AWS, Rackspace Cloud, GCE)
- Experience with continuous integration (e.g., Jenkins)
- Experience with automated systems provisioning (Puppet, Capistrano, Chef, CFEngine, RightScale)
- Solid understanding of common security principles
- Familiarity with version control (SVN, Git)
Nice to Haves:
- Live and breathe performance
- Distributed database experience, preferably Cassandra
- Experience with bootstrapping solutions (PXE, Kickstart, Anaconda, AWS user data)
- Open source contributions
- BS or MS in computer science or related field