Lead/Sr. Hadoop Administrator - Data Infrastructure
Turn delivers real-time insights that transform the way leading advertising agencies and enterprises make decisions. Our digital advertising hub enables audience planning, media execution, and real-time analytics from a single login, and provides point-and-click access to more than 150 integrated marketing technology partners. Turn is headquartered in Silicon Valley and provides its products and services worldwide. For more information, visit turn.com or follow@turnplatform.
Data Infrastructure team at Turn is responsible for running very large Hadoop clusters, extremely fast changing RDBMS environments and different types of storage environments. This is an agile environment that provides opportunity to solve complex and scaling challenges.
As part of the Data Infrastructure team, you will be integral part of managing 24X7 Hadoop infrastructure. Currently this environment is at tens of petabytes in size. You will be contributing to improvements in architectural changes, performance improvements and optimization of Hadoop eco-system components. You’ll get the chance to take on complex and interesting problems as part of a fast-paced, highly collaborative team. We've built a complex analytics system around Hadoop ecosystem for scalability and high availability. It's imperative that you approach administration with an emphasis on repeatability, testability and consistency. The demands on this system are increasing rapidly as we grow the user base, as data ingestion grows and add more functionality and products.
Successful candidate for this position will be a self-motivated with attitude of getting things done. Should be able to see big picture and also be able to deep-dive into details to solve complex problems.
- Contribute actively to improve Turn’s Hadoop ecosystem architecture.
- Apply in-depth analysis of hadoop based workload, project-based work, design solutions to issues, and evaluate their effectiveness.
- Develop and maintain operational best practices for smooth operation of large hadoop clusters.
- Design, develop and manage diagnostic & instrumentation tools for troubleshooting & in-depth analysis.
- Optimize and tune the Hadoop environment to meet the performance requirements.
- Partner with Hadoop developers in building best practices for Warehouse and Analytics environment.
- Investigate emerging technologies in hadoop ecosystem that relevant to our needs and implement.
- 3-5+ years plus of hands-on experience in deploying and administering multi petabyte scale Hadoop cluster.
- Strong problem solving and trouble shooting skills.
- Deep understanding of Hadoop design principals and the factors that affect distributed system performance.
- Well versed in installing, administering and managing Hadoop clusters running CDH4, CDH5, YARN, Spark, Cloudera manager.
- Sound understanding with Hadoop ecosystem products like HDFS, map-reduce, Spark, Storm, Pig, Oozie, Zookeeper and Cloudera manager.
- Good scripting experience with at least two of the following: Shell, Python Ruby or Perl.
- Good knowledge in implementing metric collection for monitoring and alerting.
- Hands on experience with automation tool like Puppet.
- BS/MS degree in computer science or related field.
- Good knowledge of Hadoop cluster connectivity and security.
- Linux administration and troubleshooting skills.
- Good knowledge of common ETL packages / libraries and data ingestion.
- Experience in RDBMS, NOSQL databases and Java programming, .
- Knowledge of open source projects like Git, Nagios, TSDB, Docker and OpenStack.
- Experience in hadoop in cloud.
Location: Redwood City, CA
In addition to our great environment, we offer a competitive base salary, bonus program, stock options, employee development programs and other comprehensive benefits. Please send a cover letter along with your resume when applying to the position of interest located at Turn.com. We are an Equal Opportunity Employer. No phone calls and no recruiting agencies, please.