Careers

Careers at Turn

Lead/Sr. Hadoop Administrator - Data Infrastructure

About

Turn delivers real-time insights that transform the way leading advertising agencies and enterprises make decisions. Our digital advertising hub enables audience planning, media execution, and real-time analytics from a single login, and provides point-and-click access to more than 150 integrated marketing technology partners. Turn is headquartered in Silicon Valley and provides its products and services worldwide. For more information, visit turn.com or follow@turnplatform.

Data Infrastructure team at Turn is responsible for running very large Hadoop clusters, extremely fast changing RDBMS environments and different types of storage environments. This is an agile environment that provides opportunity to solve complex and scaling challenges.

Overview

As part of the Data Infrastructure team, you will be integral part of managing 24X7 Hadoop infrastructure. Currently this environment is at tens of petabytes in size. You will be contributing to improvements in architectural changes, performance improvements and optimization of Hadoop eco-system components. You’ll get the chance to take on complex and interesting problems as part of a fast-paced, highly collaborative team.  We've built a complex analytics system around Hadoop ecosystem for scalability and high availability. It's imperative that you approach administration with an emphasis on repeatability, testability and consistency. The demands on this system are increasing rapidly as we grow the user base, as data ingestion grows and add more functionality and products.

Successful candidate for this position will be a self-motivated with attitude of getting things done. Should be able to see big picture and also be able to deep-dive into details to solve complex problems.

Responsibilities

  • Contribute actively to improve Turn’s Hadoop ecosystem architecture.
  • Apply in-depth analysis of hadoop based workload, project-based work, design solutions to issues, and evaluate their effectiveness.
  • Develop and maintain operational best practices for smooth operation of large hadoop clusters.
  • Design, develop and manage diagnostic & instrumentation tools for troubleshooting & in-depth analysis.
  • Optimize and tune the Hadoop environment to meet the performance requirements.
  • Partner with Hadoop developers in building best practices for Warehouse and Analytics environment.
  • Investigate emerging technologies in hadoop ecosystem that relevant to our needs and implement.

Required Skills

  • 3-5+ years plus of hands-on experience in deploying and administering multi petabyte scale Hadoop cluster.
  • Strong problem solving and trouble shooting skills.
  • Deep understanding of Hadoop design principals and the factors that affect distributed system performance.
  • Well versed in installing, administering and managing Hadoop clusters running CDH4, CDH5, YARN, Spark, Cloudera manager.
  • Sound understanding with Hadoop ecosystem products like HDFS, map-reduce, Spark, Storm, Pig, Oozie, Zookeeper and Cloudera manager.
  • Good scripting experience with at least two of the following: Shell, Python Ruby or Perl.
  • Good knowledge in implementing metric collection for monitoring and alerting.
  • Hands on experience with automation tool like Puppet.
  • BS/MS degree in computer science or related field.
  • Good knowledge of Hadoop cluster connectivity and security.

Pluses

  • Linux administration and troubleshooting skills.
  • Good knowledge of common ETL packages / libraries and data ingestion.
  • Experience in RDBMS, NOSQL databases and Java programming, .
  • Knowledge of open source projects like Git, Nagios, TSDB, Docker and OpenStack.
  • Experience in  hadoop in cloud.

Location: Redwood City, CA

In addition to our great environment, we offer a competitive base salary, bonus program, stock options, employee development programs and other comprehensive benefits. Please send a cover letter along with your resume when applying to the position of interest located at Turn.com. We are an Equal Opportunity Employer. No phone calls and no recruiting agencies, please.

#LI-FO1 #GD

Application Data:

company 
careers 
leadsr_hadoop_administrator_data_infrastructure 
path /srv/www/sites/turn-dev.com/dev/repo/build/app 
main_controller app\controllers\Primary 

Request Data:

$_GET
No Data
$_POST
No Data
$_COOKIE
No Data
$_FILES
No Data
$_SERVER
REDIRECT_STATUS 200 
HTTP_HOST turn.stage.elusive-concepts.com 
HTTP_ACCEPT_ENCODING x-gzip, gzip, deflate 
HTTP_USER_AGENT CCBot/2.0 (http://commoncrawl.org/faq/) 
HTTP_ACCEPT text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8 
PATH REMOVED 
SERVER_SIGNATURE Apache/2.4.10 (Linux/SUSE) Server at turn.stage.elusive-concepts.com Port 80 
SERVER_SOFTWARE Apache/2.4.10 (Linux/SUSE) 
SERVER_NAME turn.stage.elusive-concepts.com 
SERVER_ADDR 192.168.1.201 
SERVER_PORT 80 
REMOTE_ADDR 54.81.44.140 
DOCUMENT_ROOT /srv/www/sites/turn-dev.com/prod/webroot 
REQUEST_SCHEME http 
CONTEXT_PREFIX  
CONTEXT_DOCUMENT_ROOT /srv/www/sites/turn-dev.com/prod/webroot 
SERVER_ADMIN roger.soucy@elusive-concepts.com 
SCRIPT_FILENAME /srv/www/sites/turn-dev.com/prod/webroot/index.php 
REMOTE_PORT 49974 
REDIRECT_URL /company/careers/leadsr-hadoop-administrator-data-infrastructure 
GATEWAY_INTERFACE CGI/1.1 
SERVER_PROTOCOL HTTP/1.0 
REQUEST_METHOD GET 
QUERY_STRING  
REQUEST_URI /company/careers/leadsr-hadoop-administrator-data-infrastructure 
SCRIPT_NAME /index.php 
PATH_INFO /company/careers/leadsr-hadoop-administrator-data-infrastructure 
PATH_TRANSLATED redirect:/index.php/company/careers/leadsr-hadoop-administrator-data-infrastructure/careers/leadsr-hadoop-administrator-data-infrastructure 
PHP_SELF /index.php/company/careers/leadsr-hadoop-administrator-data-infrastructure 
REQUEST_TIME_FLOAT 1506099993.827 
REQUEST_TIME 1506099993 

Logs:

Time Data
2017-09-22 17:06:33
Loading Framework...
2017-09-22 17:06:33
app\models\Career:Array
(
    [id] => 0
    [slug] => leadsr-hadoop-administrator-data-infrastructure
)

Events:

Event Data Listeners
APPLICATION >> RUN null 0
APPLICATION >> LOADED null 0
APPLICATION >> HANDOFF null 0
TEMPLATE >> HTML_START "" 0
TEMPLATE >> BEFORE_HTML_END null 1

Errors:

Notice (8) Undefined index: label /srv/www/sites/turn-dev.com/dev/repo/build/tmp/smarty/templates_c/2a6b68665934d429f218f0734699a46d465d234c.file.share-bar.tpl.php L: 28
Notice (8) Trying to get property of non-object /srv/www/sites/turn-dev.com/dev/repo/build/tmp/smarty/templates_c/2a6b68665934d429f218f0734699a46d465d234c.file.share-bar.tpl.php L: 28

Benchmarks:

Benchmark Tag Time Comment
execution_time TIMER_START 0.000ms Starting bootstrap...
execution_time TIMER_STOP 28.505ms Debug console render output...