Senior Data Scientist Engineer - Data Pipeline - Data Science/AI/Machine Learning

Matthew Allen Principal Recruiter

Job Info

Category Data Science/AI/Machine Learning
Employment Type Full-Time Employment
Compensation $0.00 - $0.00
Location United States, CA - 94301

Client Introduction

We’ve built the most scalable and performant real-time platform that has ever existed. This platform has become the basis for the cloud of the future, powering hyper-responsive collaborative apps on billions of mobile devices worldwide with unparalleled speed and efficiency. As a benchmark, our platform already powers one such use case with more than 3 million simultaneous mobile connections at a 0.2-second response time.

This is the largest real-time concurrent interactive application suite ever built; nothing else comes close.

Fast facts: our Engineering team is composed of industry-leading talent from Google, Netflix, LinkedIn, IBM Watson, Twitter, Microsoft, and Yahoo, to name but a few. Our President has been responsible for building the largest revenue-generating SaaS applications in the world, grossing many billions of dollars annually. Our Vice President of Platform Engineering is a Google alumnus and joined the company pre-IPO.

We can neither confirm nor deny it, but it’s been rumored that the company produces nearly one billion dollars in revenue annually and has become one of the fastest companies in history to reach a $10 billion valuation.

Interested in making history? Interested in building a platform that literally aims to connect every device in the world? Please get in touch.

Job Description

With massive-scale global systems feeding our teams large streams of real-time data, we’re looking for a sharp Data Science Engineer to turn true big data into actionable insights.

As a Data Science Engineer, you’ll play a crucial role in building out a revolutionary, next-generation real-time analytics system. You’ll work closely with the R&D team to turn theoretical and strategic goals into technical products that push the boundaries of analytics, machine learning, and live operations.

Job Responsibilities

• Design and develop high-performance, data-driven systems in a distributed infrastructure.

• Research and develop algorithms to analyze high-velocity streaming data.

• Design and evaluate experiments to monitor key product metrics and understand the root causes of changes in performance.

• Build user behavior models to guide product roadmaps.

• Design experiments to answer targeted questions.

• Understand ecosystems, user behaviors, and long-term trends.

• Translate strategic requirements into usable tools and infrastructure.

• Identify and promote best practices across the organization.

• Implement large-scale data management and transformation systems.

Qualifications


• You love building new products and can take an idea from inception through development to production.

• You have a deep research interest in developing novel statistical modeling and machine learning algorithms.

• You’re a team player and an effective communicator.

• Familiarity/experience with machine learning models, statistical modeling and validation.

• Understanding of modern machine learning techniques (e.g., classification, clustering).

• Applied experience with distributed machine learning and computing frameworks (e.g., Spark).

• 3+ years of experience with Python, R, Java, or Scala.

• Experience putting machine learning models into production is a big plus.

• Experience with Python, R, Java, Scala, Shell scripting, Spark, Kafka, Storm, Hadoop, Hive, Presto, MySQL, Redis, Cassandra, Elasticsearch.

Required Experience

Required Education

• BS or MS degree in Computer Science or a related technical field. Advanced degree in Computer Science (with a Data Mining/Machine Learning emphasis) or equivalent experience would be a plus.
