Job Details

DevOps Engineer

NEW YORK-10014, NY, US
01/08/2019

-


Required Skills

    Ruby on Rails
Company

Infinity Consulting Solutions, Inc

Experience

-

Job Description

DevOps Engineer

The volume of data is large: we're working with 7 of the top 20 largest retailers in the world (+ many more not in the top 20), and are ingesting data from them both in a regular batch and in near-real time.

We've carefully selected the types of data to ingest to favor high signal data, so we care deeply about maintaining the correctness and completeness of the data being ingested as our models (and therefore the output of our product).

Your work directly impacts both the predictions we are able to make, and the day to day performance our customers experience when using our product.

Getting more specific, you will:

Design and build complex data pipelines on the Spark platform, ingesting both batch and real time datasets

Work with our data science team to deploy predictive models at scale

Build tools to continuously validate incoming data and proactively identify and communicate data anomalies before they manifest into problems.

We're a small team, so you'll be working on (and be able to meaningfully contribute to) high impact projects from your first day.

Sure, but what's it really like?

Inspired by Basecamp, we work in ~8 week product cycles. First, we work together (engineering + product) to identify the projects we think will have the biggest impact on our company goals. Here's an example of a recent project we conceptualized and delivered over one of these cycles:

Migrate self-managed Spark cluster to EMR

To lower overall cost, and to be able to easily scale to handle bigger datasets and processing volumes, we recently switched from a self managed Spark cluster to Amazon's
Elastic Mapreduce service.

Our challenge was to move terabytes of data used by our clients while having no downtime.
Some initial concerns were the performance on the EMR cluster and the migration process itself because some clients ingest a combination of live data and batch based data.

The scale of our data sent meant that we had to switch several clients at a time to EMR.

After ensuring that Hive backed by S3 was performant enough, we built tools to move vast amounts of data in parallel, to redirect requests to the correct cluster as clients were being moved, and to validate the data after migration.

Along the way, we also had to reshape the data (in terms of partition size) to ensure efficiency of copying and loading into the Hive database.

The Stack:

While we make use of a wide variety of tools, our primary web stack is ES6/React and Ruby on Rails deployed on AWS. We make extensive use of R for statistical analysis, and our primary data stores are Hive, MySQL, and Redis.

What it's like to work here:

On Monday we eat and meet as a team to chat projects and progress.

We move quickly. You build something and the next day it comes to life. You see and feel an immediate impact with the collective efforts of the team.

We're building a company and a team we love. We're in it for the long run.

Qualifications:

5 or more years of experience as a software engineer.

Degree in Computer Science or a deep competency achieved via other means.

Familiarity with Ruby on Rails, AWS, and SQL-based databases.

High standards for code quality and maintainability.

Nice to Have's:

Experience with R, Scala, Spark, and/or Chef.

Consistent record of delivering significant features or building out platforms and
services.

Experience working in e-commerce.


Developer
Information Technology

No Preference
FullTime Job
Other
3

Candidate Requirements
-
-

Walkin Information
-
-
-

Recruiter Details
Dough Klares
1350 Broadway, Suite 2205, NEW YORK-10018, NY, US
-