Archive of stories published by LiveRamp Engineering

Homepage

Open in app

All

LiveRamp in LiveRamp Engineering

Sep 17, 2018

Using Machine Learning to Auto Detect Column Types in Customer Files

Introduction

4 responses

LiveRamp in LiveRamp Engineering

Feb 22, 2016

Tracking MapReduce job performance with counters

At LiveRamp, most of our heavy data processing is done by MapReduce jobs on our Hadoop YARN cluster. Since these jobs are critical to our data processing workflows, one of our top priorities is making sure they run quickly and reliably.

LiveRamp in LiveRamp Engineering

Jun 16, 2011

Java Performance: synchronized() vs Lock

Yesterday, I noticed that one of our systems was using a Lock where a plain old synchronized() block would suffice, and I thought to myself, does this matter? Since the Lock was already fulfilling the same role, the only real question was performance.

1 response

LiveRamp in LiveRamp Engineering

Sep 16, 2019

Joining Petabytes of Data Per Day: How LiveRamp Powers its Matching Product

1 response

LiveRamp in LiveRamp Engineering

Jan 3, 2018

Friday thoughts: fail, fast and furiously

tl;dr: When implementing a service or API, if you get a request you don’t quite understand, the kindest thing you can do is to return a noisy error.

Let’s consider an API like:

GET /mySum?num=3&num=42

LiveRamp in LiveRamp Engineering

Oct 25, 2018

Introducing MockRDD for testing PySpark code

Summary

The LiveRamp Identity Data Science team is excited to share some of our PySpark testing infrastructure in the new open source library mockrdd. This contains the class MockRDD, which mirrors the behavior of PySpark…

1 response

LiveRamp in LiveRamp Engineering

Apr 10, 2014

Reconnecting Thrift Client

Here at LiveRamp, we use make heavy use of Apache Thrift. In some cases, we have Thrift clients in long-running processes. A variety of issues can cause these clients to disconnect, including:

Transient problems with the network

LiveRamp in LiveRamp Engineering

Aug 5, 2019

Distributed Tracing at Massive Scale

When a developer thinks about monitoring and observability of their production application, two things generally come to mind: metrics and logs. While those are really useful for debugging and monitoring purposes, there is still a critical monitoring element that…

LiveRamp in LiveRamp Engineering

May 13, 2019

Migrating a Big Data Environment to the Cloud, Part 1

LiveRamp in LiveRamp Engineering

May 29, 2019

Migrating a Big Data Environment to the Cloud, Part 3

How do we get to the cloud?

These were the top 10 stories published by LiveRamp Engineering; you can also dive into yearly archives: 2007, 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017, 2018, and 2019.

About

LiveRamp Engineering

LiveRamp Engineering Blog

More information