Internet Brands

Clickstream ingestion system

Our client wanted to capture user activity across the many websites they operate. A clickstream solution (JS snippet, API, message bus) seemed like a straight-forward solution to implement.

Table of Contents

Share this article

Client

Internet Brands

Tech Stack

JavaScript, Hadoop, Apache Spark, Kafka, Apache Hive

The Problem

Our client wanted to capture user activity across the many websites they operate. A clickstream solution (JS snippet, API, message bus) seemed like a straight-forward solution to implement. The catch? They anticipated around 2000 events per second during peak times.

Our Solution

We built a simple solution with high-scale needs baked in. Our JS snippet was designed to work in an extensive range of browsers (with automated testing via BrowserStack to boot), under no circumstances impeding user experience. Each component in the stack was clustered for redundancy and to support horizontal scaling. Components were observable, allowing us to monitor the performance and health of the system constantly.

Result

The collection of this information opened a window to cross website user activity. This allowed for the creation of user segments for marketing purposes, and for various personalised user experience efforts to get underway.

Your vision deserves a great dev team.

We're not about tech jargon or over-promising. Instead, we focus on clear communication, transparency in our process, and delivering results that speak for themselves.

Awards won by Hivekind