1. Real-time bidding (RTB) is a programmatic advertising method where automated ad buying and selling occurs within milliseconds of an ad impression.
2. The document discusses Reelbid, a demand-side platform that handles RTB. It proposes using sampling and hierarchical techniques from a research paper to more efficiently select bids and determine pricing to improve return on investment.
3. The goal is to design an architecture to handle millions of queries per second for real-time bidding while efficiently scaling without adding nodes to reduce costs.
1 of 23
Download to read offline
More Related Content
Reel bid insightd-eproject
1. Reelbid
Real time ad bidding DSP
( programmatic)
MUDIT UPPAL
INSIGHT DATA ENGINEERING, SV
4. Demand side platform
- Reelbid
Mashable.com
(publisher with
inventory)
Sell Side Platform
OpenRTB
server
Auction
service
Ad
exchange
OpenRTB
client
Bidder
service
Data management
platform
Data broker
Some
advertiser
5. Motivation
Really interesting & new* data engineering challenges
Current State of the art
Googles open-bidder still in alpha(not for public)
In 2014, according to Business Insider Intelligence, ad revenue topped $15 billion. Real-time bidding, and in
particular, mobile and video real-time bidding, lead the way for that growth. Business Insider Intelligence estimates
say RTB revenue will pass $26 billion by the end of 2020, which far surpasses the $11.7 billion from this year.
( *Disclaimer: NO background in Ad tech )
Latency ( 100 ms )
Queries Per Second (~2
- 3 million bids/second)
+ Infrequent bids
7. Goals
- Create a more efficient bidding framework using sampling and hierarchical
techniques (as described in the paper)
- Ways to Scale without adding nodes and save costs
10. A different approach to RTB
A paper published by in IEEE for Data Mining--
(source of truth)
Instead of comparing bid requests with user
profile<U,P> we look at bid requests only.
Compensate computational resources with price
(and not accept and bid on everything)
ROI with Gaussian/Gamma models are much better
21. Challenges
- To process that file in 80 ms
- Throttling issues
- Networking challenges
- Programming/debugging/testing in a distributed environment is a very time-
consuming task
22. About me
Surf, Football and micro-controllers [ Building products ]
Data Scientist, Planetary/Fusion Network Ltd (NY)
MS Comp Science, Media, Business Mgmt - University of California, Santa Barbara
MUDIT UPPAL
23. Learning at Insight: Learning by doing
Before Insight: Batch processing, Data Science Analytics
After Insight: Stream processing [ Spark streaming, Storm, Kafka, Cassandra ] +
learning from feedback, the right tool for the right job, More debugging/testing in a
distributed computing with polyglot approach + Thinking asynchronously