1. Evaluation methods for recommendation systems include experimental design with A/B testing, as well as offline evaluation on logged user data using metrics like predictive accuracy, recall, and replay-match methods.
2. Many recommendation system evaluations focus only on predictive accuracy rather than overall user utility or business objectives. Multi-objective optimization aims to maximize multiple goals like clicks, revenue, and engagement.
3. Challenges in recommendation system evaluation and optimization include multi-position optimization, exploration-exploitation tradeoffs for large item pools, constructing user profiles with sparse data, incorporating social information and whole-page effects, and multi-context learning.
This document discusses active learning and how to design active learning units. It defines active learning as learning that involves students and makes the subject matter relevant to their own purposes. Active learning facilitates participation and whole-person involvement in the learning process. When designing active learning units, teachers should choose topics, design tasks leading to a final project, and state the learning aims and language functions students will develop. Active learning reinforces content, develops teamwork skills, and improves student retention when incorporated into an enjoyable learning environment.
This document provides an overview and table of contents for the book "SQL Server Interview Questions and Answers". The book contains questions and answers on various SQL Server topics organized into chapters such as common generic questions, developer questions, tricky questions, SQL Server 2008 questions, data warehousing questions, and best practices. It is authored by Pinal Dave and Vinod Kumar and published by SQLAuthority.com and ExtremeExperts.com.
Es un organismo asesor conformado por un numero igual de representantes del empleador y de los trabajadores con sus respectivos suplentes, cuyas funciones principales son:
Establecer estrategias de conciliaci¨®n entre las partes cuando exista una queja
Promover actividades preventivas de acoso laboral para proteger a los trabajadores contra los riesgos psicosociales que afecten su salud en los lugares de trabajo
El documento describe los diferentes tipos de extintores, incluyendo agua, espuma, di¨®xido de carbono y polvo qu¨ªmico seco, y c¨®mo cada uno es adecuado para diferentes clases de fuego. Tambi¨¦n explica c¨®mo usar un extintor correctamente, como descolgarlo, quitar el seguro, apuntar a la base de las llamas en zigzag y nunca dar la espalda al fuego. Adem¨¢s, proporciona consejos sobre la prevenci¨®n de incendios como almacenar materiales de forma segura y no fumar en ¨¢reas de trabajo
Presentar conceptos b¨¢sicos a desarrollar en el dise?o e implementaci¨®n del Sistema de Gesti¨®n
?Qu¨¦ es la Seguridad y Salud en el Trabajo (SST)?
?En qu¨¦ consiste el Sistema de Gesti¨®n de la Seguridad y Salud en el Trabajo (SG-SST)?
Marco Normativo
Obligaciones de los Empleadores
Obligaciones de las Administradoras de Riesgos Laborales (ARL)
Responsabilidades de los Trabajadores
Elaboraci¨®n y conservaci¨®n de la documentaci¨®n del SG-SST
?QU? PASOS DEBO SEGUIR PARA IMPLEMENTAR EL SG-SST?
How do you create a content strategy and then execute content assets throughout the buying cycle? In this session, Curt will show you how his team at MasterControl has developed a process for content creation, and how that content is found and delivered
to prospects and customers. In this session, you¡¯ll learn:
? Where the content marketing strategy starts and getting buy-in
? What types of content works best at different parts of the buying cycle
? The best way to integrate content marketing into the entire marketing strategy
Optify: how to price your digital marketing servicesOptify
?
This document discusses pricing methods for digital marketing services. It presents four common pricing models: time and materials, fixed bid projects, retainers, and value-based pricing. For each method, it outlines the description, pros, cons, and typical opportunity types where each method may apply. The document provides examples and considerations to help digital marketing agencies determine the most appropriate pricing approach for different types of client engagements and projects.
Affect Labs is developing a system to analyze emotions in online conversations to help brands understand sentiment about them. They plan to launch the product and make money through a freemium model allowing customers to try it without risk. They aim to reach ?100k in revenue in the first year and become profitable after three years. They are seeking seed funding to launch, promote and sell their product to solve the problem of information overload.
This document discusses metrics for startups and how to make them actionable. It recommends establishing baselines and targets for each stage of the customer lifecycle from awareness to loyalty. Both quantitative metrics like revenue and qualitative metrics from surveys are suggested. The document emphasizes segmenting data, testing hypotheses, and being willing to pivot based on metrics. The key is finding the right metrics to understand customer behavior and drive business decisions.
Startup Metrics for Pirates (Nov 2012)Dave McClure
?
This document summarizes Dave McClure's presentation on startup metrics and building a minimum viable product (MVP). The key points are:
1) McClure introduces the AARRR framework for measuring startup success - Acquisition, Activation, Retention, Referral, and Revenue.
2) To build an MVP, focus on understanding customer problems and testing solutions using measurable conversion metrics like usage, signups, retention, and referrals.
3) An MVP should start with the minimum needed to validate a problem and solution with customers through iterative testing and optimization. The goal is product/market fit before worrying about features or launch.
Michael Goguen, Sequoia Capital: Think Big, Start SmallDanuta Pysarenko
?
1. Sequoia Capital is a 40-year-old venture capital firm that has invested in over 820 companies, including 150 IPOs and 150 acquisitions.
2. When evaluating startups, Sequoia looks for companies addressing large, disruptive markets; with unique, differentiated products that deeply alleviate customer pain; and exceptional founding teams focused on product excellence.
3. Sequoia advises startups to think ambitiously about market and company potential but to start small with minimal headcount, spending, and time to market in order to maximize efficiency and focus.
The document discusses startup metrics and focuses on the AARRR model, which stands for Acquisition, Activation, Retention, Referral, and Revenue. It emphasizes optimizing the customer experience and prioritizing the most critical metrics like usage, retention, and revenue. The overall message is that startups should iterate quickly, test different metrics and conversion funnels, and optimize for both user happiness and business outcomes.
Attribution SESsf 2012 by @stevelatham - Encore Media MetricsEncore Media Metrics
?
The document discusses multi-touch attribution, which is an approach to measuring marketing performance that accounts for the fact that customers are influenced by multiple touchpoints. It outlines some of the limitations of traditional last-click attribution models. The key aspects of multi-touch attribution covered include analyzing both click and impression data, statistically modeling interactions to attribute credit across channels, and producing reports that show engagement paths and optimized cost per action by channel based on attribution.
This document provides an overview of digital marketing topics including: popular 2012 Google searches like Whitney Houston; Facebook penetration in Australia is 55%; 5.1 billion people own mobile phones. It then covers digital marketing 101, focusing on search engine marketing, optimization, and strategies. Mobile search and social media are also discussed, emphasizing their importance for engagement, relationships, and cost effectiveness. Various industry terms are defined.
The document discusses establishing a new business called Re:Quest that would act as a content agent connecting prosumers (users who both produce and consume content) and consumers. It would develop a point-based system allowing micropayments for original artwork, skills training, and targeted advertising. Financial projections estimate annual revenue of 46.8 million yen from sources like referral fees, advertising, content sales, and education services.
John Sumser provides analysis and consulting on electronic recruiting strategies. He publishes an annual Electronic Recruiting Index that analyzes trends, opportunities, surveys, rankings, and the performance of individual sites. Sumser also holds industry outbriefings in major cities to discuss results and implications, posting techniques, investment strategies, driving trends, forecasts, and market directions. The recruiting industry is maturing with rising candidate acquisition costs, declining availability, and increasing hours needed per placement. Companies must focus on training, developing an online presence, measuring results, and having a clear strategic plan for electronic recruiting.
Customer Development 2: Three types of marketsVenture Hacks
?
This document discusses different types of markets that startups can enter: existing markets, resegmented markets, and new markets. It defines each market type and provides examples. Existing markets involve providing a faster or better solution than incumbents. Resegmented markets involve targeting unserved customer niches or a lower price point. New markets create brand new customer needs. The document notes that the type of market affects factors like sales models, costs, risks, and challenges. It emphasizes that understanding the market type is crucial for startups to select the right customer development strategies.
This presentation on Attribution and Media Measurement was made at the 2012 Digital World Expo. For an electronic copy please email info@encoremetrics.com
UX & ROI: What to measure and what to expectcxpartners
?
Sharing experiences of UX in the world of eCommerce.
What metrics to look at, how much of an increase can you expect, real numbers, benchmarking, business case justification and much more.
The document discusses interactive digital ads and the case for using analytics to measure their performance. It provides examples of key features of interactive ads, such as including visual, computational, and video elements. It also discusses best practices for interactive ad design and metrics for tracking user engagement, including dwell time, goal completion rates, and click-through rates. Finally, it addresses considerations for using interactive marketing in Vietnam and outlines an integrated marketing lifecycle framework.
The document discusses the future of technology in recruitment. It describes how Alan Whitford has over 20 years of experience in starting up and expanding high-tech businesses in Europe. He founded companies that develop web-based recruitment solutions and has clients including large companies and HR software vendors. The document then discusses how digital advertising spending and the use of videos, social networks, and mobile devices are evolving recruitment practices.
This document discusses revenue streams and how to determine a startup's revenue model, including how to estimate how many products or services will sell, where revenue will come from, how to set pricing, and whether the total revenue will cover costs and generate a profitable business. It provides examples of revenue models, pricing strategies, and considerations for forecasting revenue from new and existing markets. The goal is to help startups develop financial projections and determine if their business concept is viable.
Six Sigma is a statistical methodology for improving process quality by reducing defects. It aims to reduce variation and improve customer satisfaction by focusing on key inputs that drive outputs. The Six Sigma methodology involves defining problems in terms of defects, measuring the current process performance, analyzing relationships between inputs and outputs, improving the process by experimenting with key inputs, and controlling the process to sustain improvements.
Kimikazu Kato is the Chief Scientist at Silver Egg Technology, which provides recommender system and online advertising services. He has a PhD in computer science and experience in areas like computer graphics and parallel computing. Silver Egg uses a real-time recommender platform called Aigent Suite to consistently target users from initial visits to retention. The system analyzes user behavior data to determine personalized recommendations and ad targeting. While collaborative filtering and matrix factorization are common recommendation algorithms, approaches need adjustments for sales recommendations versus movie ratings. Consulting is also important for tuning algorithm parameters to specific business needs.
Is Gamification legit or a leap of faith? Presentation investigates whether (or not) there is real business value by looking at case studies, market data, enterprise use cases and best practices. Includes user adoption impact and potential to improve brand loyalty and brand visibility.
More Related Content
Similar to Recommender Systems Tutorial (Part 4) -- Challenges (20)
Affect Labs is developing a system to analyze emotions in online conversations to help brands understand sentiment about them. They plan to launch the product and make money through a freemium model allowing customers to try it without risk. They aim to reach ?100k in revenue in the first year and become profitable after three years. They are seeking seed funding to launch, promote and sell their product to solve the problem of information overload.
This document discusses metrics for startups and how to make them actionable. It recommends establishing baselines and targets for each stage of the customer lifecycle from awareness to loyalty. Both quantitative metrics like revenue and qualitative metrics from surveys are suggested. The document emphasizes segmenting data, testing hypotheses, and being willing to pivot based on metrics. The key is finding the right metrics to understand customer behavior and drive business decisions.
Startup Metrics for Pirates (Nov 2012)Dave McClure
?
This document summarizes Dave McClure's presentation on startup metrics and building a minimum viable product (MVP). The key points are:
1) McClure introduces the AARRR framework for measuring startup success - Acquisition, Activation, Retention, Referral, and Revenue.
2) To build an MVP, focus on understanding customer problems and testing solutions using measurable conversion metrics like usage, signups, retention, and referrals.
3) An MVP should start with the minimum needed to validate a problem and solution with customers through iterative testing and optimization. The goal is product/market fit before worrying about features or launch.
Michael Goguen, Sequoia Capital: Think Big, Start SmallDanuta Pysarenko
?
1. Sequoia Capital is a 40-year-old venture capital firm that has invested in over 820 companies, including 150 IPOs and 150 acquisitions.
2. When evaluating startups, Sequoia looks for companies addressing large, disruptive markets; with unique, differentiated products that deeply alleviate customer pain; and exceptional founding teams focused on product excellence.
3. Sequoia advises startups to think ambitiously about market and company potential but to start small with minimal headcount, spending, and time to market in order to maximize efficiency and focus.
The document discusses startup metrics and focuses on the AARRR model, which stands for Acquisition, Activation, Retention, Referral, and Revenue. It emphasizes optimizing the customer experience and prioritizing the most critical metrics like usage, retention, and revenue. The overall message is that startups should iterate quickly, test different metrics and conversion funnels, and optimize for both user happiness and business outcomes.
Attribution SESsf 2012 by @stevelatham - Encore Media MetricsEncore Media Metrics
?
The document discusses multi-touch attribution, which is an approach to measuring marketing performance that accounts for the fact that customers are influenced by multiple touchpoints. It outlines some of the limitations of traditional last-click attribution models. The key aspects of multi-touch attribution covered include analyzing both click and impression data, statistically modeling interactions to attribute credit across channels, and producing reports that show engagement paths and optimized cost per action by channel based on attribution.
This document provides an overview of digital marketing topics including: popular 2012 Google searches like Whitney Houston; Facebook penetration in Australia is 55%; 5.1 billion people own mobile phones. It then covers digital marketing 101, focusing on search engine marketing, optimization, and strategies. Mobile search and social media are also discussed, emphasizing their importance for engagement, relationships, and cost effectiveness. Various industry terms are defined.
The document discusses establishing a new business called Re:Quest that would act as a content agent connecting prosumers (users who both produce and consume content) and consumers. It would develop a point-based system allowing micropayments for original artwork, skills training, and targeted advertising. Financial projections estimate annual revenue of 46.8 million yen from sources like referral fees, advertising, content sales, and education services.
John Sumser provides analysis and consulting on electronic recruiting strategies. He publishes an annual Electronic Recruiting Index that analyzes trends, opportunities, surveys, rankings, and the performance of individual sites. Sumser also holds industry outbriefings in major cities to discuss results and implications, posting techniques, investment strategies, driving trends, forecasts, and market directions. The recruiting industry is maturing with rising candidate acquisition costs, declining availability, and increasing hours needed per placement. Companies must focus on training, developing an online presence, measuring results, and having a clear strategic plan for electronic recruiting.
Customer Development 2: Three types of marketsVenture Hacks
?
This document discusses different types of markets that startups can enter: existing markets, resegmented markets, and new markets. It defines each market type and provides examples. Existing markets involve providing a faster or better solution than incumbents. Resegmented markets involve targeting unserved customer niches or a lower price point. New markets create brand new customer needs. The document notes that the type of market affects factors like sales models, costs, risks, and challenges. It emphasizes that understanding the market type is crucial for startups to select the right customer development strategies.
This presentation on Attribution and Media Measurement was made at the 2012 Digital World Expo. For an electronic copy please email info@encoremetrics.com
UX & ROI: What to measure and what to expectcxpartners
?
Sharing experiences of UX in the world of eCommerce.
What metrics to look at, how much of an increase can you expect, real numbers, benchmarking, business case justification and much more.
The document discusses interactive digital ads and the case for using analytics to measure their performance. It provides examples of key features of interactive ads, such as including visual, computational, and video elements. It also discusses best practices for interactive ad design and metrics for tracking user engagement, including dwell time, goal completion rates, and click-through rates. Finally, it addresses considerations for using interactive marketing in Vietnam and outlines an integrated marketing lifecycle framework.
The document discusses the future of technology in recruitment. It describes how Alan Whitford has over 20 years of experience in starting up and expanding high-tech businesses in Europe. He founded companies that develop web-based recruitment solutions and has clients including large companies and HR software vendors. The document then discusses how digital advertising spending and the use of videos, social networks, and mobile devices are evolving recruitment practices.
This document discusses revenue streams and how to determine a startup's revenue model, including how to estimate how many products or services will sell, where revenue will come from, how to set pricing, and whether the total revenue will cover costs and generate a profitable business. It provides examples of revenue models, pricing strategies, and considerations for forecasting revenue from new and existing markets. The goal is to help startups develop financial projections and determine if their business concept is viable.
Six Sigma is a statistical methodology for improving process quality by reducing defects. It aims to reduce variation and improve customer satisfaction by focusing on key inputs that drive outputs. The Six Sigma methodology involves defining problems in terms of defects, measuring the current process performance, analyzing relationships between inputs and outputs, improving the process by experimenting with key inputs, and controlling the process to sustain improvements.
Kimikazu Kato is the Chief Scientist at Silver Egg Technology, which provides recommender system and online advertising services. He has a PhD in computer science and experience in areas like computer graphics and parallel computing. Silver Egg uses a real-time recommender platform called Aigent Suite to consistently target users from initial visits to retention. The system analyzes user behavior data to determine personalized recommendations and ad targeting. While collaborative filtering and matrix factorization are common recommendation algorithms, approaches need adjustments for sales recommendations versus movie ratings. Consulting is also important for tuning algorithm parameters to specific business needs.
Is Gamification legit or a leap of faith? Presentation investigates whether (or not) there is real business value by looking at case studies, market data, enterprise use cases and best practices. Includes user adoption impact and potential to improve brand loyalty and brand visibility.
2. Evaluation Methods
? Ideal method
¨C Experimental Design: Run side-by-side experiments on a small
fraction of randomly selected traffic with new method (treatment)
and status quo (control)
¨C Limitation
? Often expensive and difficult to test large number of methods
? Problem: How do we evaluate methods offline on logged
data?
¨C Goal: To maximize clicks/revenue and not prediction accuracy on
the entire system. Cost of predictive inaccuracy for different
instances vary.
? E.g. 100% error on a low CTR article may not matter much
because it always co-occurs with a high CTR article that is
predicted accurately
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 2
3. Usual Metrics
? Predictive accuracy
¨C Root Mean Squared Error (RMSE)
¨C Mean Absolute Error (MAE)
¨C Area under the Curve, ROC
? Other rank based measures based on retrieval accuracy for top-k
¨C Recall in test data
? What Fraction of items that user actually liked in the test data were
among the top-k recommended by the algorithm (fraction of hits, e.g.
Karypsis, CIKM 2001)
? One flaw in several papers
¨C Training and test split are not based on time.
? Information leakage
? Even in Netflix, this is the case to some extent
¨C Time split per user, not per event. For instance, information may leak if
models are based on user-user similarity.
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 3
4. Metrics continued..
? Recall per event based on Replay-Match method
¨C Fraction of clicked events where the top recommended item
matches the clicked one.
? This is good if logged data collected from a randomized
serving scheme, with biased data this could be a problem
¨C We will be inventing algorithms that provide recommendations that
are similar to the current one
? No reward for novel recommendations
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 4
5. Details on Replay-Match method (Li, Langford, et al)
? x: feature vector for a visit
? r = [r1,r2,¡,rK]: reward vector for the K items in inventory
? h(x): recommendation algorithm to be evaluated
? Goal: Estimate expected reward for h(x)
? s(x): recommendation scheme that generated logged-data
? x1,..,xT: visits in the logged data
? rti: reward for visit t, where i = s(xt)
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 5
6. Replay-Match continued
? Estimator
? If importance weights
and
¨C It can be shown estimator is unbiased
? E.g. if s(x) is random serving scheme, importance weights
are uniform over the item set
? If s(x) is not random, importance weights have to be
estimated through a model
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 6
7. Back to Multi-Objective Optimization
EDITORIAL
Recommender AD SERVER
?Clicks on FP links influence PREMIUM display
downstream supply distribution
(GUARANTEED)
content
Spot Market (Cheaper)
Downstream
engagement
(Time spent)
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 7
8. Serving Content on Front Page: Click Shaping
? What do we want to optimize?
? Current: Maximize clicks (maximize downstream supply from FP)
? But consider the following
¨C Article 1: CTR=5%, utility per click = 5
¨C Article 2: CTR=4.9%, utility per click=10
? By promoting 2, we lose 1 click/100 visits, gain 5 utils
? If we do this for a large number of visits --- lose some clicks but obtain
significant gains in utility?
¨C E.g. lose 5% relative CTR, gain 40% in utility (revenue, engagement, etc)
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 8
9. Why call it Click Shaping?
other
buzz
other video autos finance
buzz finance videogames gmy.news
video tv health
autos
videogames gmy.news
health travel hotjobs
tv
hotjobs tech movies
travel
new.music
movies
tech sports
new.music
?AFTER
?BEFORE shopping
sports news
shine
rivals
shopping omg
news realestate
shine
10.00%
rivals
omg 8.00%
realestate 6.00%
4.00%
2.00%
?Supply distribution 0.00%
ts
?Changes
ing
bs
s
omg
s
te ch
-2.00%
e
tv
ie s
s
s
r
o
buzz
l
s tat e
ic
ce
th
es
tra ve
othe
aut o
shin
rival
new
.ne w
vide
spor
.mus
hotjo
heal
finan
gam
mov
-4.00%
p
shop
rea le
gmy
-6.00%
new
o
vide
-8.00%
-10.00%
?SHAPING can happen with respect to any downstream metrics (like engagement)
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 9
10. Multi-Objective Optimization
K properties m user segments
n articles
?A1 ?S1
?news
?A2 ?S2
?finance
?¡ ?¡ ?¡
?omg
?An ?Sm
? CTR of user segment i on article j: pij
? Time duration of i on j: dij
?10
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 10
11. Multi-Objective Program
? Scalarization
Goal Programming
Simplex constraints on xiJ is always applied
Constraints are linear
Every 10 mins, solve x
Use this x as the serving scheme in the next 10 mins
11
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 11
13. Summary
? Modern recommendation systems on the web crucially depend on
extracting intelligence from massive amounts of data collected on a
routine basis
? Lots of data and processing power not enough, the number of things
we need to learn grows with data size
? Extracting grouping structures at coarser resolutions based on
similarity (correlations) is important
¨C ML has a big role to play here
? Continuous and adaptive experimentation in a judicious manner crucial
to maximize performance
¨C Again, ML has a big role to play
? Multi-objective optimization is often required, the objectives are
application dependent.
¨C ML has to work in close collaboration with engineering, product &
business execs
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 13
15. Recall: Some examples
? Simple version
¨C I have an important module on my page, content inventory is
obtained from a third party source which is further refined through
editorial oversight. Can I algorithmically recommend content on this
module? I want to drive up total CTR on this module
? More advanced
¨C I got X% lift in CTR. But I have additional information on other
downstream utilities (e.g. dwell time). Can I increase downstream
utility without losing too many clicks?
? Highly advanced
¨C There are multiple modules running on my website. How do I take
a holistic approach and perform a simultaneous optimization?
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 15
16. For the simple version
? Multi-position optimization
¨C Explore/exploit, optimal subset selection
? Explore/Exploit strategies for large content pool and high
dimensional problems
¨C Some work on hierarchical bandits but more needs to be done
? Constructing user profiles from multiple sources with less
than full coverage
¨C Couple of papers at KDD 2011
? Content understanding
? Metrics to measure user engagement (other than CTR)
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 16
17. Other problems
? Whole page optimization
¨C Incorporating correlations
? Incentivizing User generated content
? Incorporating Social information for better recommendation
? Multi-context Learning
Deepak Agarwal & Bee-Chung Chen @ ICML¡¯11 17