The Creative Analyst
Data Visualization Competition
Aatish Kumar
Executive Summary
 Big data analytics growing globally, unabated
 Prudent to invest in analytics and data science communities
 Vital to separate the wheat from the chaff
 The terms analytics and data science used very loosely
 Need to understand the true nature of the community
 Requires thorough analysis using data analytics tools
 Analytics Vidhya analyzed to assess its suitability for capital investment
Methodology
 Analytics Vidhya provides all the public information via a RESTful API
 Shows commitment to being open and transparent
 Enables easy data access and processing
 A data dump sourced through the API used in this exercise
 100 blog articles published in 2015
 Blog attributes and preview of blog articles
 Analysis done using Python and Tableau
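A minimal sketch of how a data dump like the one described above might be loaded in Python. The record layout is an assumption: the field names `title`, `author`, `published`, and `comment_count` are hypothetical and are not taken from the Analytics Vidhya API.

```python
import json
from collections import Counter
from dataclasses import dataclass
from datetime import date
from typing import List


@dataclass
class Post:
    title: str
    author: str
    published: date
    comments: int


def load_posts(json_text: str) -> List[Post]:
    """Parse a JSON array of blog records into Post objects.

    Field names are hypothetical; adjust them to the real dump.
    """
    posts = []
    for rec in json.loads(json_text):
        posts.append(
            Post(
                title=rec["title"],
                author=rec["author"],
                published=date.fromisoformat(rec["published"]),
                comments=int(rec.get("comment_count", 0)),
            )
        )
    return posts


def posts_per_author(posts: List[Post]) -> Counter:
    """Count how many posts each author contributed."""
    return Counter(post.author for post in posts)
```

Parsing the dates up front keeps the later time-based charts simple, since every record already carries a `date` object.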
Author contributions over time
Blogs and comments over weeks
Blogs and comments cumulative
Blog Categories
Blog Categories - 2
Blog Titles Word Cloud
Blog Text Word Cloud
Blog Tags Word Cloud
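A word cloud is essentially a rendering of term frequencies. The sketch below shows that counting step only, under assumptions: the stopword list is a small illustrative one, and the tokenizer keeps only ASCII letters.

```python
import re
from collections import Counter

# Small illustrative stopword list; a real analysis would use a fuller one.
STOPWORDS = {"the", "a", "an", "and", "of", "to", "in", "for", "is", "on", "with"}


def term_frequencies(texts, top_n=10):
    """Return the top_n most frequent non-stopword terms across texts."""
    counts = Counter()
    for text in texts:
        words = re.findall(r"[a-z]+", text.lower())
        counts.update(w for w in words if w not in STOPWORDS)
    return counts.most_common(top_n)
```

The resulting (term, count) pairs can be passed to any plotting tool that sizes words by frequency.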
Topic Modeling
Link to Dynamic Visualization: CLICK HERE
Overview
 Analytics Vidhya is an active community
 Active publication of blogs
 Active user base to utilize and improve the content via feedback
 Analytics Vidhya provides relevant content
 Category histograms show a broad coverage of topics
 Word clouds and topic models confirm that blog content is relevant
 Analysis based on a small data set from 2015
 Useful to redo the analyses with a bigger and more recent data set
