The document summarizes an analysis of the Analytics Vidhya data science community performed using blog post data from 2015. Key findings include that Analytics Vidhya has an active user base publishing blogs across a broad range of topics, and receives user feedback on content. Word clouds and topic modeling of blog texts confirm the relevance of the content. A larger, more recent dataset could provide more insights.
2. Executive Summary
Big data analytics continues to grow globally
Prudent to invest in analytics and data science communities
Vital to separate the wheat from the chaff
The terms "analytics" and "data science" are used very loosely
Need to understand the true nature of the community
Requires thorough analysis using data analytics tools
Analytics Vidhya analyzed to assess its suitability for capital investment
3. Methodology
Analytics Vidhya exposes all of its public information via a RESTful API
Shows commitment to being open and transparent
Enables easy data access and processing
A data dump sourced through the API was used in this exercise
100 blog articles published in 2015
Blog attributes and preview of blog articles
Analysis done using Python and Tableau
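A minimal Python sketch of the first processing step described above: parsing a JSON data dump of blog records and counting posts per category. The field names (`title`, `category`, `preview`) and the sample records are assumptions made for illustration; the actual API schema and data are not shown in this deck.

```python
import json
from collections import Counter

# Placeholder records standing in for the API data dump. The field names
# ("title", "category", "preview") are assumed, not the real schema, and
# the values are invented purely so the sketch can run.
SAMPLE_DUMP = json.dumps([
    {"title": "Post A", "category": "Machine Learning", "preview": "..."},
    {"title": "Post B", "category": "Business Analytics", "preview": "..."},
    {"title": "Post C", "category": "Machine Learning", "preview": "..."},
    {"title": "Post D", "preview": "..."},
])


def load_posts(raw_json):
    """Parse a JSON array of blog records into a list of dicts."""
    posts = json.loads(raw_json)
    if not isinstance(posts, list):
        raise ValueError("expected a JSON array of blog records")
    return posts


def category_counts(posts):
    """Count posts per category; records with no category are 'Unknown'."""
    return Counter(post.get("category") or "Unknown" for post in posts)


posts = load_posts(SAMPLE_DUMP)
counts = category_counts(posts)

# Print categories from most to least frequent, as a text-only histogram.
for category, n in counts.most_common():
    print(f"{category}: {'#' * n} ({n})")
```

With the real dump, `SAMPLE_DUMP` would be replaced by the contents of the downloaded file, and the counts could be passed to Tableau or a plotting library for the category histograms.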
13. Overview
Analytics Vidhya is an active community
Active publication of blogs
Active user base to utilize and improve the content via feedback
Analytics Vidhya provides relevant content
Category histograms show a broad coverage of topics
Word clouds and topic models confirm the relevant content within blogs
Analysis based on a small data set from 2015
Useful to repeat the analyses on a larger, more recent data set
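The word clouds mentioned in this overview rest on word frequencies computed from the blog text. The sketch below shows one simple way to compute them in Python; the stopword list, the tokenization rule, and the two sample previews are illustrative assumptions, not the method or data used in the original analysis.

```python
import re
from collections import Counter

# A deliberately tiny stopword list, assumed for illustration only.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "for", "is", "on", "with"}


def word_frequencies(texts, stopwords=STOPWORDS):
    """Count lowercase word occurrences across texts, skipping stopwords
    and tokens of two characters or fewer."""
    counts = Counter()
    for text in texts:
        tokens = re.findall(r"[a-z']+", text.lower())
        counts.update(t for t in tokens if t not in stopwords and len(t) > 2)
    return counts


# Invented previews standing in for the blog article previews in the dump.
previews = [
    "An introduction to regression in Python",
    "Regression and classification with Python for beginners",
]

freqs = word_frequencies(previews)
print(freqs.most_common(3))
```

Rerunning the same function on a larger, more recent dump would show whether the dominant terms have shifted since 2015.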