際際滷shows by User: mrchristine / http://www.slideshare.net/images/logo.gif 際際滷shows by User: mrchristine / Wed, 29 Mar 2017 13:23:17 GMT 際際滷Share feed for 際際滷shows by User: mrchristine What's new with Apache Spark's Structured Streaming? /slideshow/whats-new-with-apache-sparks-structured-streaming/73866136 whatsnewwithstructuredstreaming-170329132317
Overview of the changes from Apache Spark streaming dstreams to structured streaming. ]]>

Overview of the changes from Apache Spark streaming dstreams to structured streaming. ]]>
Wed, 29 Mar 2017 13:23:17 GMT /slideshow/whats-new-with-apache-sparks-structured-streaming/73866136 mrchristine@slideshare.net(mrchristine) What's new with Apache Spark's Structured Streaming? mrchristine Overview of the changes from Apache Spark streaming dstreams to structured streaming. <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/whatsnewwithstructuredstreaming-170329132317-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> Overview of the changes from Apache Spark streaming dstreams to structured streaming.
What's new with Apache Spark's Structured Streaming? from Miklos Christine
]]>
1961 3 https://cdn.slidesharecdn.com/ss_thumbnails/whatsnewwithstructuredstreaming-170329132317-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
Fighting Fraud with Apache Spark /slideshow/fighting-fraud-with-apache-spark/66098723 fightingfraudwithapachespark-160916145105
Big Data Healthcare Innovations, Trends, and Use Cases ]]>

Big Data Healthcare Innovations, Trends, and Use Cases ]]>
Fri, 16 Sep 2016 14:51:04 GMT /slideshow/fighting-fraud-with-apache-spark/66098723 mrchristine@slideshare.net(mrchristine) Fighting Fraud with Apache Spark mrchristine Big Data Healthcare Innovations, Trends, and Use Cases <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/fightingfraudwithapachespark-160916145105-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> Big Data Healthcare Innovations, Trends, and Use Cases
Fighting Fraud with Apache Spark from Miklos Christine
]]>
541 5 https://cdn.slidesharecdn.com/ss_thumbnails/fightingfraudwithapachespark-160916145105-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
The Nitty Gritty of Advanced Analytics Using Apache Spark in Python /slideshow/the-nitty-gritty-of-advanced-analytics-using-apache-spark-in-python/61862255 nittygrittypyspark-160510133236
Apache Spark is the next big data processing tool for Data Scientist. As seen on the recent StackOverflow analysis, it's the hottest big data technology on their site! In this talk, I'll use the PySpark interface to leverage the speed and performance of Apache Spark. I'll focus on the end to end workflow for getting data into a distributed platform, and leverage Spark to process the data for advanced analytics. I'll discuss the popular Spark APIs used for data preparation, SQL analysis, and ML algorithms. I'll explain the performance differences between Scala and Python, and how Spark has bridged the gap in performance. I'll focus on PySpark as the interface to the platform, and walk through a demo to showcase the APIs. Talk Overview: Spark's Architecture. What's out now and what's in Spark 2.0Spark APIs: Most common APIs used by Spark Common misconceptions and proper techniques for using Spark. Demo: Walk through ETL of the Reddit dataset. SparkSQL Analytics + Visualizations of the Dataset using MatplotLibSentiment Analysis on Reddit Comments]]>

Apache Spark is the next big data processing tool for Data Scientist. As seen on the recent StackOverflow analysis, it's the hottest big data technology on their site! In this talk, I'll use the PySpark interface to leverage the speed and performance of Apache Spark. I'll focus on the end to end workflow for getting data into a distributed platform, and leverage Spark to process the data for advanced analytics. I'll discuss the popular Spark APIs used for data preparation, SQL analysis, and ML algorithms. I'll explain the performance differences between Scala and Python, and how Spark has bridged the gap in performance. I'll focus on PySpark as the interface to the platform, and walk through a demo to showcase the APIs. Talk Overview: Spark's Architecture. What's out now and what's in Spark 2.0Spark APIs: Most common APIs used by Spark Common misconceptions and proper techniques for using Spark. Demo: Walk through ETL of the Reddit dataset. SparkSQL Analytics + Visualizations of the Dataset using MatplotLibSentiment Analysis on Reddit Comments]]>
Tue, 10 May 2016 13:32:36 GMT /slideshow/the-nitty-gritty-of-advanced-analytics-using-apache-spark-in-python/61862255 mrchristine@slideshare.net(mrchristine) The Nitty Gritty of Advanced Analytics Using Apache Spark in Python mrchristine Apache Spark is the next big data processing tool for Data Scientist. As seen on the recent StackOverflow analysis, it's the hottest big data technology on their site! In this talk, I'll use the PySpark interface to leverage the speed and performance of Apache Spark. I'll focus on the end to end workflow for getting data into a distributed platform, and leverage Spark to process the data for advanced analytics. I'll discuss the popular Spark APIs used for data preparation, SQL analysis, and ML algorithms. I'll explain the performance differences between Scala and Python, and how Spark has bridged the gap in performance. I'll focus on PySpark as the interface to the platform, and walk through a demo to showcase the APIs. Talk Overview: Spark's Architecture. What's out now and what's in Spark 2.0Spark APIs: Most common APIs used by Spark Common misconceptions and proper techniques for using Spark. Demo: Walk through ETL of the Reddit dataset. SparkSQL Analytics + Visualizations of the Dataset using MatplotLibSentiment Analysis on Reddit Comments <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/nittygrittypyspark-160510133236-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> Apache Spark is the next big data processing tool for Data Scientist. As seen on the recent StackOverflow analysis, it&#39;s the hottest big data technology on their site! In this talk, I&#39;ll use the PySpark interface to leverage the speed and performance of Apache Spark. I&#39;ll focus on the end to end workflow for getting data into a distributed platform, and leverage Spark to process the data for advanced analytics. I&#39;ll discuss the popular Spark APIs used for data preparation, SQL analysis, and ML algorithms. I&#39;ll explain the performance differences between Scala and Python, and how Spark has bridged the gap in performance. I&#39;ll focus on PySpark as the interface to the platform, and walk through a demo to showcase the APIs. Talk Overview: Spark&#39;s Architecture. What&#39;s out now and what&#39;s in Spark 2.0Spark APIs: Most common APIs used by Spark Common misconceptions and proper techniques for using Spark. Demo: Walk through ETL of the Reddit dataset. SparkSQL Analytics + Visualizations of the Dataset using MatplotLibSentiment Analysis on Reddit Comments
The Nitty Gritty of Advanced Analytics Using Apache Spark in Python from Miklos Christine
]]>
1454 8 https://cdn.slidesharecdn.com/ss_thumbnails/nittygrittypyspark-160510133236-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
ETL to ML: Use Apache Spark as an end to end tool for Advanced Analytics /slideshow/etl-to-ml-use-apache-spark-as-an-end-to-end-tool-for-advanced-analytics/61471324 etl2mlsparkmiklos-160428161912
Spark Meetup slides to discuss ETL, SQL, and Machine Learning strategies using Apache Spark as the processing engine. ]]>

Spark Meetup slides to discuss ETL, SQL, and Machine Learning strategies using Apache Spark as the processing engine. ]]>
Thu, 28 Apr 2016 16:19:11 GMT /slideshow/etl-to-ml-use-apache-spark-as-an-end-to-end-tool-for-advanced-analytics/61471324 mrchristine@slideshare.net(mrchristine) ETL to ML: Use Apache Spark as an end to end tool for Advanced Analytics mrchristine Spark Meetup slides to discuss ETL, SQL, and Machine Learning strategies using Apache Spark as the processing engine. <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/etl2mlsparkmiklos-160428161912-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> Spark Meetup slides to discuss ETL, SQL, and Machine Learning strategies using Apache Spark as the processing engine.
ETL to ML: Use Apache Spark as an end to end tool for Advanced Analytics from Miklos Christine
]]>
1213 6 https://cdn.slidesharecdn.com/ss_thumbnails/etl2mlsparkmiklos-160428161912-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
https://cdn.slidesharecdn.com/profile-photo-mrchristine-48x48.jpg?cb=1602210114 https://cdn.slidesharecdn.com/ss_thumbnails/whatsnewwithstructuredstreaming-170329132317-thumbnail.jpg?width=320&height=320&fit=bounds slideshow/whats-new-with-apache-sparks-structured-streaming/73866136 What&#39;s new with Apache... https://cdn.slidesharecdn.com/ss_thumbnails/fightingfraudwithapachespark-160916145105-thumbnail.jpg?width=320&height=320&fit=bounds slideshow/fighting-fraud-with-apache-spark/66098723 Fighting Fraud with Ap... https://cdn.slidesharecdn.com/ss_thumbnails/nittygrittypyspark-160510133236-thumbnail.jpg?width=320&height=320&fit=bounds slideshow/the-nitty-gritty-of-advanced-analytics-using-apache-spark-in-python/61862255 The Nitty Gritty of Ad...