ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
•
•
•
•
•
•
• ANY OTHER
PROVIDER PROVISIONING TOOLS
• HTTPS://WWW.VAGRANTUP.COM/DOWNLOADS.HTML
• VIRTUALIZATION
•
•
GUEST OPERATING SYSTEMS
• HTTPS://WWW.VIRTUALBOX.ORG/WIKI/DOWNLOADS
•
• HTTPS://GITHUB.COM/FELIXCHEUNG/VAGRANT-PROJECTS
• SPARK-CASSANDRA-ZEPPELIN
•
•
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
•
•
•
•
•
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
•
•
HTTPS://ZEPPELIN.INCUBATOR.APACHE.ORG/
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
• HTTPS://GITHUB.COM/FELIXCHEUNG/SPARK-NOTEBOOK-
EXAMPLES/TREE/MASTER/ZEPPELIN_NOTEBOOK/APACHECON2016
•
•
• HTTP://SPARK.APACHE.ORG/DOCS/LATE
ST/CONFIGURATION.HTML
•
•
• HTTPS://GITHUB.COM/FELIXCHEUNG/SPARK-NOTEBOOK-
EXAMPLES/TREE/MASTER/ZEPPELIN_NOTEBOOK/APACHECON2016
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
• PARTITION
CLUSTER MEAN PROTOTYPE
• HTTP://THEORY.STANFORD.EDU/~SERGEI/PAPERS/VLDB12-KMPAR.PDF
K-MEANS++
• STREAMING K-MEANS
•
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
GRAPHFRAMES
•
•
•
•
•
•
•
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
•
• BIGTABLE: A DISTRIBUTED STORAGE SYSTEM FOR STRUCTURED DATA
•
•
•
•
•
• HTTPS://HBASE.APACHE.ORG/BOOK.HTML#QUICKSTART
•
•
•
•
•
•
•
•
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
• BORN AT FACEBOOK AMAZON’S DYNAMO AND GOOGLE’S BIGTABLE
•
•
•
•
•
• HTTP://WIKI.APACHE.ORG/CASSANDRA/GETTINGSTARTED
•
•
•
•
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
•
•
•
•
• HERE
•
•
•
• HTTPS://WWW.DIGITALOCEAN.COM/COMMUNITY/TUTORIALS/HOW-TO-INSTALL-CASSANDRA-AND-RUN-
A-SINGLE-NODE-CLUSTER-ON-A-UBUNTU-VPS
•
Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark
•
•
•
•
•
•
•
•
•
•
•
•
•
• M3.XLARGE
•
•
•
•
•
•
•
•
•
http://www.natalinobusa.com/2015/11/why-is-smack-stack-all-rage-lately.html
• HTTPS://DOCS.MESOSPHERE.COM/ADMINISTRATION/INSTALLING/CLOUD/AWS/
• HTTPS://DCOS.IO/DOCS/1.7/ADMINISTRATION/INSTALLING/CLOUD/AWS/
•
•
•
•
•
• https://dcos.io/docs/1.7/usage/tutorials/spark/
• HTTPS://GITHUB.COM/FELIXCHEUNG

More Related Content

Interactive Data Science From Scratch with Apache Zeppelin and Apache Spark

Editor's Notes

  • #35: https://docs.mesosphere.com/1-7/usage/services/zeppelin/