狠狠撸shows by User: prakash573
/
http://www.slideshare.net/images/logo.gif狠狠撸shows by User: prakash573
/
Sat, 17 Aug 2019 01:41:35 GMT狠狠撸Share feed for 狠狠撸shows by User: prakash573The delta architecture
/slideshow/the-delta-architecture-164400585/164400585
thedeltaarchitecture-190817014135 Lambda architecture is a popular technique where records are processed by a batch system and streaming system in parallel. The results are then combined during query time to provide a complete answer. Strict latency requirements to process old and recently generated events made this architecture popular. The key downside to this architecture is the development and operational overhead of managing two different systems.
There have been attempts to unify batch and streaming into a single system in the past. Organizations have not been that successful though in those attempts. But, with the advent of Delta Lake, we are seeing lot of engineers adopting a simple continuous data flow model to process data as it arrives. We call this architecture, The Delta Architecture.]]>
Lambda architecture is a popular technique where records are processed by a batch system and streaming system in parallel. The results are then combined during query time to provide a complete answer. Strict latency requirements to process old and recently generated events made this architecture popular. The key downside to this architecture is the development and operational overhead of managing two different systems.
There have been attempts to unify batch and streaming into a single system in the past. Organizations have not been that successful though in those attempts. But, with the advent of Delta Lake, we are seeing lot of engineers adopting a simple continuous data flow model to process data as it arrives. We call this architecture, The Delta Architecture.]]>
Sat, 17 Aug 2019 01:41:35 GMT/slideshow/the-delta-architecture-164400585/164400585prakash573@slideshare.net(prakash573)The delta architectureprakash573Lambda architecture is a popular technique where records are processed by a batch system and streaming system in parallel. The results are then combined during query time to provide a complete answer. Strict latency requirements to process old and recently generated events made this architecture popular. The key downside to this architecture is the development and operational overhead of managing two different systems.
There have been attempts to unify batch and streaming into a single system in the past. Organizations have not been that successful though in those attempts. But, with the advent of Delta Lake, we are seeing lot of engineers adopting a simple continuous data flow model to process data as it arrives. We call this architecture, The Delta Architecture.<img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/thedeltaarchitecture-190817014135-thumbnail.jpg?width=120&height=120&fit=bounds" /><br> Lambda architecture is a popular technique where records are processed by a batch system and streaming system in parallel. The results are then combined during query time to provide a complete answer. Strict latency requirements to process old and recently generated events made this architecture popular. The key downside to this architecture is the development and operational overhead of managing two different systems.
There have been attempts to unify batch and streaming into a single system in the past. Organizations have not been that successful though in those attempts. But, with the advent of Delta Lake, we are seeing lot of engineers adopting a simple continuous data flow model to process data as it arrives. We call this architecture, The Delta Architecture.
]]>
7851https://cdn.slidesharecdn.com/ss_thumbnails/thedeltaarchitecture-190817014135-thumbnail.jpg?width=120&height=120&fit=boundspresentationBlackhttp://activitystrea.ms/schema/1.0/posthttp://activitystrea.ms/schema/1.0/posted0Databricks clusters in autopilot mode
/slideshow/databricks-clusters-in-autopilot-mode/81206425
databricksclustersinautopilotmodefinal-171025182936 Why building a big data platform is hard? What are the key aspects involved in providing a "Serverless" experience for data folks. And how Databricks solves infrastructure problems and provides the "Serverless" experience.]]>
Why building a big data platform is hard? What are the key aspects involved in providing a "Serverless" experience for data folks. And how Databricks solves infrastructure problems and provides the "Serverless" experience.]]>
Wed, 25 Oct 2017 18:29:36 GMT/slideshow/databricks-clusters-in-autopilot-mode/81206425prakash573@slideshare.net(prakash573)Databricks clusters in autopilot modeprakash573Why building a big data platform is hard? What are the key aspects involved in providing a "Serverless" experience for data folks. And how Databricks solves infrastructure problems and provides the "Serverless" experience.<img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/databricksclustersinautopilotmodefinal-171025182936-thumbnail.jpg?width=120&height=120&fit=bounds" /><br> Why building a big data platform is hard? What are the key aspects involved in providing a "Serverless" experience for data folks. And how Databricks solves infrastructure problems and provides the "Serverless" experience.
]]>
4501https://cdn.slidesharecdn.com/ss_thumbnails/databricksclustersinautopilotmodefinal-171025182936-thumbnail.jpg?width=120&height=120&fit=boundspresentationBlackhttp://activitystrea.ms/schema/1.0/posthttp://activitystrea.ms/schema/1.0/posted0So you think you can stream.pptx
/slideshow/so-you-think-you-can-streampptx/62782005
soyouthinkyoucanstream-160606185647 Best practices on Spark streaming.]]>
Best practices on Spark streaming.]]>
Mon, 06 Jun 2016 18:56:46 GMT/slideshow/so-you-think-you-can-streampptx/62782005prakash573@slideshare.net(prakash573)So you think you can stream.pptxprakash573Best practices on Spark streaming.<img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/soyouthinkyoucanstream-160606185647-thumbnail.jpg?width=120&height=120&fit=bounds" /><br> Best practices on Spark streaming.
]]>
5594https://cdn.slidesharecdn.com/ss_thumbnails/soyouthinkyoucanstream-160606185647-thumbnail.jpg?width=120&height=120&fit=boundspresentation000000http://activitystrea.ms/schema/1.0/posthttp://activitystrea.ms/schema/1.0/posted0Spark streaming: Best Practices
/slideshow/spark-streaming-best-practices/62650607
sparkstreaming-databythebay1-160602095639 Best practices on Spark streaming]]>
Best practices on Spark streaming]]>
Thu, 02 Jun 2016 09:56:39 GMT/slideshow/spark-streaming-best-practices/62650607prakash573@slideshare.net(prakash573)Spark streaming: Best Practicesprakash573Best practices on Spark streaming<img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/sparkstreaming-databythebay1-160602095639-thumbnail.jpg?width=120&height=120&fit=bounds" /><br> Best practices on Spark streaming
]]>
6113https://cdn.slidesharecdn.com/ss_thumbnails/6d614b50-86e8-46b4-ba66-0bf89ca8958f-150603055258-lva1-app6891-thumbnail.jpg?width=120&height=120&fit=boundsdocument000000http://activitystrea.ms/schema/1.0/posthttp://activitystrea.ms/schema/1.0/posted0https://cdn.slidesharecdn.com/profile-photo-prakash573-48x48.jpg?cb=1566006075I enjoy building products. Getting things done.
I have 3 years of product management experience and 10+ years of experience in software engineering in building highly distributed and scalable real-time and near real-time systems handling high volume of traffic and also offline systems handling huge amount of data for machine learning purposes. I enjoy envisioning the big picture and the business impact as much as working on the details of a project.http://prakashc.googlepages.comhttps://cdn.slidesharecdn.com/ss_thumbnails/thedeltaarchitecture-190817014135-thumbnail.jpg?width=320&height=320&fit=boundsslideshow/the-delta-architecture-164400585/164400585The delta architecturehttps://cdn.slidesharecdn.com/ss_thumbnails/databricksclustersinautopilotmodefinal-171025182936-thumbnail.jpg?width=320&height=320&fit=boundsslideshow/databricks-clusters-in-autopilot-mode/81206425Databricks clusters in...https://cdn.slidesharecdn.com/ss_thumbnails/soyouthinkyoucanstream-160606185647-thumbnail.jpg?width=320&height=320&fit=boundsslideshow/so-you-think-you-can-streampptx/62782005So you think you can s...