Slideshows by User: SubhasishGuha1

Data Quality, Correctness and Dynamic Transformations using Spark and Scala /slideshow/data-quality-correctness-and-dynamic-transformations-using-spark-and-scala/103016088
Implementing an enterprise metadata-driven accelerator for data ingestion and linear transformations. This article contains a road map for designing your framework to handle different ingestion scenarios.
Mon, 25 Jun 2018 19:34:08 GMT
Data Quality, Correctness and Dynamic Transformations using Spark and Scala from Subhasish Guha
Parallel Execution of Jobs in Hadoop /slideshow/parallel-execution-of-jobs-in-hadoop-64592991/64592991
Mon, 01 Aug 2016 19:37:36 GMT
Parallel Execution of Jobs in Hadoop from Subhasish Guha
Parallel execution of jobs in Hadoop /slideshow/parallel-execution-of-jobs-in-hadoop/64592721
Executing jobs in parallel is the focus of this article. As a Hadoop developer, I faced this issue and tried to provide a generic solution throughout a project.
Mon, 01 Aug 2016 19:29:01 GMT
Parallel execution of jobs in hadoop from Subhasish Guha
Dynamic Width File in Spark_2016 /slideshow/dynamic-width-file-in-spark2016/63920366
Mon, 11 Jul 2016 18:03:26 GMT
Dynamic Width File in Spark_2016 from Subhasish Guha
Dynamic width file in Spark /slideshow/dynamic-width-file-in-spark/63918640
This is a powerful example of how to handle dynamic-width files in Spark. This is a common scenario in the financial industry.
Mon, 11 Jul 2016 17:12:48 GMT
Dynamic width file in Spark from Subhasish Guha
ETL and pivoting in Spark /slideshow/etl-and-pivoting-in-spark-63881602/63881602
This presentation explains how real-life ETL problems can be overcome using Spark, DataFrames and Spark SQL.
Sun, 10 Jul 2016 09:01:01 GMT
ETL and pivoting in spark from Subhasish Guha
ETL and pivoting in Spark /slideshow/etl-and-pivoting-in-spark/63881530
This blog explains how Spark can be an effective tool for the next generation of real-time data warehouses.
Sun, 10 Jul 2016 08:55:29 GMT
ETL and pivoting in spark from Subhasish Guha
Started my career as an ETL developer, with extensive experience in Informatica PowerCenter and Teradata in the retail sector. Worked extensively with Teradata utilities such as FastLoad, MLoad, TPump, BTEQ and Teradata Parallel Transporter. Experienced in working directly with the client and onsite counterparts on high-level design, planning, estimation and execution of projects. Developed more than one tool in Java and Unix and took part in the Hard Dollar savings challenge in Cognizant Innovation. Trained in Big Data and joined a Teradata-to-Hive migration project, using several tools such as Teradata Hadoop Connector, Hive, HBase, MapReduce, Oozie, Apache Spark, Informatica Big data Editio...