ºÝºÝߣshows by User: gagravarr / http://www.slideshare.net/images/logo.gif ºÝºÝߣshows by User: gagravarr / Wed, 14 Feb 2018 20:53:04 GMT ºÝºÝߣShare feed for ºÝºÝߣshows by User: gagravarr Turning XML to XLS on the JVM, without loosing your Sanity, with Groovy /gagravarr/turning-xml-to-xls-on-the-jvm-without-loosing-your-sanity-with-groovy talk-180214205304
You've got an XML file. You need a XLS or XLSX spreadsheet. You need it urgently. How do you turn one into the other, without going made? Groovy on the Java JVM to the rescue!]]>

You've got an XML file. You need a XLS or XLSX spreadsheet. You need it urgently. How do you turn one into the other, without going made? Groovy on the Java JVM to the rescue!]]>
Wed, 14 Feb 2018 20:53:04 GMT /gagravarr/turning-xml-to-xls-on-the-jvm-without-loosing-your-sanity-with-groovy gagravarr@slideshare.net(gagravarr) Turning XML to XLS on the JVM, without loosing your Sanity, with Groovy gagravarr You've got an XML file. You need a XLS or XLSX spreadsheet. You need it urgently. How do you turn one into the other, without going made? Groovy on the Java JVM to the rescue! <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/talk-180214205304-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> You&#39;ve got an XML file. You need a XLS or XLSX spreadsheet. You need it urgently. How do you turn one into the other, without going made? Groovy on the Java JVM to the rescue!
Turning XML to XLS on the JVM, without loosing your Sanity, with Groovy from gagravarr
]]>
129 2 https://cdn.slidesharecdn.com/ss_thumbnails/talk-180214205304-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
But we're already open source! Why would I want to bring my code to Apache? /slideshow/but-were-already-open-source-why-would-i-want-to-bring-my-code-to-apache/53459732 butwerealreadyopen-151002112907-lva1-app6891
From ApacheCon Europe 2015 in Budapest So, your business has already opened sourced some of its code? Great! Or you're thinking about it? That's fine! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough? In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit. Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you! ]]>

From ApacheCon Europe 2015 in Budapest So, your business has already opened sourced some of its code? Great! Or you're thinking about it? That's fine! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough? In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit. Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you! ]]>
Fri, 02 Oct 2015 11:29:07 GMT /slideshow/but-were-already-open-source-why-would-i-want-to-bring-my-code-to-apache/53459732 gagravarr@slideshare.net(gagravarr) But we're already open source! Why would I want to bring my code to Apache? gagravarr From ApacheCon Europe 2015 in Budapest So, your business has already opened sourced some of its code? Great! Or you're thinking about it? That's fine! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough? In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit. Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you! <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/butwerealreadyopen-151002112907-lva1-app6891-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> From ApacheCon Europe 2015 in Budapest So, your business has already opened sourced some of its code? Great! Or you&#39;re thinking about it? That&#39;s fine! But now, someone&#39;s asking you about giving it to these Apache people? What&#39;s up with that, and why isn&#39;t just being open source enough? In this talk, we&#39;ll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We&#39;ll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We&#39;ll also look briefly at where it may not be the right fit. Wondering about how to take your business&#39;s open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you!
But we're already open source! Why would I want to bring my code to Apache? from gagravarr
]]>
907 7 https://cdn.slidesharecdn.com/ss_thumbnails/butwerealreadyopen-151002112907-lva1-app6891-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
What's new with Apache Tika? /slideshow/whats-new-with-apache-tika/53459656 whatsnewwithapachetika-151002112751-lva1-app6891
A presentation from ApacheCon Europe 2015 / Apache Big Data Europe 2015 Apache Tika detects and extracts metadata and text from a huge range of file formats and types. From Search to Big Data, single file to internet scale, if you've got files, Tika can help you get out useful information! Apache Tika has been around for nearly 10 years now, and in that time, a lot has changed. Not only has the number of formats supported gone up and up, but the ways of using Tika have expanded, and some of the philosophies on the best way to handle things have altered with experience. Tika has gained support for a wide range of programming languages to, and more recently, Big-Data scale support, and ways to automatically compare effects of changes to the library. Whether you're an old-hand with Tika looking to know what's hot or different, or someone new looking to learn more about the power of Tika, this talk will have something in it for you!]]>

A presentation from ApacheCon Europe 2015 / Apache Big Data Europe 2015 Apache Tika detects and extracts metadata and text from a huge range of file formats and types. From Search to Big Data, single file to internet scale, if you've got files, Tika can help you get out useful information! Apache Tika has been around for nearly 10 years now, and in that time, a lot has changed. Not only has the number of formats supported gone up and up, but the ways of using Tika have expanded, and some of the philosophies on the best way to handle things have altered with experience. Tika has gained support for a wide range of programming languages to, and more recently, Big-Data scale support, and ways to automatically compare effects of changes to the library. Whether you're an old-hand with Tika looking to know what's hot or different, or someone new looking to learn more about the power of Tika, this talk will have something in it for you!]]>
Fri, 02 Oct 2015 11:27:50 GMT /slideshow/whats-new-with-apache-tika/53459656 gagravarr@slideshare.net(gagravarr) What's new with Apache Tika? gagravarr A presentation from ApacheCon Europe 2015 / Apache Big Data Europe 2015 Apache Tika detects and extracts metadata and text from a huge range of file formats and types. From Search to Big Data, single file to internet scale, if you've got files, Tika can help you get out useful information! Apache Tika has been around for nearly 10 years now, and in that time, a lot has changed. Not only has the number of formats supported gone up and up, but the ways of using Tika have expanded, and some of the philosophies on the best way to handle things have altered with experience. Tika has gained support for a wide range of programming languages to, and more recently, Big-Data scale support, and ways to automatically compare effects of changes to the library. Whether you're an old-hand with Tika looking to know what's hot or different, or someone new looking to learn more about the power of Tika, this talk will have something in it for you! <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/whatsnewwithapachetika-151002112751-lva1-app6891-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> A presentation from ApacheCon Europe 2015 / Apache Big Data Europe 2015 Apache Tika detects and extracts metadata and text from a huge range of file formats and types. From Search to Big Data, single file to internet scale, if you&#39;ve got files, Tika can help you get out useful information! Apache Tika has been around for nearly 10 years now, and in that time, a lot has changed. Not only has the number of formats supported gone up and up, but the ways of using Tika have expanded, and some of the philosophies on the best way to handle things have altered with experience. Tika has gained support for a wide range of programming languages to, and more recently, Big-Data scale support, and ways to automatically compare effects of changes to the library. Whether you&#39;re an old-hand with Tika looking to know what&#39;s hot or different, or someone new looking to learn more about the power of Tika, this talk will have something in it for you!
What's new with Apache Tika? from gagravarr
]]>
2969 11 https://cdn.slidesharecdn.com/ss_thumbnails/whatsnewwithapachetika-151002112751-lva1-app6891-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
What's with the 1s and 0s? Making sense of binary data at scale - Berlin Buzzwords 2015 /slideshow/bbuzz-1s-and0s/48978233 bbuzz-1sand0s-150604080602-lva1-app6892
If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are open source projects and libraries out there which can help, and which can scale! In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to use things like Apache Tika to do this, along with some other libraries to complement it. Once that part's all sorted, we'll look at how to roll this all out for a large-scale Search or Big Data setup, helping you turn those 1s and 0s into useful content at scale! This talk was given at Berlin Buzzwords 2015]]>

If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are open source projects and libraries out there which can help, and which can scale! In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to use things like Apache Tika to do this, along with some other libraries to complement it. Once that part's all sorted, we'll look at how to roll this all out for a large-scale Search or Big Data setup, helping you turn those 1s and 0s into useful content at scale! This talk was given at Berlin Buzzwords 2015]]>
Thu, 04 Jun 2015 08:06:02 GMT /slideshow/bbuzz-1s-and0s/48978233 gagravarr@slideshare.net(gagravarr) What's with the 1s and 0s? Making sense of binary data at scale - Berlin Buzzwords 2015 gagravarr If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are open source projects and libraries out there which can help, and which can scale! In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to use things like Apache Tika to do this, along with some other libraries to complement it. Once that part's all sorted, we'll look at how to roll this all out for a large-scale Search or Big Data setup, helping you turn those 1s and 0s into useful content at scale! This talk was given at Berlin Buzzwords 2015 <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/bbuzz-1sand0s-150604080602-lva1-app6892-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn&#39;t scale, mechanical turks or no! Luckily, there are open source projects and libraries out there which can help, and which can scale! In this talk, we&#39;ll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We&#39;ll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We&#39;ll see how to use things like Apache Tika to do this, along with some other libraries to complement it. Once that part&#39;s all sorted, we&#39;ll look at how to roll this all out for a large-scale Search or Big Data setup, helping you turn those 1s and 0s into useful content at scale! This talk was given at Berlin Buzzwords 2015
What's with the 1s and 0s? Making sense of binary data at scale - Berlin Buzzwords 2015 from gagravarr
]]>
677 2 https://cdn.slidesharecdn.com/ss_thumbnails/bbuzz-1sand0s-150604080602-lva1-app6892-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
The Apache Way /slideshow/the-apache-way-41759378/41759378 theapacheway14-141119094702-conversion-gate02
The ""Apache Way"" is the process by which Apache Software Foundation projects are managed. It has evolved over many years and has produced over 100 highly successful open source projects. But what is it and how does it work?]]>

The ""Apache Way"" is the process by which Apache Software Foundation projects are managed. It has evolved over many years and has produced over 100 highly successful open source projects. But what is it and how does it work?]]>
Wed, 19 Nov 2014 09:47:01 GMT /slideshow/the-apache-way-41759378/41759378 gagravarr@slideshare.net(gagravarr) The Apache Way gagravarr The ""Apache Way"" is the process by which Apache Software Foundation projects are managed. It has evolved over many years and has produced over 100 highly successful open source projects. But what is it and how does it work? <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/theapacheway14-141119094702-conversion-gate02-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> The &quot;&quot;Apache Way&quot;&quot; is the process by which Apache Software Foundation projects are managed. It has evolved over many years and has produced over 100 highly successful open source projects. But what is it and how does it work?
The Apache Way from gagravarr
]]>
439 1 https://cdn.slidesharecdn.com/ss_thumbnails/theapacheway14-141119094702-conversion-gate02-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
The other Apache Technologies your Big Data solution needs /slideshow/the-other-apache-technologies-your-big-data-solution-needs-41759287/41759287 apacheotherbigdata-141119094516-conversion-gate01
An overview of the various Apache Technologies to help you build your own Big Data solution]]>

An overview of the various Apache Technologies to help you build your own Big Data solution]]>
Wed, 19 Nov 2014 09:45:16 GMT /slideshow/the-other-apache-technologies-your-big-data-solution-needs-41759287/41759287 gagravarr@slideshare.net(gagravarr) The other Apache Technologies your Big Data solution needs gagravarr An overview of the various Apache Technologies to help you build your own Big Data solution <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/apacheotherbigdata-141119094516-conversion-gate01-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> An overview of the various Apache Technologies to help you build your own Big Data solution
The other Apache Technologies your Big Data solution needs from gagravarr
]]>
988 3 https://cdn.slidesharecdn.com/ss_thumbnails/apacheotherbigdata-141119094516-conversion-gate01-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
How Big is Big – Tall, Grande, Venti Data? /slideshow/tall-grandeventidata/41759141 tallgrandeventidata-141119094227-conversion-gate01
Apache has a wide range of Big Data projects, some suitable for smaller problem sets, some which scale to huge problems. Today though, that one label "Big Data" can cause confusion for new users, as they may struggle to pick the right project for the right scale for their problem. Do we need new titles for different kinds of Big Data? Does the buzz and VC funding cause confusion? Is the humble requirement dead? Or can we help new users better find the right Apache project for them?]]>

Apache has a wide range of Big Data projects, some suitable for smaller problem sets, some which scale to huge problems. Today though, that one label "Big Data" can cause confusion for new users, as they may struggle to pick the right project for the right scale for their problem. Do we need new titles for different kinds of Big Data? Does the buzz and VC funding cause confusion? Is the humble requirement dead? Or can we help new users better find the right Apache project for them?]]>
Wed, 19 Nov 2014 09:42:27 GMT /slideshow/tall-grandeventidata/41759141 gagravarr@slideshare.net(gagravarr) How Big is Big – Tall, Grande, Venti Data? gagravarr Apache has a wide range of Big Data projects, some suitable for smaller problem sets, some which scale to huge problems. Today though, that one label "Big Data" can cause confusion for new users, as they may struggle to pick the right project for the right scale for their problem. Do we need new titles for different kinds of Big Data? Does the buzz and VC funding cause confusion? Is the humble requirement dead? Or can we help new users better find the right Apache project for them? <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/tallgrandeventidata-141119094227-conversion-gate01-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> Apache has a wide range of Big Data projects, some suitable for smaller problem sets, some which scale to huge problems. Today though, that one label &quot;Big Data&quot; can cause confusion for new users, as they may struggle to pick the right project for the right scale for their problem. Do we need new titles for different kinds of Big Data? Does the buzz and VC funding cause confusion? Is the humble requirement dead? Or can we help new users better find the right Apache project for them?
How Big is Big – Tall, Grande, Venti Data? from gagravarr
]]>
969 1 https://cdn.slidesharecdn.com/ss_thumbnails/tallgrandeventidata-141119094227-conversion-gate01-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
If You Have The Content, Then Apache Has The Technology! /slideshow/if-you-have-the-content-then-apache-has-the-technology/41759070 contenttechnologies-141119094058-conversion-gate02
Within the ASF, there are a wide variety of projects with technologies to help you store, retrieve, host, transform and generate content. This talk will review the landscape of Apache content technologies, provide a quick introduction to the more common and more interesting projects, and flag up new and innovative features within them. It'll also highlight talks from the rest of the week on many of the projects covered, so that you'll know where and when to go to learn more about those projects and technologies which catch your eye!]]>

Within the ASF, there are a wide variety of projects with technologies to help you store, retrieve, host, transform and generate content. This talk will review the landscape of Apache content technologies, provide a quick introduction to the more common and more interesting projects, and flag up new and innovative features within them. It'll also highlight talks from the rest of the week on many of the projects covered, so that you'll know where and when to go to learn more about those projects and technologies which catch your eye!]]>
Wed, 19 Nov 2014 09:40:58 GMT /slideshow/if-you-have-the-content-then-apache-has-the-technology/41759070 gagravarr@slideshare.net(gagravarr) If You Have The Content, Then Apache Has The Technology! gagravarr Within the ASF, there are a wide variety of projects with technologies to help you store, retrieve, host, transform and generate content. This talk will review the landscape of Apache content technologies, provide a quick introduction to the more common and more interesting projects, and flag up new and innovative features within them. It'll also highlight talks from the rest of the week on many of the projects covered, so that you'll know where and when to go to learn more about those projects and technologies which catch your eye! <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/contenttechnologies-141119094058-conversion-gate02-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> Within the ASF, there are a wide variety of projects with technologies to help you store, retrieve, host, transform and generate content. This talk will review the landscape of Apache content technologies, provide a quick introduction to the more common and more interesting projects, and flag up new and innovative features within them. It&#39;ll also highlight talks from the rest of the week on many of the projects covered, so that you&#39;ll know where and when to go to learn more about those projects and technologies which catch your eye!
If You Have The Content, Then Apache Has The Technology! from gagravarr
]]>
1014 3 https://cdn.slidesharecdn.com/ss_thumbnails/contenttechnologies-141119094058-conversion-gate02-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
But We're Already Open Source! Why Would I Want To Bring My Code To Apache? /slideshow/but-werealreadyopen/41758953 butwerealreadyopen-141119093835-conversion-gate02
So, your business has already opened sourced some of it's code? Great! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough? In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit. Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you!]]>

So, your business has already opened sourced some of it's code? Great! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough? In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit. Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you!]]>
Wed, 19 Nov 2014 09:38:35 GMT /slideshow/but-werealreadyopen/41758953 gagravarr@slideshare.net(gagravarr) But We're Already Open Source! Why Would I Want To Bring My Code To Apache? gagravarr So, your business has already opened sourced some of it's code? Great! But now, someone's asking you about giving it to these Apache people? What's up with that, and why isn't just being open source enough? In this talk, we'll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We'll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We'll also look briefly at where it may not be the right fit. Wondering about how to take your business's open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you! <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/butwerealreadyopen-141119093835-conversion-gate02-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> So, your business has already opened sourced some of it&#39;s code? Great! But now, someone&#39;s asking you about giving it to these Apache people? What&#39;s up with that, and why isn&#39;t just being open source enough? In this talk, we&#39;ll look at several real world examples of where companies have chosen to contribute their existing open source code to the Apache Software Foundation. We&#39;ll see the advantages they got from it, the problems they faced along the way, why they did it, and how it helped their business. We&#39;ll also look briefly at where it may not be the right fit. Wondering about how to take your business&#39;s open source involvement to the next level, and if contributing to projects at the Apache Software Foundation will deliver RoI, then this is the talk for you!
But We're Already Open Source! Why Would I Want To Bring My Code To Apache? from gagravarr
]]>
297 1 https://cdn.slidesharecdn.com/ss_thumbnails/butwerealreadyopen-141119093835-conversion-gate02-thumbnail.jpg?width=120&height=120&fit=bounds presentation White http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
What's With The 1S And 0S? Making Sense Of Binary Data At Scale With Tika And Friends /slideshow/1s-and-0s-41758777/41758777 1sand0s-141119093537-conversion-gate02
]]>

]]>
Wed, 19 Nov 2014 09:35:37 GMT /slideshow/1s-and-0s-41758777/41758777 gagravarr@slideshare.net(gagravarr) What's With The 1S And 0S? Making Sense Of Binary Data At Scale With Tika And Friends gagravarr <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/1sand0s-141119093537-conversion-gate02-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br>
What's With The 1S And 0S? Making Sense Of Binary Data At Scale With Tika And Friends from gagravarr
]]>
301 1 https://cdn.slidesharecdn.com/ss_thumbnails/1sand0s-141119093537-conversion-gate02-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
What's with the 1s and 0s? Making sense of binary data at scale with Tika and friends /gagravarr/1s-and-0s 1sand0s-140409163925-phpapp02
If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are Apache projects out there which can help! In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to do all of this with Apache Tika, and how to dive down to the underlying libraries (including its Apache friends like POI and PDFBox) for specialist cases. Finally, we'll look a little bit about how to roll this all out on a Big Data or Large-Search case.]]>

If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are Apache projects out there which can help! In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to do all of this with Apache Tika, and how to dive down to the underlying libraries (including its Apache friends like POI and PDFBox) for specialist cases. Finally, we'll look a little bit about how to roll this all out on a Big Data or Large-Search case.]]>
Wed, 09 Apr 2014 16:39:25 GMT /gagravarr/1s-and-0s gagravarr@slideshare.net(gagravarr) What's with the 1s and 0s? Making sense of binary data at scale with Tika and friends gagravarr If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn't scale, mechanical turks or no! Luckily, there are Apache projects out there which can help! In this talk, we'll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We'll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We'll see how to do all of this with Apache Tika, and how to dive down to the underlying libraries (including its Apache friends like POI and PDFBox) for specialist cases. Finally, we'll look a little bit about how to roll this all out on a Big Data or Large-Search case. <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/1sand0s-140409163925-phpapp02-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> If you have one or two files, you can take the time to manually work out what they are, what they contain, and how to get the useful bits out (probably....). However, this approach really doesn&#39;t scale, mechanical turks or no! Luckily, there are Apache projects out there which can help! In this talk, we&#39;ll first look at how we can work out what a given blob of 1s and 0s actually is, be it textual or binary. We&#39;ll then see how to extract common metadata from it, along with text, embedded resources, images, and maybe even the kitchen sink! We&#39;ll see how to do all of this with Apache Tika, and how to dive down to the underlying libraries (including its Apache friends like POI and PDFBox) for specialist cases. Finally, we&#39;ll look a little bit about how to roll this all out on a Big Data or Large-Search case.
What's with the 1s and 0s? Making sense of binary data at scale with Tika and friends from gagravarr
]]>
1681 4 https://cdn.slidesharecdn.com/ss_thumbnails/1sand0s-140409163925-phpapp02-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
The other Apache technologies your big data solution needs! /slideshow/the-other-apache-technologies-your-big-data-solution-needs/22431737 apacheotherbigdata-130604075510-phpapp01
An overview of the various Apache Technologies to help you build your own Big Data solution. A talk given at Berlin Buzzwords, in June 2013.]]>

An overview of the various Apache Technologies to help you build your own Big Data solution. A talk given at Berlin Buzzwords, in June 2013.]]>
Tue, 04 Jun 2013 07:55:10 GMT /slideshow/the-other-apache-technologies-your-big-data-solution-needs/22431737 gagravarr@slideshare.net(gagravarr) The other Apache technologies your big data solution needs! gagravarr An overview of the various Apache Technologies to help you build your own Big Data solution. A talk given at Berlin Buzzwords, in June 2013. <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/apacheotherbigdata-130604075510-phpapp01-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> An overview of the various Apache Technologies to help you build your own Big Data solution. A talk given at Berlin Buzzwords, in June 2013.
The other Apache technologies your big data solution needs! from gagravarr
]]>
5138 5 https://cdn.slidesharecdn.com/ss_thumbnails/apacheotherbigdata-130604075510-phpapp01-thumbnail.jpg?width=120&height=120&fit=bounds presentation White http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
Apache Tika end-to-end /slideshow/apache-tika-endtoend/5714034 apachetikaend-to-end-101109063118-phpapp01
From the Fast Feather Track at ApacheCon NA 2010 in Atlanta This quick talk provides an overview of Apache Tika, looks at a new features and supported file formats. It then shows how to create a new parser, and finishes with using Tika from your own application.]]>

From the Fast Feather Track at ApacheCon NA 2010 in Atlanta This quick talk provides an overview of Apache Tika, looks at a new features and supported file formats. It then shows how to create a new parser, and finishes with using Tika from your own application.]]>
Tue, 09 Nov 2010 06:31:13 GMT /slideshow/apache-tika-endtoend/5714034 gagravarr@slideshare.net(gagravarr) Apache Tika end-to-end gagravarr From the Fast Feather Track at ApacheCon NA 2010 in Atlanta This quick talk provides an overview of Apache Tika, looks at a new features and supported file formats. It then shows how to create a new parser, and finishes with using Tika from your own application. <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/apachetikaend-to-end-101109063118-phpapp01-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> From the Fast Feather Track at ApacheCon NA 2010 in Atlanta This quick talk provides an overview of Apache Tika, looks at a new features and supported file formats. It then shows how to create a new parser, and finishes with using Tika from your own application.
Apache Tika end-to-end from gagravarr
]]>
4182 5 https://cdn.slidesharecdn.com/ss_thumbnails/apachetikaend-to-end-101109063118-phpapp01-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
Apache Content Technologies /gagravarr/apache-content-technologies contenttechnologies-101108100815-phpapp01
An overview of all the different content related technologies at the Apache Software Foundation Talk from ApacheCon NA 2010 in Atlanta in November 2010]]>

An overview of all the different content related technologies at the Apache Software Foundation Talk from ApacheCon NA 2010 in Atlanta in November 2010]]>
Mon, 08 Nov 2010 10:08:07 GMT /gagravarr/apache-content-technologies gagravarr@slideshare.net(gagravarr) Apache Content Technologies gagravarr An overview of all the different content related technologies at the Apache Software Foundation Talk from ApacheCon NA 2010 in Atlanta in November 2010 <img style="border:1px solid #C3E6D8;float:right;" alt="" src="https://cdn.slidesharecdn.com/ss_thumbnails/contenttechnologies-101108100815-phpapp01-thumbnail.jpg?width=120&amp;height=120&amp;fit=bounds" /><br> An overview of all the different content related technologies at the Apache Software Foundation Talk from ApacheCon NA 2010 in Atlanta in November 2010
Apache Content Technologies from gagravarr
]]>
398 2 https://cdn.slidesharecdn.com/ss_thumbnails/contenttechnologies-101108100815-phpapp01-thumbnail.jpg?width=120&height=120&fit=bounds presentation Black http://activitystrea.ms/schema/1.0/post http://activitystrea.ms/schema/1.0/posted 0
https://public.slidesharecdn.com/v2/images/profile-picture.png https://cdn.slidesharecdn.com/ss_thumbnails/talk-180214205304-thumbnail.jpg?width=320&height=320&fit=bounds gagravarr/turning-xml-to-xls-on-the-jvm-without-loosing-your-sanity-with-groovy Turning XML to XLS on ... https://cdn.slidesharecdn.com/ss_thumbnails/butwerealreadyopen-151002112907-lva1-app6891-thumbnail.jpg?width=320&height=320&fit=bounds slideshow/but-were-already-open-source-why-would-i-want-to-bring-my-code-to-apache/53459732 But we&#39;re already open... https://cdn.slidesharecdn.com/ss_thumbnails/whatsnewwithapachetika-151002112751-lva1-app6891-thumbnail.jpg?width=320&height=320&fit=bounds slideshow/whats-new-with-apache-tika/53459656 What&#39;s new with Apache...