Personal Information
Organization / Workplace
Ithaca, New York Area United States
Occupation
Applying Schemas for Natural Language Processing, Distributed Systems, Classification and Text Mining and Data Lakes
Industry
Technology / Software / Internet
Website
http://gen5.info/q/
About
For six years, my company has been focused on building commercial consumer-facing systems based on Linked data sources such as Freebase, DBpedia and Wikidata. I created :BaseKB, the first correct conversion of Freebase to RDF, which ensures that Freebase data will live on after Google shutters the service.
From this experience, we've methods for matching (i) syntax and schemas and (ii) instance data (specific things such as people, places, and legal entities) that use expressive business rules running inside a scalable fabric such as Spark or Hadoop to rapidly understand and clean up data from "data lakes" and other large collections. This technology also applies to communication...
Contact Details
Tags
rdf
semantic web
linked data
freebase
programming
c#
rich internet application
silverlight
php
asynchronous
javascript
software engineering
physics
java
deterministic chaos
classical chaos
solid state physics
ir
big data
search
dbpedia
flex
commonspot
oracle database
.net
software development
information retrieval
mapreduce
mysql
dba
ajax
apache hadoop
tools
sql
enterprise search
amazon web services
data lakes
patents
jena
quantum mechanics
quantum chaos
spin systems
reference data
sales management
lei
open access
master data management
artificial intelligence
metadata
postgresql
work
digital library
web applications
neural networks
user management
software management
eclipse
business strategy
non-functional requirements
business process
speed
creative commons
image
resource description framework
data science
machine learning
time series analysis
academic publishing
support vector machine
anharmonic localization
fractals
acoustic emission
nonlinearity
power law
black swan
usability
www
ltef
accessability
instruction
opac
tutoria
catalog
proxy web applications
http
cfmx
web server
apache httpd
documentation
arxiv
text classifcation. time series
owl
finance
fibo
aws
software
rdfeasy
aws marketplace
superman
taxonomy
ontology
comic books
lucene
relevance
enterprise software
ce
statistics
wikipedia
hashtables
product management
visual
user experience
ibm
value
blockchain
lei reference data bitemporal analysis big data
big data smart data cisco fog iot
reasoning
business rules
inference
neo4j
graph databases
schemas
hadoop ibm watson big data matching semantics
corporate entities
data quality
legal entity identifier
ria
gwt
error handling
exceptions
namespaces
methods
object orientation
nested sets
sets
trees
star schema
java server pages
jsp
xml
gis
sales force alignment
deep learning
neural network
maps
dictionaries
extension methods
sql server
stored procedures
constraints
microsoft sql server
microsoft
casting
microsoft sql
stored procedure
basekb
resume
hadoop
paul houle
callbacks
flash
dynamic
glopad
See more
- Presentations
- Documents
- Infographics