INFSCI 2711
Advanced Topics in
Database Management
Instructor:
Evgeny Karataev
The instructor
 Evgeny Karataev
 Where/how to find:
 In class: Tuesdays, 12:00 noon - 2:50 pm, IS 403
 Office Hours: online and by appointment
 email: epk8@pitt.edu
What this class is about
 Prerequisites:
 You know what a relational database and a database
management system are (INFSCI 2710)
 Do you think that [one] [R]DBMS on a [single] machine is
enough to handle today's volume, velocity, and variety
of data?
What this class is about
 Topics that will be covered in this class:
 Data Integration (OLAP and Data Warehousing,
Virtual Data Integration)
 Distributed and Parallel Databases (including
distributed transactions and query execution)
 NoSQL databases, NewSQL databases (Main Memory
Databases)
 Cluster Computing (Hadoop and other animals, Spark)
The textbooks
 Too many to list here
 there is no single book that covers all topics
 so I will post selected chapters of books available online,
blog posts, and research papers before or after
each lecture
Class components
 Lectures/Demos/Labs
 Homework Assignments
 Student DB tool overview presentations
 Term Research & Development Project
 Midterm exam
 Final exam
Lectures/Demos/Labs
 Lecture slides will usually be available online a day
before class.
 Sometimes you may be asked to bring your laptop to
class for lab work.
 Sometimes I will do demos of the DB systems related to
the class material.
Homework Assignments
 So far I have planned 4 assignments, spread fairly evenly over the
semester. However, this might change to 3, 5, or 6.
 Assignments are usually very practical and are based on
the material learned in class. You might need to do
some programming.
 All assignments need to be submitted ONLINE
(assignments will have submission instructions)
 All assignments are group-based (2 or 3 people per group)
Student DB tool overview presentations
 Each student (or possibly each group of 2 or 3) will give a
10-minute presentation about a database system sometime during
the term (I will provide the list of database systems). The
presentation must include:
 Architecture/Main idea/Main approach.
 Advantages and Disadvantages.
 How it differs from other systems.
 When it is applicable and when not.
 Where to learn more about it.
Term Research Project
 An original R&D project in groups of 5-6 people
 Most probably a lot of learning and programming
 In-class project progress reports/demos every other week (up to 15
minutes)
 One final written report
 One final demo
 Project ideas will be provided by me, but you are welcome to
propose your own
 Project development will be managed via GitHub
Exams
 Both the midterm and final exams are open notes, but no
computers or phones are allowed.
 Final exam is cumulative.
 No sample exam questions will be posted or
distributed.
Late Policy
 Homework and project reports are due at the beginning
of class on the due date. They can be turned in at the
following class with a 25% penalty.
Nothing will be accepted after that time.
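
The late policy can be read as a simple rule on a raw score. The sketch below is one interpretation, not the instructor's stated formula: it assumes the 25% penalty is taken off the earned score (the slides do not say whether it is 25% of the earned score or of the maximum), and the function name `late_adjusted` is hypothetical.

```python
def late_adjusted(score, classes_late):
    """Apply the late policy to a raw score.

    classes_late: 0 = on time, 1 = turned in at the following class,
    2 or more = later than that.
    Assumption: the 25% penalty is deducted from the earned score.
    """
    if classes_late < 0:
        raise ValueError("classes_late must be non-negative")
    if classes_late == 0:
        return score
    if classes_late == 1:
        return score * 0.75
    # Nothing is accepted after the following class.
    return 0.0
```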
Grading
 This course is being offered for three credits. The
grading is as follows:
 Homework Assignments: 20%
 DB tool presentation: 10%
 Midterm exam: 20%
 Project: 25%
 Final exam: 25%
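
The weights above sum to 100%, so a final score works out to a weighted average of the component scores. A minimal sketch, assuming every component is scored on a 0-100 scale (the slides do not say how components are scaled); the names `WEIGHTS` and `final_score` are illustrative and not part of the course materials.

```python
# Component weights from the grading breakdown (percent of the final grade).
WEIGHTS = {
    "homework": 20,
    "db_tool_presentation": 10,
    "midterm": 20,
    "project": 25,
    "final_exam": 25,
}


def final_score(scores):
    """Weighted average; assumes each component score is on a 0-100 scale."""
    missing = set(WEIGHTS) - set(scores)
    if missing:
        raise ValueError(f"missing components: {sorted(missing)}")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / 100
```

For example, scores of 90 (homework), 80 (presentation), 70 (midterm), 85 (project), and 75 (final exam) give (1800 + 800 + 1400 + 2125 + 1875) / 100 = 80.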
Class Q&A (and more) Management System
 This term we will be using Piazza for class discussion.
The system is designed to get you help quickly and
efficiently from classmates and from me. Rather than
emailing questions to me, I encourage you to post your
questions on Piazza. If you have any problems or
feedback for the developers, email team@piazza.com.
 Find our class page at: https://piazza.com/pitt/spring2015/infsci2711/home
Piazza Demo
Extra credit toward your grade
 The top 5 most active users on Piazza will get 5 extra points
 Active users are those who:
 ask many GOOD questions
 answer questions posted by others (preferably
before I answer)

