Probabilistic Ranking of Database Query Results Surajit Chaudhuri, Microsoft Research Gautam Das, Microsoft Research Vagelis Hristidis, Florida International University Gerhard Weikum, MPI Informatik Presented by Raghunath Ravi Sivaramakrishnan Subramani CSE@UTA 1
Probabilistic Ranking of Database Query Results. Surajit Chaudhuri , Microsoft Research Gautam Das, Microsoft Research Vagelis Hristidis , Florida International University Gerhard Weikum , MPI Informatik. Presented by Raghunath Ravi Sivaramakrishnan Subramani CSE@UTA. Roadmap. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
1
Probabilistic Ranking of Database Query Results
Surajit Chaudhuri, Microsoft ResearchGautam Das, Microsoft ResearchVagelis Hristidis, Florida International UniversityGerhard Weikum, MPI Informatik
Presented by Raghunath Ravi
Sivaramakrishnan SubramaniCSE@UTA
2
RoadmapMotivationKey ProblemsSystem ArchitectureConstruction of Ranking FunctionImplementationExperimentsConclusion and open problems
3
MotivationMany-answers problemTwo alternative solutions:
Query reformulation Automatic rankingApply probabilistic model in IR to
DB tuple ranking
4
Example – Realtor DatabaseHouse Attributes: Price, City,
Bedrooms, Bathrooms, SchoolDistrict, Waterfront, BoatDock, Year
Query: City =`Seattle’ AND Waterfront = TRUE
Too Many Results!
Intuitively, Houses with lower Price, more Bedrooms, or BoatDock are generally preferable
5
Rank According to Unspecified AttributesScore of a Result Tuple t depends onGlobal Score: Global Importance of
Unspecified Attribute Values [CIDR2003]◦ E.g., Newer Houses are generally preferred
Conditional Score: Correlations between Specified and Unspecified Attribute Values◦ E.g., Waterfront BoatDock
Many Bedrooms Good School District
6
RoadmapMotivationKey ProblemsSystem ArchitectureConstruction of Ranking FunctionImplementationExperimentsConclusion and open problems
7
Key ProblemsGiven a Query Q, How to
Combine the Global and Conditional Scores into a Ranking Function.Use Probabilistic Information Retrieval (PIR).
How to Calculate the Global and Conditional Scores.Use Query Workload and Data.
8
RoadmapMotivationKey ProblemsSystem ArchitectureConstruction of Ranking FunctionImplementationExperimentsConclusion and open problems
9
System Architecture
10
RoadmapMotivationKey ProblemsSystem ArchitectureConstruction of Ranking
FunctionImplementationExperimentsConclusion and open problems