DOC PREVIEW
USC CSCI 599 - Ali Presentation1

This preview shows page 1 out of 3 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 3 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 3 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

8/27/20091Search EngineforforShoah FoundationPresented byAli Khodaei([email protected])Outline• Shoah Foundation• Our project• What we need Shoah Foundation• The USC Shoah Foundation Institute for Visual History and Education, with an archive of nearly 52,000 videotaped testimonies from Holocaust survivors and other witnesses, is part of the College of Letters Arts & Sciences at theCollege of Letters, Arts & Sciences at the University of Southern California • The USC Shoah Foundation Institute’s Visual History Archive (VHA) is a software tool that allows users to search data and view digital video in the USC Shoah Foundation Institute’s archive. Shoah Foundation•A segment is a one-minute unit of a testimony in the VHA. Testimonies are divided into one-minute segments which can be retrieved by the end user through keyword searches. –Not every segment has keywords attached–Not every segment has keywords attached. • Keywords are attached to one-minute segments when a topic is discussed or described in some detail. – If the discussion or description spans several segments, the relevant keywords are usually applied once Shoah Foundation• Testimonies can be searched based on keywords– You can choose to perform an AND or OR search • The AND search retrieves segments that include all of your Selected Keywords (up to 35 keywords)Selected Keywords (up to 35 keywords). • The AND search also permits you to chose a Segment Range. It is possible to search for all Selected Keywords appearing in the same segment (i.e. 1 segment), within 5 (consecutive) segments, within 10 (consecutive) segments, within 15 (consecutive) segments or within the Entire testimony. Shoah Foundation• Two types of search– Quick search• Regular keyword search– Global keyword search• search for segments of testimonies that discuss specific topics• Topics are predefined• using more than 50,000 geographic and experiential keywords8/27/20092Shoah Foundation• Global Keyword SearchShoah Foundation• ResultOur project• Robust, efficient and interactive search engine ranking testimonies based on combination of– Textual (regular) keywords– Spatial keywordsTemporal keywords–Temporal keywords• This search engine finds and ranks the most textually, spatially and temporally relevant testimonies (segments) according to – query keywords– query location – query time interval Input• Query Keywords– Set of keywords inputted as text • Query Location–A region drawn on the map ORA region drawn on the map OR– A spatial keyword inputted as text• Query time interval– An interval specified by a time slider OR– An interval inputted as two numbersOutput Tasks1- Data tier– Data Cleansing • Understand / format / standardize the data–Geocoding–Geocoding• Find missing lat/long information for some of spatial keywords– Index Construction• Create inverted files for regular keywords• Create inverted files for spatial keywords8/27/20093Tasks2- Middle tier– Intelligent web-services • Talk to interface – Receive input (query parameters)–Send output (query result)–Send output (query result)• Talk to data tier – Get data– Access index• Perform necessary operations– Process data– Calculates scores– Format the resultsTasks3- Interface (GUI)– User friendly interface to receive input from the user• Textbox for textual keywords• Map interface to draw/show query location– A textbox can be used to input a location’s name•Time slider to specify time interval•Time slider to specify time interval– A textbox can be used to input time interval– Displays the result dynamically and interactively • Results should be changed on-the-fly based on map location and time slider – Provides mechanism to show the testimonies from the interface • Show testimonies on the same page• Link to a new page for showing the testimoniesTasks4- Research/Algorithm– Hybrid index structure• captures spatial and textual keywords (probably using inverted files) as well as temporal keywordsg)py– Relevance ranking function• Formulas for spatial, textual and temporal scores• A combined scoring function with different weights for different featuresExpertise• Database– Sysbase –SQL• Web-services– ASP.NET –servlets / jspservlets / jsp• Interface/GUI–Ajax– Google maps API– XHTML / CSS• Research– Information retrieval– Spatial keyword


View Full Document

USC CSCI 599 - Ali Presentation1

Documents in this Course
Week8_1

Week8_1

22 pages

Week2_b

Week2_b

10 pages

LECT6BW

LECT6BW

20 pages

LECT6BW

LECT6BW

20 pages

5

5

44 pages

12

12

15 pages

16

16

20 pages

Nima

Nima

8 pages

Week1

Week1

38 pages

Week11_c

Week11_c

30 pages

afsin

afsin

5 pages

October5b

October5b

43 pages

Week11_2

Week11_2

20 pages

final

final

2 pages

c-4

c-4

12 pages

0420

0420

3 pages

Week9_b

Week9_b

20 pages

S7Kriegel

S7Kriegel

21 pages

Week4_2

Week4_2

16 pages

sandpres

sandpres

21 pages

Week6_1

Week6_1

20 pages

4

4

33 pages

Week10_c

Week10_c

13 pages

fft

fft

18 pages

LECT7BW

LECT7BW

19 pages

24

24

15 pages

14

14

35 pages

Week9_c

Week9_c

24 pages

Week11_67

Week11_67

22 pages

Week1

Week1

37 pages

LECT3BW

LECT3BW

28 pages

Week8_c2

Week8_c2

19 pages

Week5_1

Week5_1

19 pages

LECT5BW

LECT5BW

24 pages

Week10_b

Week10_b

16 pages

Week11_1

Week11_1

43 pages

Week7_2

Week7_2

15 pages

Week5_b

Week5_b

19 pages

Week11_a

Week11_a

29 pages

LECT14BW

LECT14BW

24 pages

T7kriegel

T7kriegel

21 pages

0413

0413

2 pages

3

3

23 pages

C2-TSE

C2-TSE

16 pages

10_19_99

10_19_99

12 pages

s1and2-v2

s1and2-v2

37 pages

Week10_3

Week10_3

23 pages

jalal

jalal

6 pages

1

1

25 pages

T3Querys

T3Querys

47 pages

CS17

CS17

15 pages

porkaew

porkaew

20 pages

LECT4BW

LECT4BW

21 pages

Week10_1

Week10_1

25 pages

wavelet

wavelet

17 pages

October5a

October5a

22 pages

p289-korn

p289-korn

12 pages

2

2

33 pages

rose

rose

36 pages

9_7_99

9_7_99

18 pages

Week10_2

Week10_2

28 pages

Week7_3

Week7_3

37 pages

Load more
Download Ali Presentation1
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Ali Presentation1 and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Ali Presentation1 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?