BEGIN:VCALENDAR VERSION:2.0 PRODID:-//ChamberMaster//Event Calendar 2.0//EN METHOD:PUBLISH X-PUBLISHED-TTL:P3D REFRESH-INTERVAL:P3D CALSCALE:GREGORIAN BEGIN:VTIMEZONE TZID:America/New_York BEGIN:DAYLIGHT RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=+2SU DTSTART:20070101T000000 TZOFFSETFROM:-0500 TZOFFSETTO:-0400 TZNAME:Eastern Daylight Time END:DAYLIGHT BEGIN:STANDARD RRULE:FREQ=YEARLY;BYMONTH=11;BYDAY=+1SU DTSTART:99991231T000000 TZOFFSETFROM:-0400 TZOFFSETTO:-0500 TZNAME:Eastern Standard Time END:STANDARD END:VTIMEZONE BEGIN:VEVENT DTSTART;TZID=America/New_York:20200730T160000 DTEND;TZID=America/New_York:20200730T170000 X-MICROSOFT-CDO-ALLDAYEVENT:FALSE SUMMARY:Identifying Related and Same-Work Relationships in Large Digital Libraries DESCRIPTION:The rapid growth of scanned-work digital libraries presents a new opportunity for learning more about our collections. With digital access to text inside the books of a collection\, content-based text mining methods can be leveraged to learn more about the relationships between works\, helping correct inaccurate metadata\, suggest classification information\, recommend similar works\, and label the nature of links between works. This talk will introduce the Similarities and Duplication in Digital Libraries project\, SaDDL\, a project identifying same-work relationships among the 17 million works seen in the HathiTrust Digital Library. SaDDL is identifying exact duplicates as well as traditionally difficult-to-identify relationships such as derivatives\, different editions\, abridgments\, and whole or part relationships. We present the challenges of the problem\, our project's approach to meeting them\, and a new dataset for cataloguers and scholars to apply our outcomes. X-ALT-DESC;FMTTYPE=text/html:

The rapid growth of scanned-work digital libraries presents a new opportunity for learning more about our collections. With digital access to text inside the books of a collection\, content-based text mining methods can be leveraged to learn more about the relationships between works\, helping correct inaccurate metadata\, suggest classification information\, recommend similar works\, and label the nature of links between works.

This talk will introduce the Similarities and Duplication in Digital Libraries project\, SaDDL\, a project identifying same-work relationships among the 17 million works seen in the HathiTrust Digital Library. SaDDL is identifying exact duplicates as well as traditionally difficult-to-identify relationships such as derivatives\, different editions\, abridgments\, and whole or part relationships. We present the challenges of the problem\, our project's approach to meeting them\, and a new dataset for cataloguers and scholars to apply our outcomes.

LOCATION: UID:e.1224.200016 SEQUENCE:3 DTSTAMP:20200806T062813Z URL:https://members.asist.org/events/Details/identifying-related-and-same-work-relationships-in-large-digital-libraries-219153?sourceTypeId=Hub END:VEVENT END:VCALENDAR