Digital Libraries
Agenda Digital Libraries Information retrieval concepts Search and browsing on unstructured data sources Digital libraries applications
What is Library Collection of books, documents, newspapers, audio visual materials kept and organized for people to read or borrow Characteristics 1. Collection of data objects 2. Collection of Metadata Structures 3. Collection of Services 4. Domain Focus 5. Quality Control 6. Preservation
Definitions The first research-oriented definition belongs to Christine Borgman (1993) A National Electronic Library is 1. A services; 2. An architecture; 3. A set of information resources, databases of text, numbers, graphics, sound, video, etc. And 4. A set of tools and capabilities to locate, retrieve and utilize the information resources available
Definitions Elements according Bishop and Star (1996) 1. Some sense of a collection, with some kind of organization 2. A collection that is not entirely bibliographic or exclusively a set of pointers to other material, it must contain some full-form online material, and may be in a variety of formats; and 3. A goal exist to link audience, group, action, patron, or community with attributes of the collection"
Definitions A practical digital libraries (Lesk, 1997) A digital library is a collection of information which is both digitized and organized
Digital Library Federation definition (1998) DLs are organizations that provide the resources, including the specialized staff to select, structure, offer intellectual access to interpret, distribute, preserve the integrity of and ensure the persistence over time of collections of digital works so that they are readily and economically available for use by a defined community or set of communities
Definitions Four dimensions according to Marchionini and Fox (1999) The community dimension Technology as the engine moving the DL Services constitute the central focus of DLs digital reference services, real time question answering, information literacy Content
The 5S Model Streams The flow of information in various formats Structures Organizational aspects of the DL Spaces Views of components; real or abstract images Scenarios Services and behaviors Societies Communities and relationships among them
Model Primitives Formalisms Objectives Stream Text; video, audio, software program Sequences, types Describes properties of the DL content, encoding and textual material or particular forms of multimedia data. Structure Collection, catalog; hypertext; document; metadata; organizational tools Graphs; nodes; links; labels; hierarchies Specifies organizational aspects of the DL content Space User Interface; index; retrieval model Sets; operations; vector space; measure space; probability space Defines logical and presentational views of several DL components Scenarios Service, event; condition; action Sequence diagrams; collaboration diagrams Details the behavior of DL services Societies Community; managers; actors; classes; relationships; attributes; operators Object-oriented modeling constructs; design patterns Defines managers responsible for running DL services; actors that use those services, and relationships among them
Applying the model, informally Personal Photos; Movie, TV, media Stream - what types of data? Gif, jpg, avi? Structure - How are the elements organized? Is there a hierarchy? Are there multiple structures? Spaces - How would you index the items? How would you divide them into related groups Scenarios - what services would you provide? What information do we need to provide those services? Societies - who is the library intended to serve? Remember to include agents and other processes as well as users
Digital Library Definitions Collection of electronic resources that provides direct/indirect access to a systematically organized collection of digital objects. Hybrid Library Provides services in a mixed-mode, electronic and paper, environment, particularly in a co-coordinated way. Derived from a strand of elib which explored the issues surrounding the retrieval and delivery of information in these types of environment but also investigated the integration of different electronic services so that single search approach could be offered to the End user. Virtual Library Access to electronic information in a variety of remote locations through a local online catalogue or other gateway, such as the internet
Building block of digital libraries Digital Libraries Computing Networking Content Collections Services Community
Content Types
Early projects
Types of DLs
History H. G. Wells (1938): the notion of a World Encyclopedia Vennever Bush (1945): Memex Douglas Engelbart (1963): electronic technology, the first mouse and hypertext technology Licklider s (1965) vision: Library of the Future Invention of Internet (1983) Tim Berners Lee (1989): World Wide Web, hypertext Digital Library Initiative I (1992-1998) Digital Library Initiative II (1999-2002)
Memex by Vennever Bush
Goal for the digital libraries To Dramatically advance the means to select, store, organize, and use widely distributed knowledge resources containing diverse types of information and content stored in a variety of electronic forms Digital library Initiative Interagency Coordinating Committee (1996)
Why Digital Libraries (DLs)? DL brings library to the user Ubiquitous anytime, anywhere access to information Open access paradigm (democracy, education, etc.) Increasing cost of buildings and storages (preservation issues) Variety of materials Budget pressures
Various DL perspectives Library and information science community Computer science community Politicians and Governments Publishers Teachers Archivists Commercial enterprises
LIS community Views a DL as an institution rather than a machine DLs are Libraries without walls A logical extension of what libraries have been doingacquiring, organizaing, and disseminating information Enhancing resources, services and audiences of libraries
Computer science community Views DLs as an extension of networking technologies DLs are an extension of databases and information retrieval systems For the database community, DLs are large databases
Politicians and Governments Can DLs bridge the digital divide between information rich and information poor? Many believe that DLs can bring equity of access Governments try to develop DLs to provide access to information for all citizens
Publishers DLs as new opportunities Production and distribution of information Licensing issues Competitiveness Paradigm shift
Teachers and educators New learning resources Variety of educational materials Virtual learning and e-learning environments Tearchers/educators/students ad content producers
Archivists View DLs as a means of preserving national and cultural heritage DLs can be used to enhance access to fragile materials
Commercial enterprises View DLs as a new market Database vendors and brokers Pricing and economic models
United Nations Digital Library
Characteristics of Digital Library 1. Universal and unlimited access 2. Contents will be in digital format which can be utilized only with the help of computer 3. Contents in textual, image, sound and video form 4. Collection to connection
Principles of Digital Library Development 1. Expect Change 2. Know your contents 3. Involve the right people 4. Design usable systems 5. Ensure open access 6. Be(a)ware of data rights 7. Automate whenever possible 8. Adopt and adhere to standards 9. Ensure quality 10. Be concerned about persistence
Components of Digital Library
Types of Digital Libraries 1. Stand-alone Digital Library (SDL) 2. Federated Digital Library (FDL) 3. Harvested Digital Library (HDL)
The ACM Digital Library
IEEE Computer Society DL
Federated Digital Library (FDL) This is a federation of several independent SDLs in the network, organized around a common theme, and coupled together on the network. A FDL composes several autonomous SDLs that form a networked library with a transparent user interface. The different SDLs are heterogeneous and are connected via communication networks.
Networked Digital Library of Theses & Dissertation
Bibliographic Navigation Tools for Digital Libraries SCOPUS ELIN Knowledge Cite Library Database Advisor OCLS FirstSearch
Harvested Digital Library (HDL) This is a virtual library providing summarized access to related material scattered over the network. Examples of HDLs are the Internet Public Library (IPL) 1. A HDL holds only metadata with pointers to the holdings that are "one click away" in Cyberspace. 2. Developed by Library Professionals, or Computer Scientists
Community Four Corner Stones of Digital Library Communication technologies Computer Content
Community 1. Library Professionals 2. Library Users 3. IT Professionals 4. Vendors
Communication Technologies Communication Networks Web Servers, Bandwidth, Local Area Network, Internet and software Network standards 1. The Digital Object Identifier; http://www.doi.org/ 2. Hypertext Transfer Protocol (HTTP); http://www.w3.org/protocols/ 3. Persistent URL Home Page; http://purl.oclc.org/ 4. Z39.50; http://www.loc.gov/z3950/agency/
Contents Images.BMP.TIF.GIF.PNG.WMF.PICT.PCD.EPS.EMF.CGM.TGA.JPG Animation.ANI.FLI.FLC Video.AVI.MOV.MPG.QT
Contents Audio.WAV.MID.SND.AUD.mp3 Web Page.HTM.HTML.DHTML.HTMLS.XML Text.DOC.TXT.RTF.PDF Programs.COM.EXE
Contents Markup standards 1. Hypertext Markup Language (HTML); http://www.w3.org/markup/ 2. Extensible Markup Language (XML); http://www.w3.org/xml/ 3. Standard Generalized Markup Language (SGML); http://www.w3.org/markup/sgml/
Contents Metadata standards 1. Dublin Core; http://dublincore.org/ 2. MARC 21; http://icweb.loc.gov/marc/ 3. Encoded Archival Description (EAD); http://icweb.loc.gov/ead/
Computer Personal Computer Server Machines
Digital Library Conceptual Models
PakLAG Digital Library Model Relational Database (MySQL) Stores Metadata Web Server (Tomcat)/ Digital Object Repository Web Browser (IE, Netscape etc.) User User
Myths 1. The internet is the digital library 2. Fully automated library is digital library 3. The myth of a single digital library or onewindow view of digital library collections 4. Digital libraries will provide more equitable access, anywhere, any time 5. Digital libraries will be cheaper than print libraries
Challenges 1. Interoperability Technical, process, language, ( Different platforms etc) 2. Building digital collections 3. Metadata 2.a Digitization 2.b Acquisition of original digital works 2.c Access to external materials 4. Naming, identifiers, and persistence 4.a Uniform Resource Locator (URL) 4.b Persistent Uniform Resource Locator (PURL) 4.c Uniform Resource Name (URN) 4.d Digital Object Identifier
5. Copyright / rights management 5.a Usage tracking 5.b Identifying and authenticating users 5.c Providing the copyright status of each digital object, and the restrictions on its use or the fees associated with it 6. Preservation Challenges 6.a Preservation of the storage medium 6.b Preservation of access to content
Conclusion The concepts of Digital and Virtual libraries are actually services added to the traditional libraries wherein the resources which can only be utilized with computers. We hope digital libraries will coexist with traditional libraries whether within them or as separate entities. However the role of professionals will be drastically changed in digital environment. Digital libraries have customers instead of users and provide pull and push information delivery methods.
References Introduction to Digital Libraries http://www.powershow.com/view/b1cd3- NzcyZ/Introduction_to_Digital_libraries_History_ Definitions_and_Types_powerpoint_ppt_presentati on
Fin