Results 1 - 10
of
95
A Visual Digital Library Approach for Time-Oriented Scientific Primary Data
"... Digital Library support for textual and certain types of nontextual documents has significantly advanced over the last years. While Digital Library support implies many aspects along the whole library workflow model, interactive and visual retrieval allowing effective query formulation and result pr ..."
Abstract
-
Cited by 11 (10 self)
- Add to MetaCart
(Show Context)
Digital Library support for textual and certain types of nontextual documents has significantly advanced over the last years. While Digital Library support implies many aspects along the whole library workflow model, interactive and visual retrieval allowing effective query formulation and result presentation are important functions. Recently, new kinds of non-textual documents which merit Digital Library support, but yet cannot be accommodated by existing Digital Library technology, have come into focus. Scientific primary data, as produced for example, by scientific experimentation, earth observation, or simulation, is such a data type. We report on a concept and first implementation of Digital Library functionality, supporting visual retrieval and exploration in a specific important class of scientific primary data, namely, time-oriented data. The approach is developed in an interdisciplinary effort by experts from the library, natural sciences, and visual analytics communities. In addition to presenting the concept and discussing relevant challenges, we present results from a first implementation of our approach as applied on a real-world scientific primary data set.
Beyond digital incunabula: Modeling the next generation of digital libraries
- In Proceedings of the 10th European Conference on Research and Advanced Technology for Digital Libraries (ECDL 2006
, 2006
"... Abstract. This paper describes several incunabular assumptions that impose upon early digital libraries the limitations drawn from print, and argues for a design strategy aimed at providing customization and personalization services that go beyond the limiting models of print distribution, based on ..."
Abstract
-
Cited by 10 (8 self)
- Add to MetaCart
(Show Context)
Abstract. This paper describes several incunabular assumptions that impose upon early digital libraries the limitations drawn from print, and argues for a design strategy aimed at providing customization and personalization services that go beyond the limiting models of print distribution, based on services and experiments developed for the Greco-Roman collections in the Perseus Digital Library. Three features fundamentally characterize a successful digital library design: finer granularity of collection objects,automatedprocesses,anddecentralizedcommunitycontributions. 1
SCOPE: A scientific compound object publishing and editing system. The International Journal of Digital Curation, 3(2), 4 - 18. Retrieved from http://ijdc.net/index.php/ijdc/article/viewFile/84/55
- Proceedings of the 2004 ACM conference on Computer Supported Cooperative work
, 2008
"... This paper presents the SCOPE (Scientific Compound Object Publishing and Editing) system which is designed to enable scientists to easily author, publish and edit scientific compound objects. Scientific compound objects encapsulate the various datasets and resources generated or utilized during a s ..."
Abstract
-
Cited by 10 (4 self)
- Add to MetaCart
(Show Context)
This paper presents the SCOPE (Scientific Compound Object Publishing and Editing) system which is designed to enable scientists to easily author, publish and edit scientific compound objects. Scientific compound objects encapsulate the various datasets and resources generated or utilized during a scientific experiment or discovery process, within a single compound object, for publishing and exchange. The adoption of “named graphs ” to represent these compound objects enables provenance information to be captured via the typed relationships between the components. This approach is also endorsed by the OAI-ORE initiative and hence ensures that we generate OAI-ORE-compliant Scientific Compound Objects. The SCOPE system is an extension of the Provenance Explorer tool – which supports access-controlled viewing of scientific provenance trails. Provenance Explorer provided dynamic rendering of RDF graphs of scientific discovery processes, showing the lineage from raw data to publication. Views of different granularity can be inferred automatically using SWRL (Semantic Web Rules Language) rules and an inferencing engine. SCOPE extends the Provenance Explorer tool and GUI by: 1) Adding an embedded web browser that can be used for incorporating objects discoverable via the Web; 2) Representing compound objects as Named Graphs, that can be saved in RDF, TriX, TriG or as an Atom syndication feed; 3) Enabling scientists to attach Creative Commons Licenses to the compound objects to specify how they may be re-used; 4) Enabling compound objects to be published as Fedora Object XML (FOXML) files within a Fedora digital library.
CiteSeerX - a scalable autonomous scientific digital library
- CONFERENCE ON SCALABLE INFORMATION SYSTEMS
, 2006
"... CiteSeer is a scientific literature digital library and search engine which automatically crawls and indexes scientific documents in the fields of computer and information science. Since it's inception in 1997 CiteSeer has grown to index over 730,000 documents and serves over 800,000 requests d ..."
Abstract
-
Cited by 8 (3 self)
- Add to MetaCart
CiteSeer is a scientific literature digital library and search engine which automatically crawls and indexes scientific documents in the fields of computer and information science. Since it's inception in 1997 CiteSeer has grown to index over 730,000 documents and serves over 800,000 requests daily, pushing the limits of the current system's capabilities. In addition, CiteSeer's monolithic architecture inconveniences system maintenance and reduces the flexibility of the system in terms of new feature development, algorithm updates, and system interoperability. In this paper, we discuss the problems of the current CiteSeer architecture and propose a new architecture for a next generation CiteSeer application. The new architecture is based on modular web services and pluggable service components. Preliminary results based on a prototype system show the new architecture enhances flexibility, scalability, and performance for CiteSeer. In addition, new services in development for the next generation CiteSeer system are discussed.
A Data Model and Architecture for Long-term Preservation
- Proceedings of the 8th ACM/IEEECS Joint Conference on Digital Libraries (JCDL
, 2008
"... The National Geospatial Digital Archive, one of eight initial projects funded under the Library of Congress’s NDIIPP program, has been researching how geospatial data can be preserved on a national scale and be made available to future generations. In this paper we describe an archive architecture t ..."
Abstract
-
Cited by 6 (1 self)
- Add to MetaCart
(Show Context)
The National Geospatial Digital Archive, one of eight initial projects funded under the Library of Congress’s NDIIPP program, has been researching how geospatial data can be preserved on a national scale and be made available to future generations. In this paper we describe an archive architecture that provides a minimal approach to the long-term preservation of digital objects based on co-archiving of object semantics, uniform representation of objects and semantics, explicit storage of all objects and semantics as files, and abstraction of the underlying storage system. This architecture ensures that digital objects can be easily migrated from archive to archive over time and that the objects can, in principle, be made usable again at any point in the future; its primary benefit is that it serves as a fallback strategy against, and as a foundation for, more sophisticated (and costly) preservation strategies. We describe an implementation of this architecture in a protoype archive running at UCSB that also incorporates a suite of ingest and access components.
Epidemic marketplace: an information management system for epidemiological data
- Information Technology in Bio-and Medical Informatics, ITBAM 2010
, 2010
"... Abstract. The Epidemic Marketplace is part of a computational framework for organizing data for epidemic modeling and forecasting. It is a distributed data management platform where epidemiological data can be stored, managed and made available to the scientific community. It includes tools for the ..."
Abstract
-
Cited by 5 (3 self)
- Add to MetaCart
(Show Context)
Abstract. The Epidemic Marketplace is part of a computational framework for organizing data for epidemic modeling and forecasting. It is a distributed data management platform where epidemiological data can be stored, managed and made available to the scientific community. It includes tools for the automatic interaction with other applications through web services, for the collection of epidemiological data from internet social networks and for discussion of related topics. This paper defines its requirements, architecture and implementation plan based on open-source software. This platform will assist epidemiologists and public health scientists in finding, sharing and exchanging data. 1
Representing Aggregate Works in the Digital Library
"... This paper studies the challenge of representing aggregate works such as encyclopaedia, collected poems and journals in digital libraries. Reflecting on materials used by humanities academics, it demonstrates the complex range of aggregate types and the problems of representing this heterogeneity i ..."
Abstract
-
Cited by 5 (1 self)
- Add to MetaCart
(Show Context)
This paper studies the challenge of representing aggregate works such as encyclopaedia, collected poems and journals in digital libraries. Reflecting on materials used by humanities academics, it demonstrates the complex range of aggregate types and the problems of representing this heterogeneity in the digital library interface. We demonstrate that aggregates are complex and pervasive, challenge many common assumptions and confuse the boundaries between organisational levels within the library. The challenge is amplified by concrete examples.
Content-based layouts for exploratory metadata search in scientific research data
- In JCDL. ACM
, 2012
"... tobias.schreck ..."
(Show Context)
oreChem ChemXSeer: a semantic digital library for chemistry
- In Proc. JCDL 2010
, 2010
"... Representing the semantics of unstructured scientific publications will certainly facilitate access and search and hopefully lead to new discoveries. However, current digital libraries are usually limited to classic flat structured metadata even for scientific publications that potentially contain r ..."
Abstract
-
Cited by 4 (2 self)
- Add to MetaCart
(Show Context)
Representing the semantics of unstructured scientific publications will certainly facilitate access and search and hopefully lead to new discoveries. However, current digital libraries are usually limited to classic flat structured metadata even for scientific publications that potentially contain rich semantic metadata. In addition, how to search the scientific literature of linked semantic metadata is an open problem. We have developed a semantic digital library oreChem ChemxSeer that models chemistry papers with semantic metadata. It stores and indexes extracted metadata from a chemistry paper repository ChemxSeer using “compound objects”. We use the Open Archives Initiative Object Reuse and Exchange (OAI-ORE) ) 1 standard to define a compound object that aggregates metadata fields related to a digital object. Aggregated metadata can be managed and retrieved easily as one unit resulting in improved ease-of-use and has the potential to improve the semantic interpretation of shared data. We show how metadata can be extracted from documents and aggregated using OAI-ORE. ORE objects are created on demand; thus, we are able to search for a set of linked metadata with one query. We were also able to model new types of metadata easily. For example, chemists are especially interested in finding information related to experiments in documents. We show how paragraphs containing experiment information in chemistry papers can be extracted and tagged based on a chemistry ontology with 470 classes, and then represented in ORE along with other document-related metadata. Our algorithm uses a classifier with features that are words that are typically only used to describe experiments, such as “apparatus”, “prepare”, etc. Using a dataset comprised of documents from the Royal Society of Chemistry digital library, we show that the our proposed method performs well in extracting experimentrelated paragraphs from chemistry documents.
Evaluation of Semantic and Social Technologies for Digital Libraries
- Semantic Digital Libraries
, 2009
"... ..."
(Show Context)