Analyzing the dynamics of research by extracting key aspects of scientific papers
- In Proceedings of IJCNLP, 2011
Cited by 9 (1 self)
We present a method for characterizing a research work in terms of its focus, domain of application, and techniques used. We show how tracing these aspects over time provides a novel measure of the influence of research communities on each other. We extract these characteristics by matching semantic extraction patterns, learned using bootstrapping, to the dependency trees of sentences in an article’s abstract. We combine this information with pre-calculated article-to-community assignments to study the influence of a community on others in terms of techniques borrowed and the ‘maturing’ of some communities to solve other problems. As a case study, we show how the computational linguistics community and its sub-fields have changed over the years with respect to their foci, methods used, and domain problems. For instance, we show that part-of-speech tagging and parsing have increasingly been adopted as tools for solving problems in other domains. We also observe that speech recognition and probability theory have had the most seminal influence.
CINET: A CyberInfrastructure for Network Science
Cited by 1 (0 self)
Networks are an effective abstraction for representing real systems. Consequently, network science is increasingly used in academia and industry to solve problems in many fields. Computations that determine structural properties and dynamical behaviors of networks are useful because they give insights into the characteristics of real systems. We introduce a newly built and deployed cyberinfrastructure for network science (CINET) that performs such computations, with the following features: (i) it offers realistic networks from the literature and various random and deterministic network generators; (ii) it provides many algorithmic modules and measures to study and characterize networks; (iii) it is designed for efficient execution of complex algorithms on distributed high performance computers so that they scale to large networks; and (iv) it is hosted with web interfaces so that those without direct access to high performance computing resources and those who are not computing experts can still reap the system's benefits. It is a combination of application design and cyberinfrastructure that makes these features possible. To our knowledge, these capabilities collectively make CINET novel. We describe the system and illustrative use cases, with a focus on the CINET user.
NMRexSeer: Metadata Extraction and Search for Large Scale Nuclear Magnetic Resonance (NMR) Experimental Data
The sciences have become both complex and demanding of cutting-edge technology and resources to perform experiments. Since 1997, the Environmental Molecular Sciences Laboratory (EMSL) has served as a user facility housing resources for global scientists to perform experiments necessary to their research. Over time, the generated data has become both massive and redundant. To encourage better management and reuse of such experimental data, MyEMSL has emerged as an in-house centralized data management tool that collects and distributes data from the experiments at EMSL. Nuclear Magnetic Resonance Spectroscopy (NMR) is one of the major experiment resources that EMSL houses. We discuss NMRexSeer, a proposed digital library system that automatically extracts and indexes NMR-specific metadata from NMR experimental data packages. The system also generates visualized previews and provides a search interface for easy access and discovery of desired data.
University
We report on the Gaussian file search system designed as part of the ChemXSeer digital library. Gaussian files are produced by the Gaussian software [4], a software package used for calculating molecular electronic structure and properties. The output files are semi-structured, allowing relatively easy access to the Gaussian attributes and metadata. Our system is currently capable of searching Gaussian documents using a boolean combination of atoms (chemical elements) and attributes. We have also implemented a faceted browsing feature on three important Gaussian attribute types: Basis Set, Job Type, and Method Used. The faceted browsing feature enables a user to view and process a smaller, filtered subset of documents.
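The boolean search and faceted browsing described in this abstract can be illustrated with a minimal Python sketch. This is not the ChemXSeer implementation: the `GaussianDoc` record, the `search` and `facet_counts` helpers, and the three sample documents are all hypothetical, and the sketch assumes the atoms and the three facet attributes have already been parsed out of each Gaussian output file.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class GaussianDoc:
    """Hypothetical record for one already-parsed Gaussian output file."""
    doc_id: str
    atoms: frozenset
    basis_set: str
    job_type: str
    method: str


def search(docs, required_atoms=(), excluded_atoms=(), **attributes):
    """Boolean query: keep docs that contain ALL required atoms, NONE of
    the excluded atoms, and match every attribute given as a keyword."""
    required = set(required_atoms)
    excluded = set(excluded_atoms)
    hits = []
    for doc in docs:
        if not required <= doc.atoms:      # AND over required atoms
            continue
        if excluded & doc.atoms:           # NOT over excluded atoms
            continue
        if any(getattr(doc, name) != value
               for name, value in attributes.items()):
            continue
        hits.append(doc)
    return hits


def facet_counts(docs):
    """Count documents per value for each of the three facets."""
    return {
        facet: Counter(getattr(doc, facet) for doc in docs)
        for facet in ("basis_set", "job_type", "method")
    }


# Made-up sample records, for illustration only.
docs = [
    GaussianDoc("d1", frozenset({"C", "H", "O"}), "6-31G(d)", "opt", "B3LYP"),
    GaussianDoc("d2", frozenset({"C", "H"}), "6-31G(d)", "freq", "HF"),
    GaussianDoc("d3", frozenset({"C", "H", "N"}), "cc-pVDZ", "opt", "B3LYP"),
]

# Documents containing C and H but not N.
hits = search(docs, required_atoms={"C", "H"}, excluded_atoms={"N"})

# Facet counts over the current result set, as a browsing sidebar might show.
facets = facet_counts(hits)

# Selecting a facet value narrows the result set further.
narrowed = search(hits, job_type="opt")
```

Computing the facet counts over the current hits rather than over the whole collection is what lets each facet selection shrink the result set step by step; a production system would more likely delegate both steps to a search engine's index than scan records in memory.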