Results 11 - 20 of 5,584
D³: data-driven documents
- IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS
, 2011
"... Data-Driven Documents (D3) is a novel representation-transparent approach to visualization for the web. Rather than hide the underlying scenegraph within a toolkit-specific abstraction, D³ enables direct inspection and manipulation of a native representation: the standard document object model (DO ..."
Cited by 209 (11 self)
(DOM). With D3, designers selectively bind input data to arbitrary document elements, applying dynamic transforms to both generate and modify content. We show how representational transparency improves expressiveness and better integrates with developer tools than prior approaches, while offering
Self-taught learning: Transfer learning from unlabeled data
- Proceedings of the Twenty-fourth International Conference on Machine Learning
, 2007
"... We present a new machine learning framework called “self-taught learning” for using unlabeled data in supervised classification tasks. We do not assume that the unlabeled data follows the same class labels or generative distribution as the labeled data. Thus, we would like to use a large number of ..."
Cited by 299 (20 self)
of unlabeled images (or audio samples, or text documents) randomly downloaded from the Internet to improve performance on a given image (or audio, or text) classification task. Such unlabeled data is significantly easier to obtain than in typical semi-supervised or transfer learning settings, making self-taught
A comparison of classifiers and document representations for the routing problem
- ANNUAL ACM CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL - ACM SIGIR
, 1995
"... In this paper, we compare learning techniques based on statistical classification to traditional methods of relevance feedback for the document routing problem. We consider three classification techniques which have decision rules that are derived via explicit error minimization: linear discriminant ..."
Cited by 196 (2 self)
Best practices for convolutional neural networks applied to visual document analysis
- In: Int’l Conference on Document Analysis and Recognition
, 2003
"... Neural networks are a powerful technology for classification of visual inputs arising from documents. However, there is a confusing plethora of different neural network methods that are used in the literature and in industry. This paper describes a set of concrete best practices that document analys ..."
Cited by 201 (7 self)
H.264/14496-10 AVC Reference Software Manual (Status: Input Document to JVT; Purpose: Proposed Amended Draft)
, 2009
"... Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG ..."
Job Destruction and Propagation of Shocks
- American Economic Review
"... This paper considers propagation of aggregate shocks in a dynamic general-equilibrium model with labor-market matching and endogenous job destruction. Cyclical fluctuations in the job-destruction rate magnify the output effects of shocks, as well as making them much more persistent. Interactions bet ..."
Cited by 219 (10 self)
(JEL E24, E32) It has been well documented that the cyclical adjustment of labor input chiefly represents movement of workers into and out of employment, rather than adjustment of hours at given jobs. Thus, in understanding business cycles, it is centrally important to understand the formation
Document analysis system
- IBM Journal of Research and Development
, 1982
"... This paper outlines the requirements and components for a proposed Document Analysis System, which assists a user in encoding printed documents for computer processing. Several critical functions have been investigated and the technical approaches are discussed. The first is the segmentation and cla ..."
Cited by 128 (0 self)
is an adaptive approach to the recognition of the hundreds of font styles and sizes that can occur on printed documents. A preclassifier is constructed during the input process and used to speed up a well-known pattern-matching method for clustering characters from an arbitrary print source into a small sample
The value of prior knowledge in discovering motifs with MEME
, 1995
"... MEME is a tool for discovering motifs in sets of protein or DNA sequences. This paper describes several extensions to MEME which increase its ability to find motifs in a totally unsupervised fashion, but which also allow it to benefit when prior knowledge is available. When no background knowledge is ..."
Cited by 211 (10 self)
is asserted, MEME obtains increased robustness from a method for determining motif widths automatically, and from probabilistic models that allow motifs to be absent in some input sequences. On the other hand, MEME can exploit prior knowledge about a motif being present in all input sequences, about the length
NoDoSE - A tool for Semi-Automatically Extracting Structured and Semistructured Data from Text Documents.
- SIGMOD Record
, 1998
"... Often interesting structured or semistructured data is not in database systems but in HTML pages, text files, or on paper. The data in these formats is not usable by standard query processing engines and hence users need a way of extracting data from these sources into a DBMS or of writing wrappers ..."
Cited by 168 (2 self)
interesting regions and then describing their semantics. This task is expedited by a mining component that attempts to infer the grammar of the file from the information the user has input so far. Once the format of a document has been determined, its data can be extracted into a number of useful forms
Projecting XML Documents
- In Proceedings of the 29th VLDB Conference
, 2003
"... XQuery is not only useful to query XML in databases, but also to applications that must process XML documents as files or streams. These applications suffer from the limitations of current main-memory XQuery processors which break for rather small documents. In this paper we propose techniques, ba ..."
Cited by 89 (6 self)
, based on a notion of projection for XML, which can be used to drastically reduce memory requirements in XQuery processors. The main contribution of the paper is a static analysis technique that can identify at compile time which parts of the input document are needed to answer an arbitrary XQuery. We