• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 9,777
Next 10 →

Probabilistic Counting Algorithms for Data Base Applications

by Philippe Flajolet, G. N. Martin, G. Nigel Martin , 1985
"... This paper introduces a class of probabilistic counting lgorithms with which one can estimate the number of distinct elements in a large collection of data (typically a large file stored on disk) in a single pass using only a small additional storage (typically less than a hundred binary words) a ..."
Abstract - Cited by 444 (6 self) - Add to MetaCart
This paper introduces a class of probabilistic counting lgorithms with which one can estimate the number of distinct elements in a large collection of data (typically a large file stored on disk) in a single pass using only a small additional storage (typically less than a hundred binary words

Semantic similarity based on corpus statistics and lexical taxonomy

by Jay J. Jiang, David W. Conrath - Proc of 10th International Conference on Research in Computational Linguistics, ROCLING’97 , 1997
"... This paper presents a new approach for measuring semantic similarity/distance between words and concepts. It combines a lexical taxonomy structure with corpus statistical information so that the semantic distance between nodes in the semantic space constructed by the taxonomy can be better quantifie ..."
Abstract - Cited by 873 (0 self) - Add to MetaCart
quantified with the computational evidence derived from a distributional analysis of corpus data. Specifically, the proposed measure is a combined approach that inherits the edge-based approach of the edge counting scheme, which is then enhanced by the node-based approach of the information content

Pin: building customized program analysis tools with dynamic instrumentation

by Chi-keung Luk, Robert Cohn, Robert Muth, Harish Patil, Artur Klauser, Geoff Lowney, Steven Wallace, Vijay Janapa Reddi, Kim Hazelwood - IN PLDI ’05: PROCEEDINGS OF THE 2005 ACM SIGPLAN CONFERENCE ON PROGRAMMING LANGUAGE DESIGN AND IMPLEMENTATION , 2005
"... Robust and powerful software instrumentation tools are essential for program analysis tasks such as profiling, performance evaluation, and bug detection. To meet this need, we have developed a new instrumentation system called Pin. Our goals are to provide easy-to-use, portable, transparent, and eff ..."
Abstract - Cited by 991 (35 self) - Add to MetaCart
is designed to be architecture independent whenever possible, making Pintools source compatible across different architectures. However, a Pintool can access architecture-specific details when necessary. Instrumentation with Pin is mostly transparent as the application and Pintool observe the application’s

Randomized Experiments from Non-random Selection in the U.S. House Elections

by David S. Lee - Journal of Econometrics , 2008
"... This paper establishes the relatively weak conditions under which causal inferences from a regression-discontinuity (RD) analysis can be as credible as those from a randomized experiment, and hence under which the validity of the RD design can be tested by examining whether or not there is a discont ..."
Abstract - Cited by 377 (17 self) - Add to MetaCart
discontinuity in any pre-determined (or “baseline”) variables at the RD threshold. Specifically, consider a standard treatment evaluation problem in which treatment is assigned to an individual if and only if V> v0, but where v0 is a known threshold, and V is observable. V can depend on the individual’s

Systematic social observation of public spaces: A new look at disorder in urban neighborhoods

by Robert J. Sampson, Stephen W. Raudenbush - American Journal of Sociology , 1999
"... This article assesses the sources and consequences of public disorder. Based on the videotaping and systematic rating of more than 23,000 street segments in Chicago, highly reliable scales of social and physi-cal disorder for 196 neighborhoods are constructed. Census data, police records, and an ind ..."
Abstract - Cited by 253 (9 self) - Add to MetaCart
, and an independent survey of more than 3,500 resi-dents are then integrated to test a theory of collective efficacy and structural constraints. Defined as cohesion among residents com-bined with shared expectations for the social control of public space, collective efficacy explains lower rates of crime and observed

A large-scale study of the evolution of web pages

by Dennis Fetterly, Mark Manasse, Marc Najork, Janet L. Wiener - In Proceedings of the 12th International World Wide Web Conference , 2003
"... How fast does the web change? Does most of the content remain unchanged once it has been authored, or are the documents continuously updated? Do pages change a little or a lot? Is the extent of change correlated to any other property of the page? All of these questions are of interest to those who m ..."
Abstract - Cited by 241 (5 self) - Add to MetaCart
mine the web, including all the popular search engines, but few studies have been performed to date to answer them. One notable exception is a study by Cho and Garcia-Molina, who crawled a set of 720,000 pages on a daily basis over four months, and counted pages as having changed if their MD5 checksum

Bayesian inference for generalized stochastic popula- tion growth models with application to aphids

by Colin S. Gillespie, Andrew Golightly
"... Summary. In this paper we analyse the effects of various treatments on cotton aphids (Aphis gossypii). The standard analysis of count data on cotton aphids determines parameter val-ues by assuming a deterministic growth model and combines these with the corresponding stochastic model to make predict ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
predictions on population sizes, depending on treatment. Here, we use an integrated stochastic model to capture the intrinsic stochasticity, of both observed aphid counts and unobserved cumulative population size for all treatment combinations simultane-ously. Unlike previous approaches, this allows us

The importance of shape in early lexical learning

by B. Smith, S. Jones - Cognitive Development , 1988
"... We ask if certain dimensions of perceptual similarity are weighted more heavily than others in determining word extension. The specific dimensions examined were shape, size, and texture. In four experiments, subjects were asked either to extend a novel count noun to new instances or, in a nonword cl ..."
Abstract - Cited by 235 (31 self) - Add to MetaCart
We ask if certain dimensions of perceptual similarity are weighted more heavily than others in determining word extension. The specific dimensions examined were shape, size, and texture. In four experiments, subjects were asked either to extend a novel count noun to new instances or, in a nonword

Hop-count filtering: an effective defense against spoofed DDoS traffic

by Cheng Jin, Haining Wang , 2003
"... IP spoofing has been exploited by Distributed Denial of Service (DDoS) attacks to (1) conceal flooding sources and localities in flooding traffic, and (2) coax legitimate hosts into becoming reflectors, redirecting and amplifying flooding traffic. Thus, the ability to filter spoofed IP packets near ..."
Abstract - Cited by 187 (4 self) - Add to MetaCart
from the Time-to-Live (TTL) value in the IP header. Using a mapping between IP addresses and their hop-counts to an Internet server, the server can distinguish spoofed IP packets from legitimate ones. Base on this observation, we present a novel filtering technique that is immediately deployable

Model-based Geostatistics

by P.J. Diggle, R. A. Moyeed, J. A. Tawn - Applied Statistics , 1998
"... Conventional geostatistical methodology solves the problem of predicting the realised value of a linear functional of a Gaussian spatial stochastic process, S(x), based on observations Y i = S(x i ) + Z i at sampling locations x i , where the Z i are mutually independent, zero-mean Gaussian random v ..."
Abstract - Cited by 228 (9 self) - Add to MetaCart
Conventional geostatistical methodology solves the problem of predicting the realised value of a linear functional of a Gaussian spatial stochastic process, S(x), based on observations Y i = S(x i ) + Z i at sampling locations x i , where the Z i are mutually independent, zero-mean Gaussian random
Next 10 →
Results 1 - 10 of 9,777
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University