Paper: Finding Clauses In Unrestricted Text By Finitary And Stochastic Methods

Webmaster's Note: The whole dataset is available Here. Please download the dataset instead of crawling the website.

Basic Info:

id: A88-1030
title: Finding Clauses In Unrestricted Text By Finitary And Stochastic Methods
authors: Ejerhed, Eva I. (AT&T Bell Labs, Murray Hill NJ; University of Umea, Umea Sweden)
venue: ANLP
year: 1988
pdf: link


Abstract


The paper presents and compares two different methods of parsing, a regular expression method and a stochastic method, with respect to their success in identifying basic clauses in unrestricted English text. These methods of parsing were developed in order to be applied to the task of improving the detection of large prosodic units in the Bell Labs text-to-speech system, and were so applied experimentally. The paper also discusses the notion of basic clause that was defined as the parsing target. The result of a comparison of the error rates of the two parsing methods in the recognition of basic clauses showed that there was a 13% error rate for the regular expression method and a 6.5% error rate for the stochastic method.








Top Similar Papers
By Title
ID Title
P99-1020A Method For Word Sense Disambiguation Of Unrestricted Text
A88-1019A Stochastic Parts Program And Noun Phrase Parser For Unrestricted Text
W98-1120A Decision Tree Method For Finding And Classifying Names In Japanese Texts
W98-1117A Maximum-Entropy Partial Parser For Unrestricted Text
A92-1027An Efficient Chart-Based Algorithm For Partial-Parsing Of Unrestricted Texts
C92-1060An Algorithm For Estimating The Parameters Of Unrestricted Hidden Stochastic Context-Free Grammars
P97-1013The Rhetorical Parsing Of Unrestricted Natural Language Texts
J00-3005The Rhetorical Parsing Of Unrestricted Texts: A Surface-Based Approach
W01-0724Memory-Based Clause Identification
W97-0105Probabilistic Parsing Of Unrestricted English Text With A Highly-Detailed Grammar


By Abstract
ID Title
W01-1405Stochastic Modelling: From Pattern Classification To Language Translation
M98-1005University Of Durham: Description Of The LOLITA System As Used In MUC-7
P98-2196Recognizing Syntactic Errors In The Writing Of Second Language Learners
J97-2002Adaptive Multilingual Sentence Boundary Disambiguation
H89-2077White Paper On Spoken Language Systems
P98-1107A Method For Correcting Errors In Speech Recognition Using The Statistical Features Of Character Co-Occurrence
C98-1103A Method for Correcting Errors in Speech Recognition Using the Statistical Features of Character Co-occurrence
C02-2025The LinGO Redwoods Treebank: Motivation And Preliminary Applications
H92-1015Speech Understanding In Open Tasks
C92-1031The Proper Treatment Of Word Order In HPSG


By Full Text
ID Title
J02-3003Toward An Aposynthesis Of Topic Continuity And Intrasentential Anaphora
E87-1045Fragmentation And Part Of Speech Disambiguation
E06-3003An Approach To Summarizing Short Stories
P85-1039The Use Of Syntactic Clues In Discourse Processing
C80-1024Efficiency Tools In The Speeches Of Martin Luther King Jr.
C00-2104Experiments In Automated Lexicon Building For Text Searching
C69-4401Syntactic Patterns In A Sample Of Technical English
T87-1043Limits On The Human Sentence Generator
P84-1085A Syntactic Approach To Discourse Semantics
W03-0505Summarising Legal Texts: Sentential Tense And Argumentative Roles


By Co-citation
ID Title Num Co-citations
A88-1019A Stochastic Parts Program And Noun Phrase Parser For Unrestricted Text 3
W93-0306NPtool A Detector Of English Noun Phrases 2
J93-2004Building A Large Annotated Corpus Of English: The Penn Treebank 2
C92-3150Surface Grammatical Analysis For The Extraction Of Terminological Noun Phrases 2
W95-0107Text Chunking Using Transformation-Based Learning 2
W00-0726Introduction To The CoNLL-2000 Shared Task: Chunking 1
P98-1081Improving Data Driven Wordclass Tagging By System Combination 1
W01-0721Boosted Decision Graphs For NLP Learning Tasks 1
W01-0722Clause Identification With Long Short-Term Memory 1
W01-0723Using ALLiS For Clausing 1


Copyright © Univ. of Mich. and the CLAIR Group at the Univ. of Mich.
All information provided herein should be considered tentative and still under construction. Further analysis and correction is still being performed. Please remember that all statistics contained herein are the results of independent research and should not be considered a statement of fact regarding any of the papers, authors, or other entities they refer to.