Author: Marie Francine Moens
Title: Automatic Indexing and Abstracting of Document Texts
Publisher: Kluwer Academic Publishers
ISBN: 9780306470172
Edition: 1
Price: CHF 113.60
Category: Natural sciences
Language: English
Pages: 284
Copy protection: DRM
Devices: PC/MAC/eReader/Tablet
Format: PDF
Automatic Indexing and Abstracting of Document Texts summarizes the latest techniques of automatic indexing and abstracting and the results of their application. It places these techniques in the context of the study of text, manual indexing and abstracting, and the use of indexing descriptions and abstracts in systems that select documents or information from large collections. Important sections of the book consider the development of new techniques for indexing and abstracting: the use of text grammars, the learning of the themes of texts (including the identification of representative sentences or paragraphs by means of suitable clustering algorithms), and the learning of classification patterns of texts. The book also attempts to illuminate new avenues for future research. Automatic Indexing and Abstracting of Document Texts is an excellent reference for researchers and professionals working in content management and information retrieval.
CONTENTS
PREFACE
ACKNOWLEDGEMENTS
PART I THE INDEXING AND ABSTRACTING ENVIRONMENT
Chapter 1 THE NEED FOR INDEXING AND ABSTRACTING TEXTS
1. INTRODUCTION
2. ELECTRONIC DOCUMENTS
3. COMMUNICATION THROUGH NATURAL LANGUAGE TEXT
4. UNDERSTANDING OF NATURAL LANGUAGE TEXT: THE COGNITIVE PROCESS
5. UNDERSTANDING OF NATURAL LANGUAGE TEXT: THE AUTOMATED PROCESS
6. IMPORTANT CONCEPTS IN INFORMATION RETRIEVAL AND SELECTION
7. GENERAL SOLUTIONS TO THE INFORMATION RETRIEVAL PROBLEM
8. THE NEED FOR BETTER AUTOMATIC INDEXING AND ABSTRACTING TECHNIQUES
Chapter 2 THE ATTRIBUTES OF TEXT
1. INTRODUCTION
2. THE STUDY OF TEXT
3. AN OVERVIEW OF SOME COMMON TEXT TYPES
4. TEXT DESCRIBED AT A MICRO LEVEL
5. TEXT DESCRIBED AT A MACRO LEVEL
6. CONCLUSIONS
Chapter 3 TEXT REPRESENTATIONS AND THEIR USE
1. INTRODUCTION
2. DEFINITIONS
3. REPRESENTATIONS THAT CHARACTERIZE THE CONTENT OF TEXT
3.1 Set of Natural Language Index Terms
4. INTELLECTUAL INDEXING AND ABSTRACTING
4.1 General
5. USE OF THE TEXT REPRESENTATIONS
6. A NOTE ABOUT THE STORAGE OF TEXT REPRESENTATIONS
7. CHARACTERISTICS OF GOOD TEXT REPRESENTATIONS
8. CONCLUSIONS
PART II METHODS OF AUTOMATIC INDEXING AND ABSTRACTING
Chapter 4 AUTOMATIC INDEXING: THE SELECTION OF NATURAL LANGUAGE INDEX TERMS
1. INTRODUCTION
2. A NOTE ABOUT EVALUATION
3. LEXICAL ANALYSIS
4. USE OF A STOPLIST
5. STEMMING
6. THE SELECTION OF PHRASES
7. INDEX TERM WEIGHTING
8. ALTERNATIVE PROCEDURES FOR SELECTING INDEX TERMS
9. SELECTION OF NATURAL LANGUAGE INDEX TERMS: ACCOMPLISHMENTS AND PROBLEMS
10. CONCLUSIONS
Chapter 5 AUTOMATIC INDEXING: THE ASSIGNMENT OF CONTROLLED LANGUAGE INDEX TERMS
1. INTRODUCTION
2. A NOTE ABOUT EVALUATION
3. THESAURUS TERMS
4. SUBJECT AND CLASSIFICATION CODES
5. LEARNING APPROACHES TO TEXT CATEGORIZATION
6. ASSIGNMENT OF CONTROLLED LANGUAGE INDEX TERMS: ACCOMPLISHMENTS AND PROBLEMS
7. CONCLUSIONS
Chapter 6 AUTOMATIC ABSTRACTING: THE CREATION OF TEXT SUMMARIES
1. INTRODUCTION
2. A NOTE ABOUT EVALUATION
3. THE TEXT ANALYSIS STEP
4. THE TRANSFORMATION STEP
4.1 Selection and Generalization of the Content
5. GENERATION OF THE ABSTRACT
6. TEXT ABSTRACTING: ACCOMPLISHMENTS AND PROBLEMS
7. CONCLUSIONS
PART III APPLICATIONS
Chapter 7 TEXT STRUCTURING AND CATEGORIZATION WHEN SUMMARIZING LEGAL CASES
1. INTRODUCTION
2. TEXT CORPUS AND OUTPUT OF THE SYSTEM
3. METHODS: THE USE OF A TEXT GRAMMAR
4. RESULTS AND DISCUSSION
5. CONTRIBUTIONS OF THE RESEARCH
6. CONCLUSIONS
Chapter 8 CLUSTERING OF PARAGRAPHS WHEN SUMMARIZING LEGAL CASES
1. INTRODUCTION
2. TEXT CORPUS AND OUTPUT OF THE SYSTEM
3. METHODS: THE CLUSTERING TECHNIQUES
4. RESULTS AND DISCUSSION
5. CONTRIBUTIONS OF THE RESEARCH
6. CONCLUSIONS
Chapter 9 THE CREATION OF HIGHLIGHT ABSTRACTS OF MAGAZINE ARTICLES
1. INTRODUCTION
2. TEXT CORPUS AND OUTPUT OF THE SYSTEM
3. METHODS: THE USE OF A TEXT GRAMMAR
4. RESULTS AND DISCUSSION
5. CONTRIBUTIONS OF THE RESEARCH
6. CONCLUSIONS
Chapter 10 THE ASSIGNMENT OF SUBJECT DESCRIPTORS TO MAGAZINE ARTICLES
1. INTRODUCTION
2. TEXT CORPUS AND OUTPUT OF THE SYSTEM
3. METHODS: SUPERVISED LEARNING OF CLASSIFICATION PATTERNS
4. RESULTS AND DISCUSSION
5. CONTRIBUTIONS OF THE RESEARCH
6. CONCLUSIONS
SUMMARY AND FUTURE PROSPECTS
1. SUMMARY
2. FUTURE PROSPECTS
REFERENCES
SUBJECT INDEX