: Antony Unwin, Martin Theus, Heike Hofmann
: Graphics of Large Datasets Visualizing a Million
: Springer-Verlag
: 9780387379777
: 1
: CHF 70.40
:
: Sonstiges
: English
: 271
: Wasserzeichen
: PC/MAC/eReader/Tablet
: PDF

This book shows how to look at ways of visualizing large datasets, whether large in numbers of cases, or large in numbers of variables, or large in both. All ideas are illustrated with displays from analyses of real datasets and the importance of interpreting displays effectively is emphasized. Graphics should be drawn to convey information and the book includes many insightful examples. New approaches to graphics are needed to visualize the information in large datasets and most of the innovations described in this book are developments of standard graphics. The book is accessible to readers with some experience of drawing statistical graphics.

Preface7
Contents10
1 Introduction15
1.1 Introduction15
1.2 Data Visualization18
1.3 Research Literature21
1.4 How Large Is a Large Dataset?23
1.5 The Effects of Largeness31
1.6 What Is in This Book36
1.7 Software37
1.8 What Is on the Website38
1.9 Contributing Authors40
Basics42
2 Statistical Graphics43
2.1 Introduction43
2.2 Plots for Categorical Data43
2.3 Plots for Continuous Data48
2.4 Data on Mixed Scales56
2.5 Maps59
2.6 Contour Plots and Image Maps61
2.7 Time Series Plots62
2.8 Structure Plots63
3 Scaling Up Graphics67
3.1 Introduction67
3.2 Upscaling as a General Problem in Statistics67
3.3 Area Plots68
3.4 Point Plots74
3.5 From Areas to Points and Back79
3.6 Modifying Plots83
3.7 Summary84
4 Interacting with Graphics85
4.1 Introduction85
4.2 Interaction86
4.3 Interaction and Data Displays87
4.4 Interaction and Large Datasets100
4.5 New Interactive Tasks110
4.6 Summary and Future Directions113
Applications114
5 Multivariate Categorical Data — Mosaic Plots115
5.1 Introduction115
5.2 Area-based Displays115
5.3 Displays and Techniques in One Dimension117
5.4 Mosaic Plots123
5.5 Summary133
6 Rotating Plots135
6.1 Introduction135
6.2 Beginning to Work with a Million Cases138
6.3 Software System145
6.4 Application147
6.5 Current and Future Developments150
7 Multivariate Continuous Data — Parallel Coordinates152
7.1 Introduction152
7.2 Interpolations and Inner Products153
7.3 Generalized Parallel Coordinate Geometry154
7.4 A New Family of Smooth Plots158
7.5 Examples159
7.6 Detecting Second–Order Structures163
7.7 Summary164
8 Networks165
8.1 Introduction165
8.2 Layout Algorithms166
8.3 Interactivity170
8.4 NicheWorks174
8.5 Example: International Calling Fraud175
8.6 Languages for Description and Layouts180
8.7 Summary182
9 Trees184
9.1 Introduction184
9.2 Growing Trees for Large Datasets185
9.3 Visualization of Large Trees194
9.4 Forests for Large Datasets205
9.5 Summary209
10 Transactions210
10.1 Introduction and Background210
10.2 Mice and Elephant Plots and Random Sampling212
10.3 Biased Sampling217
10.4 Quantile Window Sampling222
10.5 Commonality of Flow Rates228
11 Graphics of a Large Dataset234
11.1 Introduction234
11.2 QuickStart Guide Data Visualization for Large Datasets235
11.3 Visualizing the InfoVis 2005 Contest Dataset236
References257
Authors268
Index272