Standing on the Shoulders of Giants and Variations on a Theme
Data Mining at Microsoft
Eavesdropping on the Brain
Statistical Data Mining
Bioinformatics
Making Trees Interactive – KLIMT
Some Graphics for Recursive Partitioning
Predictive Data Mining with Multiple Additive Regression Trees
Towards Understanding Boosting
Why Does Model Averaging Work?
A Model Based Approach to Text Categorization and Clustering
Unsupervised Segmentation and Classification of Mixtures of Markovian Sources
Evaluating Sequential Tests for a Class of Stochastic Processes
A Comparison of Reversible Jump MCMC Algorithms for DNA Sequence Segmentation Using Hidden Markov Models
Functional Analysis of Computer Network Data
Inferring Internal Losses and Delays in Communication Networks from Edge Measurements
Texture Modeling Using Self-Similar Wavelets and POMMs
The Adaptive Data Cube: An Experiment in Hyperspectral Pattern Recognition
GGobi: XGobi Redesigned and Extended
Visual Post Analysis of Association Rules
Uncovering Complexity in Data through Sound
Causal Inference in Statistics: A Gentle Introduction
The Defining Role of "Principal Effects" in Comparing Treatments Using General Post-Treatment Variables
Active Learning for Support Vector Machines with Applications to Text Classification
Conditional Random Fields for Text Processing
Relevant Encoding of Linguistic Data via the Information Bottleneck Method
A Split Merge Markov Chain Sampling Algorithm for Bayesian Mixture Models
Priors for Bayesian Neural Networks
Adaptive Metropolis-Hastings Samplers for the Bayesian Analysis of Large Linear Gaussian Systems
Genetic Analysis of Melanoma Onset by using Estimating Equations and Bayesian Random Effects Models
GDAGsim: Sparse Matrix Algorithms for Bayesian Computation
Approximations to Dirichlet Processes with Applications
Banks of Interacting Bayesian Filters
Data Reduction by Quantization
The Principle and Practice of Minimum Description Length
A Bayesian Approach to Analysis of cDNA Microarray Data
A Statistical Analysis of Radiolabeled Gene Expression Data
Replication and Appropriate Statistical Analysis are Required for Accurate Interpretation of DNA Microarray Experiments
Identifying Statistically Significant Similarities in Gene Expression Patterns via Bayesian Infinite Mixture Models
An Interdisciplinary Program Employing Computational, Biochemical and Genomic Methods to Examine the Effects of Chromosome Structure on the Regulation of Gene Expression
Variational Models and Bayesian Estimation
Advanced Mean Field Methods for Probabilistic Models
Probability Assessment with Maximum Entropy in Bayesian Networks
Functional Data Analysis of Complex Computer Simulation Output: A Case Study in Nuclear Waste Disposal Waste Assessment
Integrated Assessment of Drinking Water Regulations
Bayesian Sensitivity Analysis and Uncertainty Analysis
Sensitivity Analysis of a Buried Radioactive Waste Risk Model
Learning to Trade via Direct Reinforcement
Statistical Inference, the Bootstrap, and Neural Network Modeling with Application to Foreign Exchange Rates
Dynamic Visualization of Changing Prior and Posterior in Bayesian Analysis
Nonparametric Clustering
...Reflections on a Workshop
Statistical Learning Problems Associated with the World Wide Web
Finite State Approaches to Information Extraction
Graph Structure in the Web
Genome-Wide Binding Motif Discovery via Microarray and Prospect Sampler
Hierarchical Models for Gene Expression Data Analysis
Stochastic Models for Sequences with Non-Local Dependency Structure
John Tukey and the Correlation Coefficient
On the Interaction between Statistics and Computing: In Memory of John W. Tukey
The Legacy of John Tukey
Spatio-Temporal Prediction of Incomplete Precipitation Records
Bayesian and Frequentist Inference for Ecological Inference: The R x C Case
Using the Chemical Mass Balance Model to Estimate Pollution Source Contributions from Correlated Air Quality Observations
Land Cover Mapping using Combination and Ensemble Classifiers
Mining for Knowledge about Ostracode Assemblages in the Tecolutla River Delta
Developing Data Mining Systems
Graphical and Statistical Pruning of Association Rules
Searching the Web: Current Limitations, New Techniques, and Future Directions
How Large is the World Wide Web?
A Tutorial on Support Vector Machines
Kernel Methods for Unsupervised Learning
Bernhard Schollkopf
Graphical Representation as a Discipline
Clustering and Genetics of Complex Disease
Multivariate Statistical Process Control and Signature Analysis using Eigenfactor Detection Methods
Data Sharpening for Higher-Order Density Estimation
Robust Detection of Multivariate Outliers in High Dimensions and High Levels of Contamination
The Complexity of Computing the MCD Estimator
Finding Committee Solutions by Clustering Models in Function Space
Detecting Novel Samples in Mass Spectral Data: A Clustering Approach
A Computational Approach to Full Nonparametric Bayesian Inference under Dirichlet Process Mixture Models
Hierarchical Model-Based Clustering for Large Datasets
Computing Environments for Bayesian Statistics
Stochastic Parameterized Grammars for Bayesian Model Composition
The Bayes Net Toolbox for Matlab
Data Squashing: Constructing Summary Data Sets
Exploratory Analysis of Retail Sales of Billions of Items
Mining Large Datasets
Technology and 2010 Census
The U.S. Census Bureau's MAF/TIGER System, Internal and External Interfaces
Assessing Patient Survival using Microarray Gene Expression Data via Partial Least Squares Proportional Hazard Regression
Lessons Learned from Analyzing the Differential Gene Expression Data between Normal and Tumor Tissues in Head and Neck Cancer Patients
Taming Genetic Microarray Data: A Paradigm using a Well-Known Case Study
Statistical Modelling of Micro Array Data
Unraveling and Defining Biocomplexity
Theoretical and Computational Challenges in Entropy Evaluation of Macromolecules
Ciphertext Size Requirement of Ciphertext-Only Attack on Vigenère Cipher
Interval Computation of Gamma Probabilities and Their Inverses
Smooth Quadratures of Volterra Integral Equations with Applications to Estimation of HIV Infection Rates and Projection of AIDS Incidence
Designing Experiments for Causal Networks
Multi-Layer Structured Correlation Designs for Heterogeneous and Unbalanced Clustered Data
On Perfect Stability in Characteristic Functions
An Environment for Creating Interactive Statistical Documents
Experiences with a Course on "Web-Based Statistics"
ASSIST: A Package for Spline Smoothing in S-Plus
Template JAVA Implementation of Multiple Linear Regression Models for Patient-Specific Longitudinal Data to Monitor Chemotherapy-Induced Anemia
The Development of Community Nutrition Map (CNMap)
Cost Growth Models for NASA's Programs: A Summary
Series Approximations in Analysis of Risk
An Adequate Statistics for the Exponentially Distributed Censoring Data
Comparing Two Measurement Devices: Review and Extensions to Estimate New Device Variability
Computationally Intensive Techniques for a Fully Bayesian, Decision Theoretic Approach to Financial Forecasting and Portfolio Selection
A Statistical View of the Support Vector Machine
Lazy Class Probability Estimators
PERT – Perfect Random Tree Ensembles
Multicategory Support Vector Machines
Using Pseudo-Predictors to Improve the Performance of a Classification Rule
Inference for Self-Modeling Regression with Random Effects
Support Vector Machine Regression in Chemometrics
Data-Driven and Optimal Denoising of a Signal and Recovery of Its Derivative Using Multiwavelets
RIP-GAMs with an Application to Human Brain Research
An Adaptive-Learned Temporal Radial Basis Function Network for Recursive Function Estimation
A Statistical Approach to the Segmentation of MR Imagery and Volume Estimation of Stroke Lesions
Visualizing Spatial Autocorrelation with Dynamically Linked Windows
Compressions and Analysis of Very Large Imagery Data Sets using Spatial Statistics
Statistical Visualization of Environmental Data on the Web using nViZn
A Principled Approach to Interactive Hierarchical Non-Linear Visualization of High-Dimensional Data
A Tree-Based Scan Statistic for Database Disease Surveillance
Creating Ensembles of Decision Trees through Sampling
Data Mining Diabetic Databases: Are Rough Sets a Useful Addition?
Model Complexity Based Design of Radial Basis Function Networks with Data Mining Applications
Combining Decision Trees using Systematic Patterns
Resampling Time Series with Seasonal Components
Correlation and Sampling in Relational Data Mining
Inference for the Sample Maximum in the Presence of Serial Correlation and Heavy-Tailed Distributions
BootQC: Bootstrap for Statistical Quality Control and Applications to Aviation Safety Analysis
Selection of the Shrinkage Factor for the Two Stage Testimator of the Normal Mean using Bootstrap Likelihood
James F. Goodnight
David Heckerman
Banquet Address
Terrence Sejnowski
Short Courses
Edward J. Wegman
Pierre Baldi
Statistical Graphics
Simon Urbanek and Antony R. Unwin
Daniel B. Carr and Ru Sun
Flexible Models for Prediction
Jerome H. Friedman
Bin Yu and Peter Buhlmann
Yoav Freund
Model-Based Clustering
Alejandro Murua, Jeremy Tantrum, Werner Stuetzle and Solveig Sieberts
Yevgeny Seldin, Gill Bejerano, and Naftali Tishby
Xiaoping Xiong and Ming Tan
Richard J. Boys and Daniel A. Henderson
Office of Naval Research Overview
J. L. Solka and D. J. Marchette
Robert Nowak
Jennifer Davidson and Richard Barton
Carey E. Priebe
Visualization for Data Mining
Deborah F. Swayne, Duncan Temple Lang, Andreas Buja, and Dianne Cook
H. Hofmann
Mark H. Hansen and Ben Rubin
Beyond Correlation
Judea Pearl
Constantine E. Frangakis and Donald B. Rubin
Statistical Models for Text
Simon Tong and Daphne Koller
John Lafferty, Andrew McCallum, and Fernando Pereira
Naftali Tishby
Bayesian Methods
Sonia Jain and Radford M. Neal
Mark Robinson
Stephen K. H. Yeung and Darren J. Wilkinson
K-A. Do, P. Kuhnert, S-J. Lee, J. F. Aitken, A. Green, and N. G. Martin
Darren J. Wilkinson
Army Research Office Overview
Jayaram Sethuraman
Boris L. Rozovskii, R. Blazek and A. Petrov
Edward J. Wegman and Nkem-Amin (Martin) Khumbah
Bin Yu
Gene Expression – I
M. A. Black, B. A. Craig, M. Tanurdzic, and R. W. Doerge
Rafael A. Irizarry, Giovanni Parmigiani, Mingzhou Guo, Tatiana Dracheva, and Jin Jen
She-pin Hung and G. Wesley Hatfield
Mario Medvedovic and Siva Sivaganesan
Lorenzo Tolleri, Craig J. Benham, Pierre Baldi, and G. Wesley Hatfield
Graphical Models
Tommi Jaakkola
Manfred Opper and Ole Winther
Wim Wiegerinck and Tom Heskes
Environmental Modeling
David Draper and Bruno Mendes
Mitchell J. Small, Patrick Gurian, Mark Schervish, and J. R. Lockwood
Jeremy E. Oakley and Anthony O'Hagan
Tom Stockton
Computational Finance
John Moody
Jeff Racine and Halbert White
National Security Agency Overview
Hani Doss and B. Narasimhan
David W. Scott
Massive Data Sets
Jon R. Kettenring
Analyzing Web Data
Byron Dom
Andrew McCallum, Fernando Pereira, John Lafferty, and Dayne Freitag
Andrew Tomkins
Bayesian Bioinformatics
Jun Liu and Xiaole Liu
Michael Newton and Christina Kendziorski
Scott C. Schmidler
John Tukey and the Interface
David R. Brillinger
Luisa Fernholz
Robert L. Launer
Ecological and Earth Science Applications
Craig Johns and Douglas Nychka
Ori Rosen, Wenxin Jiang, Gary King, and Martin Tanner
William F. Christensen
Brian M. Steele and David A. Patterson
A. Dale Magoun, Melvin Kontrovitz and Daniel J. Stanley
International Association for Statistical Computing Overview
Arno Siebes
Adalbert Wilhelm
How Large is the Web?
C. Lee Giles
Adrian Dobra and Stephen E. Fienberg
Support Vector Machines
Bernhard Schollkopf
Chernoff Faces the Interface
Herman Chernoff
Richard Olshen
Kuang-Han Chen, Duane S. Boning, and Roy E. Welsch
Clusters, Outliers, and Density Models
Michael C. Minnotte and Peter Hall
Mark Werner and Karen Kafadar
Thorsten Bernholt and Paul Fischer
Thomas Ragg
Vladimir Svetnik and Andy I. Liaw
Journal of Computational and Graphical Statistics Overview
Alan E. Gelfand and Athanasios Kottas
Christian Posse
Software Support for Bayesian Analysis Systems
Robert Gentleman
Eric Mjolsness, Michael Turmon, and Wolfgang Fink
Kevin P. Murphy
Massive Data Sets
William DuMouchel
Dunja Mladenic, William F. Eddy, and Scott Ziolko
Johannes Gehrke
Census 2000: Lessons for Census 2010
Carol M. Van Horn
Robert Marx and Linda M. Franz
Gene Expression II
Danh V. Nguyen and David M. Rocke
J. Jack Lee, Hyung Woo Kim, Feng Zhan, and Adel K. El-Naggar
Howard T. Thaler
Ziad Taib
National Science Foundation Overview
William K. Michener and James L. Rosenberger
H. Singh, J. Harner, V. Hnizdo and E. Demchek
Computational Tools and Methods
Qiong Yang and Song Guo
Trong Wu
John J. Hsieh
William D. Heavlin
Edward C. Chao
Jinhyo Kim and Bongsu Ko
Samuel E. Buttrey, Deborah Nolan and Duncan Temple Lang
Jürgen Symanzik and Natascha Vukasinovic
Yuedong Wang and Chunlei Ke
Christine E. McLaren, Wagner Truppel, Randall F. Holcombe, and Edward L. Kambour
Alvin B. Nowverl
Decision Support and Forecasting
Tze-San Lee and L. Dale Thomas
Costas A. Christophi and Reza Modarres
P. S. Nair and S-C. Cheng
Brian J. Eastwood
Andrew Simpson and Darren J. Wilkinson
Classification Methods
Yi Lin
Dragos D. Margineantu and Thomas G. Dietterich
Adele Cutler and Guohua Zhao
Yoonkyung Lee, Yi Lin, and Grace Wahba
Majid Mojirsheibani
Regression and Function Estimation
Naomi Altman
Ayhan Demiriz, Kristin P. Bennett, Curt M. Breneman, and Mark J. Embrechts
Nathanial Tymes, Jr., Sam Efromovich, M. Christina Pereyra, and Joseph D. Lakey
Michael G. Schimek
Yiu Ming Cheung and Lei Xu
Visualization and Image Data
Benjamin Stein and Joseph Horowitz
Luc Anselin, Ibnu Syabri, Oleg Smirnov, and Yanqui Ren
James A. Shine
Lacey Jones and Jürgen Symanzik
Peter Tino, Ian Nabney, Yi Sun, and Bruce S. Williams
Data Mining
Martin Kulldorff, Zixing Fang, and Stephen Walsh
Chandrika Kamath and Erick Cantú-Paz
Joseph L. Breault
Miyoung Shin and Amrit L. Goel
Hyunjoong Kim
Sampling and Resampling Methods
Dimitris N. Politis
David Jensen and Jennifer Neville
Tucker McElroy and Dimitris N. Politis
Regina Y. Liu and Hueychung Teng
Makarand V. Ratnaparkhi, Vasant B. Waikar, and Frederick J. Schuurmann
Comparative Genomics and the Future of Biological Knowledge
The Public Working Draft of the Human Genome
Identification of Post-Translationally Modified and Mutated Proteins via Mass-Spectrometry
Improved Statistical Inference from DNA Microarray Data using Analysis of Variance and a Bayesian Statistical Framework
Statistical Issues, Data Analysis, and Modelling for Gene Expression Profiling
Plaid Models for DNA Microarrays
Integrating Data and Disciplines: Biostatistics and Biomedical Informatics
The Trouble with Text: Challenges and Promises of Biomedical Information Retrieval Technology
Public Health Aspects of Bioinformatics and Medical Informatics
On Metrics and Variational Equations of Computational Anatomy
Visual Analysis of Variance: A Tool for Quantitative Assessment of fMRI Data Processing and Analysis
Positron Emission Tomography: Image Formation and Analysis
Is Cross-Validation the Best Approach for Principal Component and Ridge Regression?
Anthony Kerlavage
David Haussler
Pavel Pevzner
Gene Expression Data Analysis
G. Wesley Hatfield
Mike West
Art Owen
Medical Informatics
Joyce Niland
Wanda Pratt
Abdelmonem A. Afifi
Automated Analysis of Brain Images
Michael Miller
William F. Eddy and R. L. McNamee
Richard Leahy
Corrected Paper from Volume 32
Roy E. Welsch