University of Minnesota

Minnesota Supercomputing Institute

Research Abstracts Online
January - December 2011

University of Minnesota Twin Cities
College of Science and Engineering
Department of Computer Science and Engineering

PI: Vipin Kumar, Fellow

Mining Earth Science and Biomedical Data

The primary objective of this research is to develop novel, high-performance data-mining algorithms and tools for large-scale datasets that arise in a variety of applications. Examples include gigabyte datasets collected by earth-observing satellites, which must be processed to better understand global-scale changes in biosphere processes and patterns; data generated by scientific simulations, which can be used to gain insight into the underlying physical processes; data obtained by monitoring network traffic to detect illegal network activities; and large collections of text and hypertext, which are analyzed to extract relevant information. The key technical challenges in mining these datasets are their high volume, dimensionality, and heterogeneity; the spatio-temporal aspect of the data; possibly skewed class distributions; the distributed nature of the data; and the complexity of converting raw collected data into high-level features. High-performance data mining is essential for analyzing the growing volume of data and for providing analysts with automated tools that facilitate some of the steps needed for hypothesis generation and evaluation.

Data mining has also become a key tool for analyzing biomedical data. In collaboration with the Mayo Clinic in Rochester, Minnesota, these researchers are developing advanced data-mining techniques for several medical problems. Because data mining has established itself as an effective methodology for analyzing large amounts of biological data, the researchers are also trying to identify its impact on the automatic prediction of protein function from proteomics data, on genetic and genomic marker discovery from SNP and gene-expression data, and on next-generation sequencing data. The computational challenges imposed by the large size of the datasets will be addressed by building on past research in highly parallel formulations of key data-mining kernels for anomaly/outlier detection, finding association patterns, clustering, and building rare-class predictive models that can take advantage of high-performance computers. MSI resources are critical for this research.

Group Members

Kshitij Agrawal, Visiting Researcher
Saurabh Agrawal Airan, Graduate Student
Divya Alla, Graduate Student
Shyam Boriah, Graduate Student
Laina Ramsey Breidenbach, Undergraduate Student
Luchiana Brodeala, Visiting Researcher
Ivan C. Brugere, Graduate Student
Yashu Chamber, Graduate Student
Vijay Chaudhari, Graduate Student
Xi Chen, Graduate Student
Kelly Cutler, Undergraduate Student
Debashish Das, Temple University, Philadelphia, Pennsylvania
Sanjoy Dey, Graduate Student
Marc Dunham, Graduate Student
James H. Faghmous, Graduate Student
Gang Fang, Graduate Student
Filippo Farraris, Collaborator
Ashish Garg, Graduate Student
Arthita Ghosh, Collaborator
Dhruv Goel, Undergraduate Student
Atluri Gowtham, Graduate Student
Rohit Gupta, Graduate Student
Tushar Gupta, Visiting Researcher
Vibhor Gupta, Visiting Researcher
Ryan Haasken, Undergraduate Student
Ravi Janardan, Faculty Collaborator
Anthony Joyner, Graduate Student
Anuj Karpatne, Visiting Researcher
Jaya Kawale, Research Associate
Vikrant Krishna, Graduate Student
Sairam Krishnamurthy, Graduate Student
Aditya Kulkarni, Graduate Student
Arjun Kumar, Graduate Student
Shashank Kumar, Visiting Researcher
Sean Landman, Graduate Student
Michael Lau, Graduate Student
Aleksander Lazarevic, Research Associate
Peter Li, Research Associate
Stefan Liess, Research Associate
Kelvin O. Lim, Faculty Collaborator
Xiaoye Liu, Undergraduate Student
Lydia Manikonda, Collaborator
Joseph McNair, Graduate Student
Sanyam Mehta, Graduate Student
Varun Mithal, Undergraduate Student
Zachary R. O’Connor, Undergraduate Student
Benjamin W. Oatley, Undergraduate Student
Dominick Ormsby, Undergraduate Student
Gaurav Pandey, Graduate Student
Vanja Paunic, Graduate Student
Atma Persaud, Undergraduate Student
Sumit Raj, Collaborator
Saket Saurabh, Visiting Researcher
Garima Sharma, Visiting Researcher
Greg Simpson, Graduate Student
Ayush Singhal, Graduate Student
Rahul Singhania, Undergraduate Student
Graham D. Smith, Undergraduate Student
Michael S. Steinbach, Research Associate
Karsten Steinhaeuser, Research Associate
Rahni Sumler, Graduate Student
Pang Tan, Research Associate
Sruthi Vangala, Graduate Student
Mark Wagy, Undergraduate Student
Libing Wang, Collaborator
Wen Wang, Graduate Student
Roland Welter, Undergraduate Student
Baylor Wetzel, Graduate Student
Hui Xiong, Rutgers University, New Brunswick, New Jersey
Yi Yang, Graduate Student