Scripps VIVO scripps research logo

  • Index
  • Log in
  • Home
  • People
  • Organizations
  • Research
  • Events
Search form

Tclust: A fast method for clustering genome-scale expression data

Academic Article
uri icon
  • Overview
  • Research
  • Identity
  • Additional Document Info
  • View All
scroll to property group menus

Overview

authors

  • Dost, B.
  • Wu, Chunlei
  • Su, Andrew
  • Bafna, V.

publication date

  • May 2011

journal

  • IEEE ACM Transactions on Computational Biology and Bioinformatics  Journal

abstract

  • Genes with a common function are often hypothesized to have correlated expression levels in mRNA expression data, motivating the development of clustering algorithms for gene expression data sets. We observe that existing approaches do not scale well for large data sets, and indeed did not converge for the data set considered here. We present a novel clustering method TCLUST that exploits coconnectedness to efficiently cluster large, sparse expression data. We compare our approach with two existing clustering methods CAST and K-means which have been previously applied to clustering of gene-expression data with good performance results. Using a number of metrics, TCLUST is shown to be superior to or at least competitive with the other methods, while being much faster. We have applied this clustering algorithm to a genome-scale gene-expression data set and used gene set enrichment analysis to discover highly significant biological clusters. (Source code for TCLUST is downloadable at http://www.cse.ucsd.edu/~bdost/tclust.)

subject areas

  • Algorithms
  • Animals
  • Cluster Analysis
  • Computer Simulation
  • Databases, Genetic
  • Gene Expression Profiling
  • Genomics
  • Mice
  • Mice, Inbred Strains
  • Models, Molecular
  • Oligonucleotide Array Sequence Analysis
scroll to property group menus

Research

keywords

  • Microarray expression
  • clustering
  • coconnectedness
  • graph algorithms
scroll to property group menus

Identity

International Standard Serial Number (ISSN)

  • 1545-5963

Digital Object Identifier (DOI)

  • 10.1109/tcbb.2010.34

PubMed ID

  • 20479508
scroll to property group menus

Additional Document Info

start page

  • 808

end page

  • 818

volume

  • 8

issue

  • 3

©2021 The Scripps Research Institute | Terms of Use | Powered by VIVO

  • About
  • Contact Us
  • Support