Upcoming Events
Topic: Statistical Significance of Clustering for High Dimensional Data
Date: 25/06/2024
Time: 2:30 pm - 3:30 pm
Venue: Lady Shaw Building LT2
Category: Latest Seminars and Events
Speaker: Professor Yufeng Liu
PDF: PROF-Yufeng-Liu_25-JUNE-2024.pdf
Details:

Abstract

Clustering serves as a fundamental tool for exploratory data analysis, but a key challenge lies in determining whether the clusters identified by these methods are reliable or merely artifacts of natural sampling variation. In this talk, I will present statistical significance of clustering (SigClust) as a cluster evaluation tool for high-dimensional data. To begin, we define a cluster as data originating from a single Gaussian distribution and frame the assessment of statistical significance of clustering as a formal testing procedure. To address the challenge of high-dimensional covariance estimation in SigClust, we employ a combination of invariance principles and a factor analysis model. I’ll also discuss an enhanced SigClust using multidimensional scaling (MDS) on dissimilarity matrices. SigClust for hierarchical clustering will be presented as well. Simulations and real-data applications, including cancer subtype analysis, demonstrate SigClust’s effectiveness in assessing clustering significance.
