DOC PREVIEW
CMU CS 10701 - Feature Selection for fMRI Classification

This preview shows page 1-2-3 out of 8 pages.

Save
View full document
View full document
Premium Document
Do you want full access? Go Premium and unlock all 8 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 8 pages.
Access to all documents
Download any document
Ad free experience
View full document
Premium Document
Do you want full access? Go Premium and unlock all 8 pages.
Access to all documents
Download any document
Ad free experience
Premium Document
Do you want full access? Go Premium and unlock all 8 pages.
Access to all documents
Download any document
Ad free experience

Unformatted text preview:

1 Introduction1.1 Motivation1.2 Goal of this work2 Method2.1 Intuition2.2 Description of the algorithms3 Experiment4 ConclusionsFeature Selection for fMRI Classification Chuang Wu Program of Computational Biology Carnegie Mellon University Pittsburgh, PA 15213 [email protected] Abstract The functional Magnetic Resonance Imaging (fMRI) has provided us with an approach of revealing the activity of brain. Due to the large amount of data in fMRI studies, feature selection techniques are used to select particular features for classifier. In this project, Spectral Clustering is implemented to construct features to achieve best reconstruction of the data and be most efficient for making predictions. 1 Introduction 1.1 Motivation Over the past decade, a variety of different functional Magnetic Resonance Imaging (fMRI) experiments have been done in order to understand the human brain activity pattern when doing some certain task. By recording the activity pattern of human brain as images of 3D voxel, we are able to visualize the picture of the pattern, find statistical differences in bran activity during different tasks, and a more challenge problem is to train the data with a classifier so as to predict brain activity given any of the pattern, such as whether the human subject is reading a sentence or looking at a picture, or whether the subject is reading an ambiguous or non-ambiguous sentence, etc. Machine Learning is a most powerful approach to train the classifier and then use the classifier to discriminate between different cognitive states. A typical fMRI experiment produces a three-dimensional image related to the human subject’s brain activity every half second. The experiment consists of a set of trials, and the data is partitioned into trails, (reading a sentence, observing a picture, and determining whether the sentence correctly described the picture). There are about 6 human subjects in the data; each of the 40 trials lasts approximately 30 seconds. Only a fraction of the brain of each subject was imaged. The data is marked up with 25-30 anatomically defined regions (called "Regions of Interest"). Each image contains approximately 5,000 voxels (3D pixels), across a portion of the brain. Learning this series of brain data to an experimental condition label requires many challenges,one of which is the extremely sparse noisy data with extremely high dimensional features. This would cause the over-fitting problem for the classifier. Hence it is necessary to apply some feature selection method to make learning tractable and prevent over-fitting due to spurious correlations. The objective of feature selection is three-fold: improving the prediction performance of the predictors, providing faster and more cost-effective predictors, and providing a better understanding of the underlying process that generated the data. There are a number of generic feature construction methods, including: clustering; basic linear transforms of the input variables (PCA/SVD, LDA); more sophisticated linear transforms like spectral transforms (Fourier, Hadamard), wavelet transforms or convolutions of kernels; and applying simple functions to subsets of variables, like products to create monomials. 1.2 Goal of this work The goal of this work is to derive features with Spectral Clustering from the original brain image data as the input for the classifiers to achieve best reconstruction of the data and be most efficient for making predictions. Clustering has long been used for feature construction. The idea is to replace a group of “similar” variables by a cluster centroid, which becomes a new derived feature. The new derived feature then is treated as a representative for the whole cluster as the new input for the classifier. The clustering will greatly reduce the number of features and meanwhile without losing much information. The most popular algorithms include K-means and hierarchical clustering. The clustering method used in this project is ‘Spectral Clustering’. Spectral methods recently emerge as effective methods for data clustering, image segmentation, Web ranking analysis and dimension reduction. The spectral clustering algorithm is based on the concept of similarity between points instead of distance, as other algorithms do. The implemented algorithm is formulated as graph partition problem where the weight of each edge is the similarity between points that correspond to vertex connected by the edge. The goal of the algorithm is find the minimum weight cuts in the graph, but this problem can be addressed by the means of linear algebra, in particular by the eigenvalue decomposition techniques, from which the term "spectral" derives. 2 Method 2.1 Intuition Spectral cluster could tell the intrinsic features of the data, revealing the underlying cluster. One of the biggest differences between Spectral clustering and K-mean clustering is that, K-mean requires the initial value of K, i.e. the number of clusters. The initial value of K is kind of arbitrary; in contrast Spectral clustering has a rich structure with interesting properties and deep connections to principal component analysis. Hence the goal of this work is to implement a spectral clustering method to cluster the image data in order to reduce the feature number, construct new features, and improve accuracy of classifier. The code for manipulation and visualization of the fMRI data has been provided, as well as ‘Naïve Bayes Classifier’ and ‘Logistic Regression Classifier’. Hence the work needed to be done in this project is toimplement the Spectral Clustering algorithm, and combine with the existing code to increase the prediction accuracy. 2.2 Description of the algorithms The algorithm starts with well-motivated objective functions; optimization eventually leads to eigenvectors, with many clear and interesting algebraic properties. At the core of spectral clustering is the Laplacian of the graph adjacency (pairwise similarity) matrix, evolved from spectral graph partitioning. The detailed steps for the 'Spectral Clustering' in this project are [2]: 1, Constructing a matrix M, in which rows are corresponding to image and the columns are corresponding to voxels. 2, Normalize over the column (if the affinity is defined by Euclidean distance). 3, Construct Affinity Matrix A. The affinity is defined as Aij = exp(C(i,j)2/2σ2) if i≠j, and Aii=0, where C(i,j) is the correlation between the two


View Full Document

CMU CS 10701 - Feature Selection for fMRI Classification

Documents in this Course
lecture

lecture

12 pages

lecture

lecture

17 pages

HMMs

HMMs

40 pages

lecture

lecture

15 pages

lecture

lecture

20 pages

Notes

Notes

10 pages

Notes

Notes

15 pages

Lecture

Lecture

22 pages

Lecture

Lecture

13 pages

Lecture

Lecture

24 pages

Lecture9

Lecture9

38 pages

lecture

lecture

26 pages

lecture

lecture

13 pages

Lecture

Lecture

5 pages

lecture

lecture

18 pages

lecture

lecture

22 pages

Boosting

Boosting

11 pages

lecture

lecture

16 pages

lecture

lecture

20 pages

Lecture

Lecture

20 pages

Lecture

Lecture

39 pages

Lecture

Lecture

14 pages

Lecture

Lecture

18 pages

Lecture

Lecture

13 pages

Exam

Exam

10 pages

Lecture

Lecture

27 pages

Lecture

Lecture

15 pages

Lecture

Lecture

24 pages

Lecture

Lecture

16 pages

Lecture

Lecture

23 pages

Lecture6

Lecture6

28 pages

Notes

Notes

34 pages

lecture

lecture

15 pages

Midterm

Midterm

11 pages

lecture

lecture

11 pages

lecture

lecture

23 pages

Boosting

Boosting

35 pages

Lecture

Lecture

49 pages

Lecture

Lecture

22 pages

Lecture

Lecture

16 pages

Lecture

Lecture

18 pages

Lecture

Lecture

35 pages

lecture

lecture

22 pages

lecture

lecture

24 pages

Midterm

Midterm

17 pages

exam

exam

15 pages

Lecture12

Lecture12

32 pages

lecture

lecture

19 pages

Lecture

Lecture

32 pages

boosting

boosting

11 pages

pca-mdps

pca-mdps

56 pages

bns

bns

45 pages

mdps

mdps

42 pages

svms

svms

10 pages

Notes

Notes

12 pages

lecture

lecture

42 pages

lecture

lecture

29 pages

lecture

lecture

15 pages

Lecture

Lecture

12 pages

Lecture

Lecture

24 pages

Lecture

Lecture

22 pages

Midterm

Midterm

5 pages

mdps-rl

mdps-rl

26 pages

Load more
Download Feature Selection for fMRI Classification
Our administrator received your request to download this document. We will send you the file to your email shortly.
Loading Unlocking...
Login

Join to view Feature Selection for fMRI Classification and access 3M+ class-specific study document.

or
We will never post anything without your permission.
Don't have an account?
Sign Up

Join to view Feature Selection for fMRI Classification 2 2 and access 3M+ class-specific study document.

or

By creating an account you agree to our Privacy Policy and Terms Of Use

Already a member?