Psst this week:
room EM 3.03, 1:15 PM, Thursday 23rd Feb 2012
Ioanna Filippou:
TBC
TBC
Dissanayake, Manjula: Feature Selection and Classification in Complex Biological Datasets
Feature selection continues to grow in importance in many areas of science and engineering as large datasets become increasingly common. For example, DNA micro arrays can measure expression levels of many thousands of genes simultaneously.
For effective data mining in such datasets, tools are required that can reliably distinguish the most relevant features. Previous work has shown promise for an evolutionary algorithm/classifier combination (EA/k-NN), which, in successive phases of the same algorithm, serves first as the feature selection mechanism and second as the machine learning method yielding an accurate classifier.
This work has been followed up by investigating the effect of introducing a weighting scheme for k-NN. The weighting scheme used here has been adopted from the adaptive local hyper plane method. The areas investigated included the way to apply adaptive weights to the k-NN method, the way to scale the weighting scheme up to work with large biological datasets and its effectiveness when combined with two phase EA/k-NN algorithm.