Publication
Exploiting the bin-class histograms for feature selection on discrete data
dc.contributor.author | Ferreira, Artur J. | |
dc.contributor.author | Figueiredo, Mário A. T. | |
dc.date.accessioned | 2016-04-21T11:32:07Z | |
dc.date.available | 2016-04-21T11:32:07Z | |
dc.date.issued | 2015 | |
dc.description.abstract | In machine learning and pattern recognition tasks, the use of feature discretization techniques may have several advantages. The discretized features may hold enough information for the learning task at hand, while ignoring minor fluctuations that are irrelevant or harmful for that task. The discretized features have more compact representations that may yield both better accuracy and lower training time, as compared to the use of the original features. However, in many cases, mainly with medium- and high-dimensional data, the large number of features usually implies that there is some redundancy among them. Thus, we may further apply feature selection (FS) techniques on the discrete data, keeping the most relevant features while discarding the irrelevant and redundant ones. In this paper, we propose relevance and redundancy criteria for supervised feature selection techniques on discrete data. These criteria are applied to the bin-class histograms of the discrete features. The experimental results, on public benchmark data, show that the proposed criteria can achieve better accuracy than widely used relevance and redundancy criteria, such as mutual information and the Fisher ratio. | pt_PT |
dc.identifier.citation | FERREIRA, Artur J.; FIGUEIREDO, Mário A. T. - Exploiting the Bin-Class Histograms for Feature Selection on Discrete Data. In 7th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA). Santiago de Compostela: Springer-Verlag Berlin, 2015. ISBN 978-3-319-19390-8. Vol. 9117, pp. 345-353 | pt_PT |
dc.identifier.doi | 10.1007/978-3-319-19390-8_39 | pt_PT |
dc.identifier.isbn | 978-3-319-19390-8 | |
dc.identifier.issn | 0302-9743 | |
dc.identifier.uri | http://hdl.handle.net/10400.21/6075 | |
dc.language.iso | eng | pt_PT |
dc.peerreviewed | yes | pt_PT |
dc.publisher | Springer-Verlag Berlin | pt_PT |
dc.relation.publisherversion | http://link.springer.com/chapter/10.1007%2F978-3-319-19390-8_39 | pt_PT |
dc.subject | Feature selection | pt_PT |
dc.subject | Feature discretization | pt_PT |
dc.subject | Discrete features | pt_PT |
dc.subject | Bin-class histogram | pt_PT |
dc.subject | Matrix norm | pt_PT |
dc.subject | Supervised learning | pt_PT |
dc.subject | Classification | pt_PT |
dc.title | Exploiting the bin-class histograms for feature selection on discrete data | pt_PT |
dc.type | conference object | |
dspace.entity.type | Publication | |
oaire.citation.conferencePlace | Santiago de Compostela | pt_PT |
oaire.citation.endPage | 353 | pt_PT |
oaire.citation.startPage | 345 | pt_PT |
oaire.citation.title | 7th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA) | pt_PT |
oaire.citation.volume | 9117 | pt_PT |
person.familyName | Ferreira | |
person.givenName | Artur | |
person.identifier | 1049438 | |
person.identifier.ciencia-id | 091A-96FB-A88C | |
person.identifier.orcid | 0000-0002-6508-0932 | |
person.identifier.rid | AAL-4377-2020 | |
person.identifier.scopus-author-id | 35315359300 | |
rcaap.rights | closedAccess | pt_PT |
rcaap.type | conferenceObject | pt_PT |
relation.isAuthorOfPublication | 734bfe75-0c68-4cdf-8a87-2aef3564f5bd | |
relation.isAuthorOfPublication.latestForDiscovery | 734bfe75-0c68-4cdf-8a87-2aef3564f5bd |
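The abstract refers to bin-class histograms of discrete features and names mutual information as one of the widely used baseline criteria. This record does not describe the criteria proposed in the paper, so the following is only a minimal sketch of what such a histogram might look like and how the mutual-information baseline could be computed from it. The function names (`bin_class_histogram`, `mutual_information`) and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter
from math import log2


def bin_class_histogram(feature, labels):
    """Count how often each (bin, class) pair occurs for one discrete feature.

    Returns a list of rows, one per bin value (sorted), with one column per
    class label (sorted). This layout is an assumption made for illustration.
    """
    bins = sorted(set(feature))
    classes = sorted(set(labels))
    counts = Counter(zip(feature, labels))
    return [[counts[(b, c)] for c in classes] for b in bins]


def mutual_information(hist):
    """Mutual information, in bits, between bin index and class label."""
    n = sum(sum(row) for row in hist)
    row_totals = [sum(row) for row in hist]
    col_totals = [sum(col) for col in zip(*hist)]
    mi = 0.0
    for i, row in enumerate(hist):
        for j, count in enumerate(row):
            if count:  # empty cells contribute nothing
                mi += (count / n) * log2(count * n / (row_totals[i] * col_totals[j]))
    return mi


# Toy example: one feature discretized into three bins, two classes.
feature = [0, 0, 1, 1, 2, 2]
labels = ["a", "a", "b", "b", "a", "b"]
hist = bin_class_histogram(feature, labels)
relevance = mutual_information(hist)
```

In this toy case, bins 0 and 1 each contain a single class while bin 2 is split evenly, so the feature is informative but not perfectly so. A feature whose histogram rows all had the same class proportions would score zero under this baseline.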
Files
Original bundle
- Name: Exploiting the Bin-Class Histograms for Feature.pdf
- Size: 436.82 KB
- Format: Adobe Portable Document Format
License bundle
- Name: license.txt
- Size: 1.71 KB
- Format: Item-specific license agreed upon to submission