Repository logo
 
Publication

A mutual information based discretization-selection technique

dc.contributor.authorFerreira, Artur
dc.contributor.authorFigueiredo, Mário
dc.date.accessioned2024-03-27T17:34:27Z
dc.date.available2024-03-27T17:34:27Z
dc.date.issued2024
dc.description.abstractIn machine learning (ML) and data mining (DM) one often has to resort to data pre-processing techniques to achieve adequate data representations. Among these techniques, we find feature discretization (FD) and feature selection (FS), with many available methods for each one. The use of FD and FS techniques improves the data representation for ML and DM tasks. However, these techniques are usually applied in an independent way, that is, we may use a FD technique but not a FS technique or the opposite case. Using both FD and FS techniques in sequence, may not produce the most adequate results. In this paper, we propose a supervised discretization-selection technique; the discretization step is done in an incremental approach and keeps information regarding the features and the number of bits allocated per feature. Then, we apply a selection criterion based upon the discretization bins, yielding a discretized and dimensionality reduced dataset. We evaluate our technique on different typ es of data and in most cases the discretized and reduced version of the data is the most suited version, achieving better classification performance, as compared to the use of the original features.pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationFerreira, A. and Figueiredo, M. (2024). A Mutual Information Based Discretization-Selection Technique. In Proceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAM; ISBN 978-989-758-684-2; ISSN 2184-4313, SciTePress, pages 436-443. DOI: 10.5220/0012467300003654pt_PT
dc.identifier.doi10.5220/0012467300003654pt_PT
dc.identifier.isbn978-989-758-684-2
dc.identifier.issn2184-4313
dc.identifier.urihttp://hdl.handle.net/10400.21/17230
dc.language.isoengpt_PT
dc.peerreviewedyespt_PT
dc.publisherSciTePresspt_PT
dc.relation.publisherversionhttps://www.scitepress.org/PublicationsDetail.aspx?ID=Z10HgKay70g=&t=1pt_PT
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/pt_PT
dc.subjectBit Allocationpt_PT
dc.subjectClassificationpt_PT
dc.subjectExplainability,pt_PT
dc.subjectFeature Discretizationpt_PT
dc.subjectFeature Selectionpt_PT
dc.subjectMachine Learningpt_PT
dc.subjectMutual Informationpt_PT
dc.subjectSupervised Learningpt_PT
dc.titleA mutual information based discretization-selection techniquept_PT
dc.typejournal article
dspace.entity.typePublication
oaire.citation.conferencePlaceItáliapt_PT
oaire.citation.endPage443pt_PT
oaire.citation.startPage436pt_PT
oaire.citation.titleProceedings of the 13th International Conference on Pattern Recognition Applications and Methods - ICPRAMpt_PT
person.familyNameFerreira
person.givenNameArtur
person.identifier.orcid0009-0002-9141-3638
rcaap.rightsclosedAccesspt_PT
rcaap.typearticlept_PT
relation.isAuthorOfPublication7c81ed2a-c63d-4221-8626-0a83ad541a2f
relation.isAuthorOfPublication.latestForDiscovery7c81ed2a-c63d-4221-8626-0a83ad541a2f

Files

Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Mutual information.pdf
Size:
1.1 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: