HTML Microdata document

This HTML5 document contains 24 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

Prefix	IRI
dcterms	http://purl.org/dc/terms/
n16	doi:10.1093/bib/
n2	https://kar.kent.ac.uk/id/eprint/
n6	https://kar.kent.ac.uk/id/eprint/73238#
wdrs	http://www.w3.org/2007/05/powder-s#
n19	http://purl.org/ontology/bibo/status/
rdfs	http://www.w3.org/2000/01/rdf-schema#
n10	https://kar.kent.ac.uk/id/subject/
n17	https://demo.openlinksw.com/about/id/entity/https/raw.githubusercontent.com/annajordanous/CO644Files/main/
n14	http://eprints.org/ontology/
n13	https://kar.kent.ac.uk/73238/
bibo	http://purl.org/ontology/bibo/
n18	https://kar.kent.ac.uk/id/publication/
n9	https://kar.kent.ac.uk/id/org/
rdf	http://www.w3.org/1999/02/22-rdf-syntax-ns#
owl	http://www.w3.org/2002/07/owl#
n8	https://kar.kent.ac.uk/id/
xsdh	http://www.w3.org/2001/XMLSchema#
n5	https://demo.openlinksw.com/about/id/entity/https/www.cs.kent.ac.uk/people/staff/akj22/materials/CO644/
n11	https://kar.kent.ac.uk/id/person/

Statements

Subject Item: n2:73238
rdf:type: bibo:AcademicArticle bibo:Article n14:EPrint n14:ArticleEPrint
rdfs:seeAlso: n13:
owl:sameAs: n16:bbz028
dcterms:title: Comparing enrichment analysis and machine learning for identifying gene properties that discriminate between gene classes
wdrs:describedby: n5:export_kar_RDFN3.n3 n17:export_kar_RDFN3.n3
dcterms:date: 2020-05
dcterms:creator: n11:ext-f.fabris@kent.ac.uk n11:ext-a.a.freitas@kent.ac.uk n11:ext-efefe6ca5b17354a282e598f9f8ed396 n11:ext-d1f63cf99ba4e343f8121300f037bba2
bibo:status: n19:peerReviewed n19:published
dcterms:publisher: n9:ext-ffae441f908983694f410e3721f2491d
bibo:abstract: Biologists very often use enrichment methods based on statistical hypothesis tests to identify gene properties that are significantly over-represented in a given set of genes of interest, by comparison with a ‘background’ set of genes. These enrichment methods, although based on rigorous statistical foundations, are not always the best single option to identify patterns in biological data. In many cases, one can also use classification algorithms from the machine-learning field. Unlike enrichment methods, classification algorithms are designed to maximize measures of predictive performance and are capable of analysing combinations of gene properties, instead of one property at a time. In practice, however, the majority of studies use either enrichment or classification methods (rather than both), and there is a lack of literature discussing the pros and cons of both types of method. The goal of this paper is to compare and contrast enrichment and classification methods, offering two contributions. First, we discuss the (to some extent complementary) advantages and disadvantages of both types of methods for identifying gene properties that discriminate between gene classes. Second, we provide a set of high-level recommendations for using enrichment and classification methods. Overall, by highlighting the strengths and the weaknesses of both types of methods we argue that both should be used in bioinformatics analyses.
dcterms:isPartOf: n8:repository n18:ext-14774054
dcterms:subject: n10:Q
bibo:authorList: n6:authors
bibo:issue: 3
bibo:volume: 21