<?xml version="1.0" encoding="UTF-8"?>
<feed xmlns="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <title>DSpace Collection: Training and assessing classification rules with unbalanced data</title>
  <link rel="alternate" href="http://www.openstarts.units.it:80/dspace/handle/10077/4000" />
  <subtitle>Training and assessing classification rules with unbalanced data</subtitle>
  <id>http://www.openstarts.units.it:80/dspace/handle/10077/4000</id>
  <updated>2013-06-18T06:39:32Z</updated>
  <dc:date>2013-06-18T06:39:32Z</dc:date>
  <entry>
    <title>Training and assessing classification rules with unbalanced data</title>
    <link rel="alternate" href="http://www.openstarts.units.it:80/dspace/handle/10077/4002" />
    <author>
      <name>Menardi, Giovanna</name>
    </author>
    <author>
      <name>Torelli, Nicola</name>
    </author>
    <id>http://www.openstarts.units.it:80/dspace/handle/10077/4002</id>
    <updated>2011-02-17T23:33:25Z</updated>
    <published>2010-01-01T00:00:00Z</published>
    <summary type="text">Title: Training and assessing classification rules with unbalanced data
Authors: Menardi, Giovanna; Torelli, Nicola
Abstract: The problem of modeling binary responses by using cross-sectional data has been addressed&#xD;
with a number of satisfying solutions that draw on both parametric and nonparametric&#xD;
methods. However, there exist many real situations where one of the two responses (usually&#xD;
the most interesting for the analysis) is rare. It has been largely reported that this class&#xD;
imbalance heavily compromises the process of learning, because the model tends to focus on&#xD;
the prevalent class and to ignore the rare events. However, not only the estimation of the&#xD;
classification model is affected by a skewed distribution of the classes, but also the evaluation&#xD;
of its accuracy is jeopardized, because the scarcity of data leads to poor estimates of the&#xD;
model’s accuracy.&#xD;
In this work, the effects of class imbalance on model training and model assessing are&#xD;
discussed. Moreover, a unified and systematic framework for dealing with both the problems is proposed, based on a smoothed bootstrap re-sampling technique.
Type: Libro / capitolo</summary>
    <dc:date>2010-01-01T00:00:00Z</dc:date>
  </entry>
</feed>

