Great Deal! Get Instant $10 FREE in Account on First Order + 10% Cashback on Every Order Order Now

1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania...

1 answer below »
1. Title of Database: Abalone data
2. Sources:
(a) Original owners of database:
    Marine Resources Division
    Marine Research Laboratories - Taroona
    Department of Primary Industry and Fisheries, Tasmania
    GPO Box 619F, Hobart, Tasmania 7001, Australia
    (contact: Warwick Nash XXXXXXXXXX, XXXXXXXXXX)
(b) Donor of database:
    Sam Waugh ( XXXXXXXXXX)
    Department of Computer Science, University of Tasmania
    GPO Box 252C, Hobart, Tasmania 7001, Australia
(c) Date received: December 1995
3. Past Usage:
Sam Waugh (1995) "Extending and benchmarking Cascade-Co
elation", PhD
thesis, Computer Science Department, University of Tasmania.
-- Test set performance (final 1044 examples, first 3133 used for training):
    24.86% Cascade-Co
elation (no hidden nodes)
    26.25% Cascade-Co
elation (5 hidden nodes)
    21.5% C4.5
     0.0% Linear Discriminate Analysis
     3.57% k=5 Nearest Neighbou
(Problem encoded as a classification task)
-- Data set samples are highly overlapped. Further information is required
    to separate completely using affine combinations. Other restrictions
    to data set examined.
David Clark, Zoltan Schreter, Anthony Adams "A Quantitative Comparison of
Dystal and Backpropagation", submitted to the Australian Conference on
Neural Networks (ACNN'96). Data set treated as a 3-category classification
problem (grouping ring classes 1-8, 9 and 10, and 11 on).
-- Test set performance (3133 training, 1044 testing as above):
    64% Backprop
    55% Dystal
-- Previous work (Waugh, 1995) on same data set:
    61.40% Cascade-Co
elation (no hidden nodes)
    65.61% Cascade-Co
elation (5 hidden nodes)
    59.2% C4.5
    32.57% Linear Discriminate Analysis
    62.46% k=5 Nearest Neighbou
4. Relevant Information Paragraph:
Predicting the age of abalone from physical measurements. The age of
abalone is determined by cutting the shell through the cone, staining it,
and counting the number of rings through a microscope -- a boring and
time-consuming task. Other measurements, which are easier to obtain, are
used to predict the age. Further information, such as weather patterns
and location (hence food availability) may be required to solve the problem.
From the original data examples with missing values were removed (the
majority having the predicted value missing), and the ranges of the
continuous values have been scaled for use with an ANN (by dividing by 200).
Data comes from an original (non-machine-learning) study:
    Warwick J Nash, Tracy L Sellers, Simon R Talbot, Andrew J Cawthorn and
    Wes B Ford (1994) "The Population Biology of Abalone (_Haliotis_
    species) in Tasmania. I. Blacklip Abalone (_H. ru
a_) from the North
    Coast and Islands of Bass Strait", Sea Fisheries Division, Technical
    Report No. 48 (ISSN XXXXXXXXXX)
5. Number of Instances: 4177
6. Number of Attributes: 8
7. Attribute information:
Given is the attribute name, attribute type, the measurement unit and a

ief description. The number of rings is the value to predict: eithe
as a continuous value or as a classification problem.
    Name        Data Type    Meas.    Description
    ----        ---------    -----    -----------
    Sex        nominal            M, F, and I (infant)
    Length        continuous    mm    Longest shell measurement
    Diameter    continuous    mm    perpendicular to length
    Height        continuous    mm    with meat in shell
    Whole weight    continuous    grams    whole abalone
    Shucked weight    continuous    grams    weight of meat
    Viscera weight    continuous    grams    gut weight (after bleeding)
    Shell weight    continuous    grams    after being dried
    Rings        integer            +1.5 gives the age in years
Statistics for numeric domains:
        Length    Diam    Height    Whole    Shucked    Viscera    Shell    Rings
    Min    0.075    0.055    0.000    0.002    0.001    0.001    0.002     1
    Max    0.815    0.650    1.130    2.826    1.488    0.760    1.005     29
    Mean    0.524    0.408    0.140    0.829    0.359    0.181    0.239    9.934
    SD    0.120    0.099    0.042    0.490    0.222    0.110    0.139    3.224
    Co
el    0.557    0.575    0.557    0.540    0.421    0.504    0.628     1.0
8. Missing Attribute Values: None
9. Class Distribution:
    Class    Examples
    -----    --------
    1    1
    2    1
    3    15
    4    57
    5    115
    6    259
    7    391
    8    568
    9    689
    10    634
    11    487
    12    267
    13    203
    14    126
    15    103
    16    67
    17    58
    18    42
    19    32
    20    26
    21    14
    22    6
    23    9
    24    2
    25    1
    26    1
    27    2
    29    1
    -----    ----
    Total    4177

M,0.455,0.365,0.095,0.514,0.2245,0.101,0.15,15
M,0.35,0.265,0.09,0.2255,0.0995,0.0485,0.07,7
F,0.53,0.42,0.135,0.677,0.2565,0.1415,0.21,9
M,0.44,0.365,0.125,0.516,0.2155,0.114,0.155,10
I,0.33,0.255,0.08,0.205,0.0895,0.0395,0.055,7
I,0.425,0.3,0.095,0.3515,0.141,0.0775,0.12,8
F,0.53,0.415,0.15,0.7775,0.237,0.1415,0.33,20
F,0.545,0.425,0.125,0.768,0.294,0.1495,0.26,16
M,0.475,0.37,0.125,0.5095,0.2165,0.1125,0.165,9
F,0.55,0.44,0.15,0.8945,0.3145,0.151,0.32,19
F,0.525,0.38,0.14,0.6065,0.194,0.1475,0.21,14
M,0.43,0.35,0.11,0.406,0.1675,0.081,0.135,10
M,0.49,0.38,0.135,0.5415,0.2175,0.095,0.19,11
F,0.535,0.405,0.145,0.6845,0.2725,0.171,0.205,10
F,0.47,0.355,0.1,0.4755,0.1675,0.0805,0.185,10
M,0.5,0.4,0.13,0.6645,0.258,0.133,0.24,12
I,0.355,0.28,0.085,0.2905,0.095,0.0395,0.115,7
F,0.44,0.34,0.1,0.451,0.188,0.087,0.13,10
M,0.365,0.295,0.08,0.2555,0.097,0.043,0.1,7
M,0.45,0.32,0.1,0.381,0.1705,0.075,0.115,9
M,0.355,0.28,0.095,0.2455,0.0955,0.062,0.075,11
I,0.38,0.275,0.1,0.2255,0.08,0.049,0.085,10
F,0.565,0.44,0.155,0.9395,0.4275,0.214,0.27,12
F,0.55,0.415,0.135,0.7635,0.318,0.21,0.2,9
F,0.615,0.48,0.165,1.1615,0.513,0.301,0.305,10
F,0.56,0.44,0.14,0.9285,0.3825,0.188,0.3,11
F,0.58,0.45,0.185,0.9955,0.3945,0.272,0.285,11
M,0.59,0.445,0.14,0.931,0.356,0.234,0.28,12
M,0.605,0.475,0.18,0.9365,0.394,0.219,0.295,15
M,0.575,0.425,0.14,0.8635,0.393,0.227,0.2,11
M,0.58,0.47,0.165,0.9975,0.3935,0.242,0.33,10
F,0.68,0.56,0.165,1.639,0.6055,0.2805,0.46,15
M,0.665,0.525,0.165,1.338,0.5515,0.3575,0.35,18
F,0.68,0.55,0.175,1.798,0.815,0.3925,0.455,19
F,0.705,0.55,0.2,1.7095,0.633,0.4115,0.49,13
M,0.465,0.355,0.105,0.4795,0.227,0.124,0.125,8
F,0.54,0.475,0.155,1.217,0.5305,0.3075,0.34,16
F,0.45,0.355,0.105,0.5225,0.237,0.1165,0.145,8
F,0.575,0.445,0.135,0.883,0.381,0.2035,0.26,11
M,0.355,0.29,0.09,0.3275,0.134,0.086,0.09,9
F,0.45,0.335,0.105,0.425,0.1865,0.091,0.115,9
F,0.55,0.425,0.135,0.8515,0.362,0.196,0.27,14
I,0.24,0.175,0.045,0.07,0.0315,0.0235,0.02,5
I,0.205,0.15,0.055,0.042,0.0255,0.015,0.012,5
I,0.21,0.15,0.05,0.042,0.0175,0.0125,0.015,4
I,0.39,0.295,0.095,0.203,0.0875,0.045,0.075,7
M,0.47,0.37,0.12,0.5795,0.293,0.227,0.14,9
F,0.46,0.375,0.12,0.4605,0.1775,0.11,0.15,7
I,0.325,0.245,0.07,0.161,0.0755,0.0255,0.045,6
F,0.525,0.425,0.16,0.8355,0.3545,0.2135,0.245,9
I,0.52,0.41,0.12,0.595,0.2385,0.111,0.19,8
M,0.4,0.32,0.095,0.303,0.1335,0.06,0.1,7
M,0.485,0.36,0.13,0.5415,0.2595,0.096,0.16,10
F,0.47,0.36,0.12,0.4775,0.2105,0.1055,0.15,10
M,0.405,0.31,0.1,0.385,0.173,0.0915,0.11,7
F,0.5,0.4,0.14,0.6615,0.2565,0.1755,0.22,8
M,0.445,0.35,0.12,0.4425,0.192,0.0955,0.135,8
M,0.47,0.385,0.135,0.5895,0.2765,0.12,0.17,8
I,0.245,0.19,0.06,0.086,0.042,0.014,0.025,4
F,0.505,0.4,0.125,0.583,0.246,0.13,0.175,7
M,0.45,0.345,0.105,0.4115,0.18,0.1125,0.135,7
M,0.505,0.405,0.11,0.625,0.305,0.16,0.175,9
F,0.53,0.41,0.13,0.6965,0.302,0.1935,0.2,10
M,0.425,0.325,0.095,0.3785,0.1705,0.08,0.1,7
M,0.52,0.4,0.12,0.58,0.234,0.1315,0.185,8
M,0.475,0.355,0.12,0.48,0.234,0.1015,0.135,8
F,0.565,0.44,0.16,0.915,0.354,0.1935,0.32,12
F,0.595,0.495,0.185,1.285,0.416,0.224,0.485,13
F,0.475,0.39,0.12,0.5305,0.2135,0.1155,0.17,10
I,0.31,0.235,0.07,0.151,0.063,0.0405,0.045,6
M,0.555,0.425,0.13,0.7665,0.264,0.168,0.275,13
F,0.4,0.32,0.11,0.353,0.1405,0.0985,0.1,8
F,0.595,0.475,0.17,1.247,0.48,0.225,0.425,20
M,0.57,0.48,0.175,1.185,0.474,0.261,0.38,11
F,0.605,0.45,0.195,1.098,0.481,0.2895,0.315,13
F,0.6,0.475,0.15,1.0075,0.4425,0.221,0.28,15
M,0.595,0.475,0.14,0.944,0.3625,0.189,0.315,9
F,0
Answered 7 days After Apr 21, 2021

Solution

Sandeep Kumar answered on Apr 28 2021
143 Votes
ABALONE CLUSTERING
In part 1, with odd wards being clustered by hierarchical clustering it was found from the dendrogram the number of classes adequate is 3.
While for even wards being clustered by K Means the number of clusters, optimum was 3.
In part II, after performing labelencoder() for the classes, and using silhouette_score for finding the optimal K, the result was found to be 2. For K=2, the dataset had a clear threshold and perfectly fitted model for future prediction
The threshold was found to be 2.
Moreover, PCA was conducted...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here