Great Deal! Get Instant $10 FREE in Account on First Order + 10% Cashback on Every Order Order Now

Specimen,Num attachments,inc excutable,inc ZIP,inc PDF,inc DOC,Unknown Format,URL count,outside network,Email Size,Verified as Malware VS0001,1,Yes,No,No,No,No,1,Yes,76172,...

1 answer below »
Specimen,Num attachments,inc excutable,inc ZIP,inc PDF,inc DOC,Unknown Format,URL count,outside network,Email Size,Verified as Malware
VS0001,1,Yes,No,No,No,No,1,Yes,76172,
VS0002,1,No,Yes,No,No,No,0,Yes,248404,
VS0003,1,No,No,No,No,Yes,2,Yes,2841,Yes
VS0004,2,Yes,No,No,Yes,No,0,Yes,132988,Yes
VS0005,1,No,No,No,Yes,No,1,Yes,117140,
VS0006,2,No,No,Yes,Yes,No,0,Yes,127923,
VS0007,0,No,No,No,No,No,0,Yes,1660,
VS0008,1,No,No,No,No,Yes,1,Yes,1542,Yes
VS0009,0,No,No,No,No,No,0,,4405,
VS0010,0,No,No,No,No,No,0,Yes,2397,
VS0011,3,Yes,No,Yes,No,Yes,0,Yes,90557,
VS0012,1,No,No,No,No,Yes,3,Yes,2257,Yes
VS0013,0,No,No,No,No,No,0,,3003,
VS0014,2,No,No,No,No,Yes,0,Yes,337252,
VS0015,4,No,No,Yes,No,Yes,0,Yes,281918,
VS0016,1,No,No,No,Yes,No,0,Yes,159965,
VS0017,12,No,No,No,Yes,Yes,0,Yes,1832141,
VS0018,1,No,No,No,No,Yes,1,Yes,547919,
VS0019,0,No,No,No,No,No,0,,3440,
VS0020,1,No,No,No,No,Yes,0,,214397,
VS0021,2,No,No,Yes,Yes,No,0,Yes,228191,
VS0022,1,No,Yes,No,No,No,3,Yes,31347,Yes
VS0023,7,No,No,No,No,Yes,0,Yes,86098,
VS0024,0,No,No,No,No,No,0,Yes,3021,
VS0025,3,No,No,No,No,Yes,0,Yes,243311,
VS0026,2,No,No,No,No,Yes,0,Yes,143480,
VS0027,0,No,No,No,No,No,0,Yes,3226,
VS0028,1,No,No,No,No,Yes,1,Yes,63547,
VS0029,0,No,No,No,No,No,0,Yes,3381,
VS0030,0,No,No,No,No,No,0,Yes,2965,
VS0031,1,No,No,No,No,Yes,3,Yes,1546,Yes
VS0032,0,No,No,No,No,No,0,Yes,3447,
VS0033,1,No,No,No,No,Yes,3,Yes,2505,Yes
VS0034,1,No,No,No,No,Yes,3,Yes,1047,Yes
VS0035,3,No,No,No,No,Yes,0,Yes,311384,
VS0036,4,No,No,No,Yes,Yes,0,,6534551,
VS0037,0,No,No,No,No,No,0,Yes,2746,
VS0038,1,No,No,Yes,No,No,0,Yes,120543,
VS0039,0,No,No,No,No,No,0,Yes,3051,
VS0040,1,No,No,No,No,Yes,3,Yes,1753,Yes
VS0041,0,No,No,No,No,No,0,Yes,4964,
VS0042,3,No,No,Yes,No,Yes,0,Yes,194948,
VS0043,0,No,No,No,No,No,0,Yes,2753,
VS0044,3,No,No,Yes,No,Yes,0,,129767,
VS0045,0,No,No,No,No,No,0,Yes,4452,
VS0046,0,No,No,No,No,No,0,Yes,1081,
VS0047,2,No,No,Yes,No,Yes,0,,65351,
VS0048,1,No,No,No,No,Yes,0,Yes,245290,
VS0049,1,No,No,Yes,No,No,0,Yes,348898,
VS0050,1,Yes,No,No,No,No,0,Yes,36045,Yes
VS0051,0,No,No,No,No,No,0,Yes,1741,
VS0052,1,No,No,No,No,Yes,0,,51171,
VS0053,0,No,No,No,No,No,0,,3580,
VS0054,3,No,No,Yes,No,Yes,0,Yes,102171,
VS0055,1,No,No,Yes,No,No,0,,101480,
VS0056,1,No,No,No,No,Yes,0,,61616,
VS0057,2,No,No,No,Yes,Yes,1,,16129,
VS0058,0,No,No,No,No,No,0,,4488,
VS0059,1,No,No,No,No,Yes,1,Yes,2314,Yes
VS0060,2,No,No,No,No,Yes,0,,237898,
VS0061,0,No,No,No,No,No,0,Yes,3606,
VS0062,1,No,No,No,Yes,No,0,Yes,19908,Yes
VS0063,2,Yes,No,No,No,Yes,0,Yes,4853,
VS0064,0,No,No,No,No,No,0,Yes,3808,
VS0065,1,No,No,Yes,No,No,0,,199577,
VS0066,0,No,No,No,No,No,0,Yes,3520,
VS0067,1,No,No,Yes,No,No,0,,293976,
VS0068,2,Yes,No,No,Yes,No,3,Yes,1948,Yes
VS0069,2,No,No,No,No,Yes,0,Yes,16289,
VS0070,3,Yes,No,Yes,Yes,No,0,Yes,64428,Yes
VS0071,1,No,No,Yes,No,No,0,Yes,175709,
VS0072,0,No,No,No,No,No,0,,3067,
VS0073,2,No,No,Yes,No,Yes,0,Yes,362159,
VS0074,1,No,No,No,No,Yes,1,Yes,9425,
VS0075,1,No,No,No,No,Yes,0,Yes,168632,
VS0076,0,No,No,No,No,No,0,Yes,2609,
VS0077,2,No,No,Yes,No,Yes,0,Yes,652046,
VS0078,1,No,No,Yes,No,No,0,Yes,68650,
VS0079,1,No,No,Yes,No,No,1,Yes,212416,
VS0080,2,No,No,No,No,Yes,0,,160514,
VS0081,0,No,No,No,No,No,0,Yes,3224,
VS0082,0,No,No,No,No,No,0,,2941,
VS0083,0,No,No,No,No,No,0,Yes,2843,
VS0084,0,No,No,No,No,No,0,Yes,4095,
VS0085,0,No,No,No,No,No,0,Yes,3047,
VS0086,1,No,No,No,No,Yes,0,Yes,138768,
VS0087,1,No,No,No,No,Yes,3,Yes,2114,Yes
VS0088,0,No,No,No,No,No,2,,2976,
VS0089,4,Yes,No,No,No,Yes,0,Yes,347282,
VS0090,1,No,Yes,No,No,No,0,Yes,19390,Yes
VS0091,2,No,No,Yes,Yes,No,3,Yes,14011,Yes
VS0092,4,No,No,No,Yes,Yes,0,Yes,4869737,
VS0093,1,No,Yes,No,No,No,0,Yes,26285,Yes
VS0094,0,No,No,No,No,No,0,,3821,
VS0095,2,Yes,No,No,Yes,No,0,Yes,18928,Yes
VS0096,1,No,No,No,No,Yes,0,Yes,332639,
VS0097,1,No,No,No,No,Yes,1,,309972,
VS0098,3,No,No,Yes,Yes,Yes,0,Yes,115102,
VS0099,0,No,No,No,No,No,0,Yes,3393,
VS0100,0,No,No,No,No,No,0,Yes,2149,
VS0101,7,No,No,Yes,No,Yes,0,,158759,
VS0102,2,No,No,Yes,Yes,No,0,Yes,145296,
VS0103,1,No,No,No,No,Yes,1,Yes,2590,Yes
VS0104,1,No,No,No,No,Yes,0,Yes,179314,
VS0105,1,No,No,No,No,Yes,0,,46784,
VS0106,1,No,No,No,No,Yes,0,Yes,4126274,
VS0107,2,No,No,No,No,Yes,0,Yes,7124268,
VS0108,0,No,No,No,No,No,0,Yes,3117,
VS0109,0,No,No,No,No,No,1,,3650,
VS0110,0,No,No,No,No,No,0,Yes,2142,
VS0111,3,No,No,Yes,No,Yes,0,Yes,314432,
VS0112,0,No,No,No,No,No,0,,1787,
VS0113,0,No,No,No,No,No,0,Yes,3718,
VS0114,0,No,No,No,No,No,0,Yes,2377,
VS0115,2,Yes,No,Yes,No,No,0,Yes,10689,Yes
VS0116,2,No,No,Yes,Yes,No,0,Yes,70026,Yes
VS0117,1,No,No,No,No,Yes,0,,2976160,
VS0118,3,No,No,Yes,Yes,Yes,0,Yes,207515,
VS0119,0,No,No,No,No,No,0,Yes,2452,
VS0120,1,No,No,No,No,Yes,1,Yes,779,Yes
VS0121,0,No,No,No,No,No,0,Yes,2898,
VS0122,0,No,No,No,No,No,0,Yes,4328,
VS0123,3,Yes,No,Yes,Yes,No,0,Yes,102250,Yes
VS0124,0,No,No,No,No,No,0,,2297,
VS0125,1,No,No,No,Yes,No,0,Yes,84493,
VS0126,0,No,No,No,No,No,0,Yes,2221,
VS0127,0,No,No,No,No,No,0,Yes,1606,
VS0128,2,Yes,No,No,No,Yes,0,Yes,323368,
VS0129,3,No,No,No,No,Yes,0,Yes,130786,
VS0130,1,No,No,No,No,Yes,3,Yes,2217,Yes
VS0131,3,No,No,Yes,Yes,Yes,0,,217428,
VS0132,0,No,No,No,No,No,0,Yes,2953,
VS0133,1,No,No,No,Yes,No,0,,88030,
VS0134,0,No,No,No,No,No,0,Yes,4151,
VS0135,4,No,No,Yes,Yes,Yes,0,Yes,207624,
VS0136,2,No,No,No,No,Yes,0,,339246,
VS0137,1,No,No,Yes,No,No,0,,100431,
VS0138,0,No,No,No,No,No,0,Yes,313,
VS0139,1,No,No,No,No,Yes,2,Yes,2298,Yes
VS0140,1,No,No,No,Yes,No,0,,175396,
VS0141,1,No,No,Yes,No,No,0,Yes,48801,
VS0142,3,No,No,No,No,Yes,0,Yes,341595,
VS0143,0,No,No,No,No,No,0,Yes,3110,
VS0144,1,No,No,Yes,No,No,4,Yes,300233,
VS0145,2,No,No,Yes,Yes,No,0,Yes,44485,
VS0146,2,No,No,No,No,Yes,1,Yes,305250,
VS0147,0,No,No,No,No,No,0,Yes,2953,
VS0148,1,No,No,No,Yes,No,3,Yes,37245,Yes
VS0149,4,No,No,No,No,Yes,0,Yes,252039,
VS0150,2,Yes,No,Yes,No,No,4,Yes,31866,Yes
VS0151,1,No,No,Yes,No,No,0,Yes,68701,
VS0152,2,No,No,Yes,No,Yes,0,Yes,245030,
VS0153,1,No,Yes,No,No,No,0,Yes,122747,
VS0154,3,No,No,Yes,No,Yes,1,,467391,
VS0155,2,Yes,No,No,Yes,No,0,Yes,62865,Yes
VS0156,0,No,No,No,No,No,0,,935,
VS0157,3,No,No,No,No,Yes,0,Yes,273500,
VS0158,1,No,No,No,Yes,No,0,Yes,172379,
VS0159,0,No,No,No,No,No,0,Yes,3001,
VS0160,0,No,No,No,No,No,0,Yes,2774,
VS0161,1,No,No,No,No,Yes,0,Yes,263618,
VS0162,0,No,No,No,No,No,0,Yes,3004,
VS0163,0,No,No,No,No,No,0,Yes,1921,
VS0164,1,No,No,No,Yes,No,0,,217514,
VS0165,4,No,No,Yes,No,Yes,0,Yes,248239,
VS0166,1,No,No,No,No,Yes,0,Yes,160174,
VS0167,0,No,No,No,No,No,0,Yes,2321,
VS0168,4,Yes,No,No,No,Yes,0,Yes,260519,
VS0169,0,No,No,No,No,No,0,Yes,3253,
VS0170,0,No,No,No,No,No,0,Yes,3943,
VS0171,0,No,No,No,No,No,0,Yes,3707,
VS0172,0,No,No,No,No,No,0,Yes,4060,
VS0173,7,No,No,No,No,Yes,0,,164982,
VS0174,2,No,No,Yes,No,Yes,3,,565610,
VS0175,0,No,No,No,No,No,0,Yes,3922,
VS0176,0,No,No,No,No,No,0,Yes,3357,
VS0177,3,No,No,No,Yes,Yes,0,Yes,326782,
VS0178,1,No,No,Yes,No,No,0,,197340,
VS0179,0,No,No,No,No,No,0,,3164,
VS0180,1,No,No,Yes,No,No,0,Yes
Answered Same Day Apr 03, 2021

Solution

Subhanbasha answered on Apr 04 2021
141 Votes
Principal Component Analysis
Data preparation steps
The data is mostly in form of categorical variables which is yes/no format. The datapoints consisted of “yes” only. So, we made of probability values and replaced yes/no in the place of the null values.
Specimen Num.attachments inc.excutable inc.ZIP inc.PDF inc.DOC
1 VS0001 1 Yes No No No
2 VS0002 1 No Yes No No
3 VS0003 1 No No No No
4 VS0004 2 Yes No No Yes
5 VS0005 1 No No No Yes
6 VS0006 2 No No Yes Yes
Unknown.Format URL.count outside.network Email.Size Verified.as.Malware
1 No 1 Yes 76172 2 No 0 Yes 248404 3 Yes 2 Yes 2841 Yes
4 No 0 Yes 132988 Yes
5 No 1 Yes 117140 6 No 0 Yes 127923 From the above output we have qualitative data, this is generally not accepted by Principal Component method. So, we make use of dummies to transform the data into numerical variables.
Then we took a random sample by making use of the sample function in R and tranformed it into a separate dataset. Then we checked for the missing elements if any were present using the sapply() function. Then we split the data into two parts by separating the sample data. Then we used the
ind() function to combine the dataframes. Then convert...
SOLUTION.PDF

Answer To This Question Is Available To Download

Related Questions & Answers

More Questions »

Submit New Assignment

Copy and Paste Your Assignment Here