Equalities (all values are approximate unless specified as exact)

Length
1 foot (ft) = 12 inches (in) (Exact)
1 yard (yd) = 3 feet (ft) (Exact)
1 mile (mi) = 5,280 feet (ft) (Exact)

Mass (Weight)
1 pound (lb) = 16 ounces (oz) (Exact)
1 ton (T) = 2,000 pounds (lb) (Exact)

Volume
1 cup (c) = 8 fluid ounces (fl oz) (Exact)
1 pint (pt) = 2 cups (c) (Exact)
1 quart (qt) = 2 pints (pt) (Exact)
1 gallon (gal) = 4 quarts (qt) (Exact)

Length
1 inch (in) = 2.54 centimeters (cm) (Exact)
1 foot (ft) = 0.3048 meters (m) (Exact)
1 yard (yd) = 0.9144 meters (m) (Exact)
1 mile (mi) ≈ 1.6093 kilometers (km)

Mass
1 ounce (oz) ≈ 28.35 grams (g)
1 pound (lb) ≈ 0.4536 kilograms (kg)

Pressure
1 atm = 1.01325 × 10⁵ Pa
1 atm = 760 torr
1 atm = 760 mmHg
1 atm = 14.6959 psi

Volume
1 fluid ounce (fl oz) ≈ 29.57 mL
1 quart (qt) ≈ 0.9464 L
1 gallon (gal) ≈ 3.785 L

Temperature
Fahrenheit to Celsius: °C = (°F − 32) × 5/9 (exact by definition)
Celsius to Fahrenheit: °F = (°C × 9/5) + 32 (exact by definition)
Celsius to Kelvin: K = °C + 273.15

Fundamental Constants
Avogadro’s Number (mole) = 6.022 × 10²³ mol⁻¹
Ideal Gas Constant (R): R = 0.08206 L⋅atm/(mol⋅K); R = 8.314 J/(mol⋅K)
Planck’s Constant (h): h = 6.626 × 10⁻³⁴ J⋅s
Speed of Light (c): c = 2.99792458 × 10⁸ m/s
Faraday Constant (F): F = 96,485 C/mol e⁻

Physical Equations
Ideal Gas Law: PV = nRT
Energy of a Photon: E = hν
Mass-Energy Equivalence: E = mc²
Electrochemical Charge: Q = nF
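As a rough illustration of how the temperature conversions and physical equations on this sheet combine, here is a minimal Python sketch. The function and constant names are made up for this example; the numeric values are the ones listed on the sheet.

```python
# Constants copied from the reference sheet (names are illustrative).
R_L_ATM = 0.08206      # ideal gas constant, L·atm/(mol·K)
H_PLANCK = 6.626e-34   # Planck's constant, J·s


def fahrenheit_to_celsius(temp_f):
    """°C = (°F − 32) × 5/9"""
    return (temp_f - 32) * 5 / 9


def celsius_to_kelvin(temp_c):
    """K = °C + 273.15"""
    return temp_c + 273.15


def moles_from_ideal_gas(pressure_atm, volume_l, temp_k):
    """Solve PV = nRT for n, with P in atm, V in L, T in K."""
    return pressure_atm * volume_l / (R_L_ATM * temp_k)


def photon_energy(frequency_hz):
    """E = hν, with ν in Hz and E in joules."""
    return H_PLANCK * frequency_hz


# Example: moles of gas in 22.4 L at 1 atm and 32 °F (0 °C).
temp_k = celsius_to_kelvin(fahrenheit_to_celsius(32))
n = moles_from_ideal_gas(1.0, 22.4, temp_k)
```

Because R is given in L⋅atm/(mol⋅K), pressure must be in atm and volume in liters; use the pressure and volume equalities to convert other units first.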

Problem 6 (22 points) Information Gain and Split Plans

Consider the following data set for a binary class problem. Show your work to calculate the classification error rate when splitting on A and on B. Which attribute would the decision tree induction algorithm choose? The definition of misclassification error is: Error(t) = 1 − max_i p(i | t), where p(i | t) is the fraction of records at node t that belong to class i.

(5 Points) The overall misclassification error before splitting:
(5 Points) The gain in misclassification error after splitting on A:
(5 Points) The gain in misclassification error after splitting on B:
(3 Points) Which attribute would the decision tree choose:
(4 Points) There are three impurity measurements: entropy, misclassification error, and Gini index. Which one is the best for measuring impurity, and why?
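The calculation asked for here can be sketched in Python as follows. This assumes the standard definition Error(t) = 1 − max_i p(i | t) and defines the gain as the parent error minus the size-weighted error of the children. The class counts below are hypothetical and are NOT the problem's data set, which is not reproduced here; the function names are also illustrative.

```python
def misclassification_error(counts):
    """Error(t) = 1 - max_i p(i | t), given per-class record counts at a node."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return 1.0 - max(counts) / total


def error_gain(parent_counts, children_counts):
    """Parent error minus the size-weighted average error of the child nodes."""
    n = sum(parent_counts)
    weighted = sum(
        sum(child) / n * misclassification_error(child)
        for child in children_counts
    )
    return misclassification_error(parent_counts) - weighted


# Hypothetical counts, each list is [class +, class -]. Not the exam data.
parent = [4, 6]
split_a = [[4, 3], [0, 3]]   # child nodes produced by a split on A
split_b = [[3, 1], [1, 5]]   # child nodes produced by a split on B

gain_a = error_gain(parent, split_a)
gain_b = error_gain(parent, split_b)
chosen = "A" if gain_a > gain_b else "B"
```

With these made-up counts the parent error is 0.4, the split on A gives a gain of about 0.1, and the split on B gives a gain of about 0.2, so the induction algorithm would pick B. The same two functions apply to the real table once its counts are filled in.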

Problem 5: (14 points) Clustering Algorithm Analysis

1) (5 Points) Give a scenario in which K-means clustering may NOT work very well (2 points). Explain why (3 points).
2) (5 Points) Give a scenario in which the DBSCAN algorithm may NOT work very well (2 points). Explain why (3 points).
3) (4 Points) How can we improve the random initialization of the classic K-means algorithm?
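One commonly cited way to improve on purely random initialization is k-means++ seeding, which picks each new center with probability proportional to its squared distance from the nearest center already chosen. The sketch below is a minimal pure-Python illustration of that idea, not necessarily the answer the problem expects; the function names and sample points are invented for this example.

```python
import random


def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans_pp_init(points, k, seed=None):
    """Choose k initial centers from points using k-means++ seeding."""
    rng = random.Random(seed)
    # First center: uniformly at random.
    centers = [rng.choice(points)]
    while len(centers) < k:
        # Weight each point by its squared distance to the nearest chosen center.
        weights = [min(squared_distance(p, c) for c in centers) for p in points]
        if sum(weights) == 0:
            # Every point coincides with a chosen center; fall back to uniform.
            centers.append(rng.choice(points))
        else:
            centers.append(rng.choices(points, weights=weights, k=1)[0])
    return centers


# Hypothetical data: two well-separated groups of 2-D points.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
centers = kmeans_pp_init(points, k=2, seed=0)
```

Points far from the existing centers are favored, so the initial centers tend to spread across the data rather than land in the same cluster. Running K-means several times from different random starts and keeping the run with the lowest sum of squared errors is another frequently used remedy.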