What is the entropy of this collection of training examples with respect to the positive class B. What are the information gains of A1 and A2 relative to the training dataset For A3, which is a continuous attribute, compute the information gain for every possible split. C. What is the best split (among A1,A2, and A3) according to the information gain

Question

05-04-2021
Chemistry

Answered

Discover the answers to your questions at Westonci.ca, where experts share their knowledge and insights with you. Get immediate and reliable solutions to your questions from a community of experienced professionals on our platform. Connect with a community of professionals ready to provide precise solutions to your questions quickly and accurately.

What is the entropy of this collection of training examples with respect to the positive class B. What are the information gains of A1 and A2 relative to the training dataset For A3, which is a continuous attribute, compute the information gain for every possible split. C. What is the best split (among A1,A2, and A3) according to the information gain

Sagot :

Thanks for stopping by. We strive to provide the best answers for all your questions. See you again soon. Thanks for using our service. We're always here to provide accurate and up-to-date answers to all your queries. Thank you for trusting Westonci.ca. Don't forget to revisit us for more accurate and insightful answers.

Prions are proteins that act as an infectious agent. They cause a variety of diseases, including "mad cow" disease. Prions cannot produce more potions on their

The process of ___________________leads to the formation of new species by ___________________.

How do I do this? Can someone fill in the blanks

During which of the following processes do vesicles sometime fuse with lysosomes

How did the Great Society programs affect housing in the United States? The rates for luxury housing increased. Discrimination practices in housing rose. The am

Prions are proteins that act as an infectious agent. They cause a variety of diseases, including "mad cow" disease. Prions cannot produce more potions on their

how does the water get to the leaves in the tops of the tallest trees against the force of gravity?

. In a famous speech, William Jennings Bryan A. refused to be allied with the so-called "silverists." B. declared his staunch support of the gold standard. C. r

Which organelle can be compared to a security guard who decides whom may enter a building and whom may not?

why is 16:14 and 64:60 not equivalent

AbsorbingMan AbsorbingMan · Answer 1 · 2021-04-07T08:28:18-04:00

The data set is missing in the question. The data set is given in the attachment.

Solution :

a). In the table, there are four positive examples and give number of negative examples.

Therefore,

[tex]$P(+) = \frac{4}{9}$[/tex] and

[tex]$P(-) = \frac{5}{9}$[/tex]

The entropy of the training examples is given by :

[tex]$ -\frac{4}{9}\log_2\left(\frac{4}{9}\right)-\frac{5}{9}\log_2\left(\frac{5}{9}\right)$[/tex]

= 0.9911

b). For the attribute all the associating increments and the probability are :

[tex]$a_1$[/tex] + -

T 3 1

F 1 4

Th entropy for [tex]$a_1$[/tex] is given by :

[tex]$\frac{4}{9}[ -\frac{3}{4}\log\left(\frac{3}{4}\right)-\frac{1}{4}\log\left(\frac{1}{4}\right)]+\frac{5}{9}[ -\frac{1}{5}\log\left(\frac{1}{5}\right)-\frac{4}{5}\log\left(\frac{4}{5}\right)]$[/tex]

= 0.7616

Therefore, the information gain for [tex]$a_1$[/tex] is

0.9911 - 0.7616 = 0.2294

Similarly for the attribute [tex]$a_2$[/tex] the associating counts and the probabilities are :

[tex]$a_2$[/tex] + -

T 2 3

F 2 2

Th entropy for [tex]$a_2$[/tex] is given by :

[tex]$\frac{5}{9}[ -\frac{2}{5}\log\left(\frac{2}{5}\right)-\frac{3}{5}\log\left(\frac{3}{5}\right)]+\frac{4}{9}[ -\frac{2}{4}\log\left(\frac{2}{4}\right)-\frac{2}{4}\log\left(\frac{2}{4}\right)]$[/tex]

= 0.9839

Therefore, the information gain for [tex]$a_2$[/tex] is

0.9911 - 0.9839 = 0.0072

[tex]$a_3$[/tex] Class label split point entropy Info gain

1.0 + 2.0 0.8484 0.1427

3.0 - 3.5 0.9885 0.0026

4.0 + 4.5 0.9183 0.0728

5.0 -

5.0 - 5.5 0.9839 0.0072

6.0 + 6.5 0.9728 0.0183

7.0 +

7.0 - 7.5 0.8889 0.1022

The best split for [tex]$a_3$[/tex] observed at split point which is equal to 2.

c). From the table mention in part (b) of the information gain, we can say that [tex]$a_1$[/tex] produces the best split.

Sagot :

Other Questions