Professional Documents
Culture Documents
What is Learning?
Changes in the system that are adaptive in the
sense that they enable the system to do the same task
or tasks drawn from the same population more
efficiently the next time."
--Herbert Simon
"Learning is constructing or modifying
representations of what is being experienced."
--Ryszard Michalski
"Learning is making useful changes in our minds."
--Marvin Minsky
Sensors
Learning Element
Problem Generator
Performance Element
Effectors
Clustering
It is unsupervised, inductive learning in which "natural classes"
are found for data instances, as well as ways of classifying them.
Discovery
- Unsupervised learning, specific goal is not given.
- It is both inductive and deductive learning in which system learns
without the help from a teacher.
- It is deductive if it proves theorems and discovers concepts about
those theorems.
- It is inductive when it raises conjectures (formulation of opinion
using incomplete information).
Reinforcement
- Only feedback (positive or negative reward) given at the end of a
sequence of steps.
Requires assigning reward to steps by solving the credit assignment
problem i.e., which steps should receive credit or blame for a final
result?
Inductive Bias
Inductive learning is an inherently conjectural process
because any knowledge created by generalization from
specific facts cannot be proven true; it can only be proven
false. Hence, inductive inference is falsity preserving, not
truth preserving.
To generalize beyond the specific training examples, we
need constraints or biases on what f is best.
That is, learning can be viewed as searching the
Hypothesis Space H of possible f functions.
A bias allows us to choose one f over another one.
A completely unbiased inductive algorithm could only
memorize the training examples and could not say anything
more about other unseen examples.
Two types of biases are commonly used in machine
learning:
Case Studies
Many case studies have shown that decision trees are at
least as accurate as human experts.
For example, one study for diagnosing breast cancer had
humans correctly classifying the examples 65% of the
time, and the decision tree classified 72% correct.
British Petroleum designed a decision tree for gas-oil
separation for offshore oil platforms.
Replaced a rule-based expert system.
Cessna designed an airplane flight controller using
90,000 examples and 20 attributes per example.
Deductive learning :
- Deductive learning works on existing facts and knowledge and
deduces new knowledge from the old. So deductive learning or
reasoning can be described as reasoning of the form if A then B.
- Arguably deductive learning does not generate "new"
knowledge at all, it simply memorizes the logical consequences
of what is known already.
- Deduction is in some sense the direct application of
knowledge in the production of new knowledge.
- However, this new knowledge does not represent any new
semantic information: the rule represents the knowledge
completely as the added knowledge since any time the assertions
(A) are true then the conclusion B is true as well.
- Purely deductive learning includes method like explanation
based learning.
Explanation-based learning
This is based on deductive learning concept which converts
principles into usable rules.
This kind of learning occurs when the system finds an
explanation of an instance it has seen, and generalizes the
explanation.
The general rule follows logically from the background
knowledge possessed by the system.
The basic idea is to construct an explanation of the observed
result, and then generalize the explanation.
Then a new rule is built in which the left-hand side is the leaves
of the proof tree, and the right-hand side is the variablized goal,
up to any bindings that must be made with the generalized proof.
Any conditions true regardless of the variables are dropped.
PAC learning
Probably Approximately Correct learning. A system is PAC
if Pr[error(f, h) > e] < d, where e is the accuracy parameter,
d is the confidence parameter and h is hypothesis.
Bayesian learning
Learning that treats the problem of building hypotheses as a
particular case of the problem of making predictions.
The probabilities of various hypotheses are estimated, and
predictions are made using the posterior probabilities of the
hypotheses to weight them.
Adaptive dynamic programming
Adaptive dynamic programming is any kind of reinforcement
learning method that works by solving the utility equations
using a dynamic programming algorithm.
Neural network
A computational model somewhat similar to the human
brain; it has many simple units that work in parallel with no
central control. Connections between units are weighted, and
these weights can be modified by the learning system.
It is a form of Connectionist learning in which the data
structure is a set of nodes connected by weighted links, each
node passing a 0 or 1 to other links depending on whether a
function of its inputs reaches its activation level.
Relevance-based learning
This is a kind of in which background knowledge relates the
relevance of a set of features in an instance to the general
goal predicate.
Encoding of a Chromosome
A chromosome should in some way contain information about
solution that it represents.
The commonly used way of encoding is a binary string.
Each chromosome is represented by a binary string and could
look like this:
Chromosome 1
1101100100110110
Chromosome 2
1101111000011110
Each bit in the string can represent some characteristics of the
solution.
There are many other ways of encoding. The encoding depends
mainly on the problem.
Crossover
After we have decided what encoding we will use, we can
proceed to crossover operation.
Mutation
After a crossover is performed, mutation takes place.
Mutation is intended to prevent falling of all solutions in
the population into a local optimum of the problem.
Mutation operation randomly changes the offspring
resulted from crossover. In case of binary encoding we can
switch a few randomly chosen bits from 1 to 0 or from 0 to 1.
Mutation can be then illustrated as follows:
Original offspring 1 1101111000011110
Original offspring 2 1101100100110110
Mutated offspring 1 1100111000011110
Mutated offspring 2 1101101100110110
The technique of mutation (as well as crossover) depends
mainly on the encoding of chromosomes.
Binary Encoding
1. Crossover
Single point crossover - one crossover point is selected,
binary string from the beginning of the chromosome to the
crossover point is copied from the first parent, the rest is
copied from the other parent
11001011+11011111 = 11001111
2. Mutation
Bit inversion - selected bits are inverted
11001001 => 10001001
Permutation Encoding
1. Crossover
Single point crossover - one crossover point is selected, the
permutation is copied from the first parent till the crossover
point, then the other parent is scanned and if the number is
not yet in the offspring, it is added
Note: there are more ways how to produce the rest after
crossover point
(1 2 3 4 5 6 7 8 9) + (4 5 3 6 8 9 7 2 1) =
(1 2 3 4 5 6 8 9 7)
2. Mutation
Order changing - two numbers are selected and exchanged
(1 2 3 4 5 6 8 9 7) => (1 8 3 4 5 6 2 9 7)
Value Encoding
1. Crossover
All crossovers methods from binary encoding can be used
2. Mutation
Adding (for real value encoding) - a small number is added to
(or subtracted from) selected values
(1.29 5.68 2.86 4.11 5.55) => (1.29 5.68 2.73 4.22 5.55)
Tree Encoding
1. Crossover
Tree crossover - one crossover point is selected in both parents,
and the parts below crossover points are exchanged to produce
new offspring
2. Mutation
Changing operator, number - selected nodes are changed