05/09/2003 – 17/09/2003
Kartavya / Abhi / Sanjeev
Self-Learning Software
How does a one year old child learn to differentiate between
colours Red & Blue, and beyond that between different shades of Red?
This is another way of asking
“How does Learning take place? What steps are involved in
the learning process? “
There are no fool proof / ironclad / undisputed scientific
theories. But the empirical evidence leads us to believe that the process (of learning),
occurs, somewhat as follows:
A mother points a finger at a colour and speaks aloud “RED”.
The sound is captured by the child & stored in his memory.
This process is repeated a thousand times and with each
repetition, the memory gets etched, deeper & deeper.
An “association “develop between the colour & the sound.
Then the process is repeated with colour BLUE & another
memory gets “etched “deeply.
So, on 1001 occasion, when a colour patch is shown to child
& question asked,
“What colour is this? “
Child says “RED “perhaps, even without understanding the
question (then meaning of the question).
There is, merely, an “association “between what child SEES (sight)
& what child HEARS (sound)
The process can be repeated by,
Ø
Showing RED colour patch , and
Ø
Showing a placard (flag), with RED written on it
in big / bold Letters.
Now child “associates “the patch (SIGHT) with placard (also
another SIGHT). No Sound.
So, next time a child is shown patch of red colour, he will
pick up the sign / placard, learning word RED.
Input (Sound)
Input (patch) Association / Memory (stored)
So next time, what happens?
![]() |
Input (Red
patch) (sight) Input
Recall from memory / compare (database search)
RED
output (SOUND)
OR
![]() |
Input
( SIGHT)(RED patch)
Input (SIGHT) Recall
from Memory /
Database words RED & match-making
Pickup
flag bearing letters
RED
Remember that two MAIN inputs to a brain are
Ø
Sight ( Eyes) ----- 80% of learning takes place here
Ø
Sound ( Ears ) --- 10% of learning takes place here
Of course, there are other, relatively minor inputs of
Ø
Touch / Feel ( Skin
) Balance 10% of learning
Ø
Smell ( Nose) takes place thru this
Ø
Taste ( Tongue) INPUT – DEVICES
In the examples listed earlier, MOTHER acts as a human expert,
who initiates the learning – process by establishing “references / the
bench-marks.”
In essence, she uses the process (of showing patch &
speaking aloud or showing patch & showing placard), to transmit her OWN
EXPERT KNOWLEDGE to the child.
So, all knowledge flows events from a GURU!
You can even watch events & learn – without a
single word being uttered!
You can close your eyes & listen to music &
learn – without seeing who is singing!
Then there was Beethoven who was deaf but composed
great symphonies which he himself, could not hear! But this is an exception.
What is the relevance of all this to “self-Learning Software?”
Simple,
If we want to develop a software which can identify /
categories a “resume”, as belonging to
VB C++ etc…..
Then all we need, is to “show” to the software, 1000
resumes and speak aloud,
C++ !
Then 1001st time, when the software “sees” a
similar resumes, it will speak-out loudly
C++ !
So, first of all, we need a human expert – a GURU, who,
after reading each resume, shouts
C++ or VB or ASP etc. etc……..
When Guru has accurately identified segregated 1000 resumes
each of C++ etc…..
We take those sub-sets & index their Keywords, calculate
“frequency of occurrence “of each of those keywords & assign them “weightages”
(probabilities).
Then we plot the graphs for each subset (I .e. each “skill”)
Then, when we present to this software any / next resume, it
would try to find the keywords. Let us say, it found 40 keywords. Now let us
compare these 40 keyword-set, with
Ø
VB Keyword-set
Ø
C++ Keyword-set
Ø
ASP Keyword-set
& see what happens
FIRST SCENARIO (FIRST MATCH)
![]() |
Only
10% match.
SECOND MATCH
![]() |
Only
30% match.
THIRD MATCH
![]() |
Only
50% match.
We ( i.e. software ) has to keep repeating this “
match-making” exercise for a new resume, with
ALL THE KEYWORDS – SETS
Till it find the highest/ best match.
BINGO
The new resume belongs to an “ASP” guy!
(Self-learning Software – cont.)
That way the FIRST METHOD, where a human expert reads thru
30000 resumes & then regroups these into smaller sub-sets of 1000
resumes-each belonging to different “skill-sets”
This will be a very slow method!
SECOND METHOD
Here, instead of a (one) expert going thru 30000 resumes, we
employee 30000 experts the jobseekers themselves!
Obviously, this METHOD would be very fast!
Underlying premises is this.
No one knows better than the jobseeker himself, as to what
precisely is his CORE AREA OF COMPETENCE / SKILL.
Is my skill
·
VB
·
C ++
·
ASP
·
.Net
So, if I have identified myself, as belonging to VB OR C++
OR ASP etc. etc….
Then you better believe it!
Now, all that we need to do, is to take 1000 resumes of all
those guys who call themselves
VB
And find “keywords” from their resumes (& of course, weightages)
If there are job sets where software guys are required to
identify themselves by their “ skills”, then best course would be to search
resumes on these jobsites by skills,
Then download the search-result resumes! Repeat this
search/download exercise for each “skill” for which we want to develop “skill –
graphs”
This approach is fairly simple and perhaps, more accurate
too.
But,
Ø
We have to find such jobsites & then satisfy
ourselves that “ Skill-wise” searching of resumes ( and downloading too ) is
Possible
Ø
Then Subscribe for 1 Month / 3 Month, by paying
Rs.20000/40000! There is a cost factor, here
THIRD METHOD
We have, already downloaded from various jobsites 150000 job
advts. For each of these we know the “Actual Designation / Vacancy-Name/Position”
(thru Auto – converter)
We can re-group these advts. According to identical /
similar vacancy names / actual design where we finish, we may get, against each
specific “Vacancy – Name’
500 to 5000 Job- advts.
Call each a sub- set (Vacancy-Name-wise)
Then index keywords of each subset & calculate frequency
–of-usages (weightage).
So, now, we have Profile-Graphs, which are not skill-wise,
but which are “Vacancy-Name” wise!
This can be done real fast & cheap! And, may suffice
Software Companies’ BROADER needs, A quick beginning can be made & Result
shown within a week!!!