Hi Friends,

Even as I launch this today ( my 80th Birthday ), I realize that there is yet so much to say and do. There is just no time to look back, no time to wonder,"Will anyone read these pages?"

With regards,
Hemen Parekh
27 June 2013

Now as I approach my 90th birthday ( 27 June 2023 ) , I invite you to visit my Digital Avatar ( www.hemenparekh.ai ) – and continue chatting with me , even when I am no more here physically

Translate

Saturday, 11 January 1997

SEARCH ENGINE

CLIENT | | PROFILE | CANDIDATE - DATA.

| | GENERATOR. |

| | PERSONAL DATA: |

Telephonic | Edu. Quali | Name | - Typed

Requests | occasionally | (Name of Company) | Addresses | Bio-data

(MANUAL) | | Edu. Qualification |

| | Current Employer |

Executive | Experience. | Office Phone No | - Handwritten

Search | ✓ (Level, | Resr. | Executive

Request | Range. | Past Employers | Data Sheet

(ESR) | | Periods Worked. |

| | Designations |

Advertise- | | ✓ | SEARCH | Salary | - Floppy.

ments. | ocassionally | ✓ | ENGINE | References |

(Individual) | | ✓ | KEY-WORDS |

| | ✓ | Industries |

ADVT | | ✓ | Functions |

SUMMARIES | | ✓ | Products/Services |

| | skills |

PHONE-IN | | | Knowledge | Remote | LARGELY

(Speech | | | Attitude | Log-in | INDIAN

Recognition | | | Attributes | (DIAL-UP MODEM) | EXECUTIVES

Software) | | | |

| | | | PHONE-IN | Mainly

Remote | | | | BIODATAS | Indian

Entry | | | | Executives

| | | | Remote Log-in | Mainly

INTERNET | | | | on | Oversees

ADVT. | | | | INTERNET | Executives

| | | | SCAVANGING | Mainly

| | | | on | Overseas

| | | | INTERNET | Executives

| | | | (WEBCRAWLER) |

| | | | 11-1-97. |

Proposed Layout for Storage of scanned/AI data into an intermediate file

1. Name of Table:

   HHTBL1.DBF

2. Contents:

   To store data picked up from scanned bio-datas.

3. Layout:

   Details                                    Field Name Num/Chr Len Dec

   -----------------------------------------  ---------- ------- --- ---

   Record Name(Fixed 'HT1')                  Recname      C     3

   Library Sr. No.(Fixed 201)                     Libno        N     3

   Record Status (Leave blank)              Recstat      C     1

   Executive Sr. No.                                     Exsrno       N     8

   Sub No.(by default 1)                            Subno        N     3

   Type of data stored                                Typdata      C     10

   Code related to data(if any,)              Typcode      C     10

   Actual description related to this

   data element                               Text1        C     30

   Any Numeric Data                           Numdata      N     12  4

   -----------------------------------------  ---------- ------- --- ---

   Year of Passing an exam.(for edun.only)    Passyr       N     4

   -----------------------------------------  ---------- ------- --- ---

   Speak/Read/Write-Y?(For Language only)     SRW          C     3

   (store Y else blank e.g. Y Y)

   -----------------------------------------  ---------- ------- --- ---

   Current Employment-Y?

4. Type of data stored:

   Use following codes for individual data-

   Typ data   Details

   --------   --------------------------

   SOURCE     Source of Application

   ADREF      Admt. Ref.

   BOXNO      Box No.

   ESNAME     Executive's Surname

   EFNAME     First Name

   EMNAME     Middle Name

   ESEX       Sex

   EDOB       Date of Birth stored as dd/mm/yyyy

   EADDR1     Address Line 1

   EADDR2     -2

   EADDR3     -3

   EADDR4     -4

   ECIT       City of Residential address

   EPIN       Pin code

   EPHONE     Phone nos. separated by ,

   EFAX       Fax no.

   EHOUSTYP   Type of House C-ompany flat/O-wn

4. Type of data stored(contd....):

   Typ data   Details

   --------   -------------------------------------------------

   ESAL       Executive's Current Salary-Basic + DA

   ECALLOW    -Allowances

   ECPERKS    -Perks

   ECREIMB    -Reimbursements

   ECANNUAL   -Annual benefits

   ECRETIRE   -Retirement benefits

   ECOTHER    -Other earnings

   ECGROS     -Gross salary/Annum

   ECRATING   -BGC etc. Rating

   EDUNCD     Education qualification code

   PASSYR     Year of passing the exam

   UNIV       University from where passed

   LANGCD     Language known code

   SPEAKY     Y, if can speak

   READY      Y, if can read

   WRITEY     Y, if can write

   EMCONN     Organisation in which employed

   DESGNN     Designation

   DESGLV     Level of designation (between 10 to 90)

   STARTM     Employment started during yyyy-mm

   UPTOYM     Employment ended during yyyy-mm

   DURYYMM    Period of employment in terms of years & months

   STAFNO     No. of staff members reporting

   RESPON     Responsibilities(The activity.... details)

   REPOTG     Reporting to whom(say M.D.)

   WHYLEFT    Reason for leaving(descriptive)

   MEMBORG    Membership organisation

   MEMBDESG   Designation held there

   MEMBPERI   Period of membership

   REFNM      Reference Person Details - Name

   REFDESG    - Designation

   REFADDR1   (-----Residence----) - Address Line-1

   REFADDR2   - Address Line-2

   REFADDR3   - Address Line-3

   REFADDR4   - Address Line-4

   REFPIN     - Pin code

   REFPHONE   - Phone Nos. separated by ,

   REFOADDR1  (-----Office-----) - Address Line-1

   REFOADDR2  - Address Line-2

   REFOADDR3  - Address Line-3

   REFOADDR4  - Address Line-4

   REFOPIN    - Pin code

   REFOPHONE  - Phone Nos. separated by ,

   INDUST     Code of Industry         ) Search Parameters

   WORKFN     Code of Function         )

   PRODLN     Code for Production Line handled )

   CITY       Code for Preferred City        )

   KEYW       Keyword data

   PRCONM     Name of Preferred Company      )

   COURNNM    Name of Course attended

   COURDUR    Duration of Course

   COUNTRY    Country in which the course was attended

   VCONTRY    Code of Country visited (as per postal code 91-India)

   VPERIOD    Period of visit

INPUT

SCANNED BIODATA

 

INPUT

VOICE

PHONE-IN

BIODATAS.

STEP #2

OS/2/WARP 4

Voice

Speech

Recognition

STEP #1

(implemented)

ASCH

TEXT

WORD-

FILE

Creation

Bio-Datas

in

Text form

STEP #3.

FLOPPY

INPUT

EDS on

Floppy

STEP #4

INPUT

MTNL

VSNL

INTERNET

MTNL/VSNL

INTERNET

REMOTE LOG-IN

BY CANDIDATE

STEP #5

ARDIS

DECIPHERING

WORD

RECOGNITION

Each key-word

put in respective

slots

(Temporary Data Storage)

SEARCH

MODULE

#6

CLIENT

REMOTE

#7

CANDIDATE

REMOTE

3 P

LOCAL

FAX

PRINTER

E-MAIL

#8

ARGIS

CONVERTED BIODATA

HARMONISED

DATA-BASE

MTNL

OS/2/WARP 4

VOICE

COMMAND

MTNL/MODEM

SOFTWARE

COMMANDS

2-1-1997.

STEP #   Target/Date.

         Implemented.

1        Scan bio-datas & Convert to ASCII text to create "WORDS"

2        Install OS/2/WARP 4 speech-recognition software and link it to ASCII Text File (PHONE IN BIODATA)

3.       Create Screens (with instructions-MENU) on floppy containing all EDS information for direct transfer to TEMPORARY DATA STORAGE

4        Create Software and physical link which will enable candidates to download (remotely) EDS "Screens", then enter at his leisure & when ready - upload (again by remote log-in) into our TEMPORARY DATA STORAGE. This will be a dial-up MODEM link especially for software professionals in SEEPZ.

         For all others who do not have a ready access to a computer-modem link OR who are not COMPUTER-LITERATE, we must immediately plan

         VOICE INPUT (OF BIO-DATA)

         thru plain old telephone

         and

         convert to ASCII then OS/2/WARP 4.

         In this case, literally LAKHS & LAKHS of executives can get hold of a phone (even at midnight) and dictate

HOW TO DETERMINE/DECIDE/DECIPHER

"RESPONSIBILITY-RELATED" PHRASES/

SENTENCES?

A. One simple criteria is

   "any set of words/phrases/sentences or

   even paragraphs that follow

   - I am/was responsible for - - -"

B. If not, an indirect method is to

   examine "clues".

   These "clues" are some VERBS/

   ADVERBS with or without a

   preposition attached.

   Some examples appear on

   the following pages:

   Copy given to Hugh on

   4-1-97

   31-12-96

WORDS WITH PREPOSITIONS

* In charge of

* Monitoring of

* Comply with

* Filling of

* Maintenance of

* Conversant with

* Finalization in

* Involved in

* Working Capital management through

* Fixing of

* Revising of

* Preparation of

* Comparison of

* Analysis of

* Controlling of

* Guidance on

* Operations of

* Annual Sales of

* Sales of

* Coordination with

* Designing of

* Presentation of

* Making of

* Balancing of

* Finalization of

* Selection of

* Developing of

* Upgrading of

* Imparting of

* Holding of

* Implementation of

* Maintaining of

* Organising for

* Arranging of

* Interfacing with

* Management of

* Liason with

* Involvement in

* Optimisation of

* Control of

WORDS WITHOUT A PREPOSITION

* Managed - / Managing

* Project management which involves - -

* Arrange

* Monitoring. - Research

* Controlling - Formulating

* Identified - Implementing

* Selected - Planning

* Made - Establish

* Computed - Create

* Erased - Develop

* Developed - Expand

* Set-up - Structure

* To enhance - Evolve

* Marketing - Dev Improve

* Conceiving - Widen

* Established

* Upholding

* Steering

* Designing

* Visualizing

* Liasoning

* Overseeing

* Monitoring

* Inducting

* Developing

* Formalizing

* optimizing

* Consolidation

* Build-Up

* Motivate

* Achieve

RESPONSIBILITY-RELATED WORDS

WHICH ARE

- neither ending with "ing" or "ed"

- nor accompanied by a "preposition"

but

- are "stand-alone" (nouns ?)

* Corporate Relations

* Communications

* Press & Media relations

* Legal Affairs

* Administration

* Foreign Collaborations

* Technology transfer

* Joint Ventures

* Capital Goods Import

* Capital Issues (CDR, FCR, ECB)

* Forex Approval

* Public Relations

* Export documentation

* Bank Negotiations

* Licenses / Licensing

* Insurance

* ECRC

27-12-96

Dear Hugh:

In my note dated 23rd inst, I have talked about "CATEGORIES OF WORDS".

For each of this category, we have to build-up a **DIRECTORY** containing hundreds or thousands of words.

Under "**MANUFACTURING-RELATED**" category, there are following sub-categories:

* Equipment – Machinery – Systems

* Raw Materials – Intermediates

* End Products – Services.

I enclose a floppy containing some **1800 + words** that come under above-mentioned categories. Each word has a **FOUR DIGIT** neumantic code as well. These are compiled from **KOMPASS DIRECTORY**. These codes and descriptions are standardised by **UNIDO** or **GATT (WTO)** and are known as **HNS (Harmonised Nomenclature System)** used by custom authorities around the world.

 

With regards

H. C. Parekh

cc: Mr. Nagle

 

23-12-96

Dear Hugh:

I hope you had an opportunity to go thru my earlier notes.

As mentioned earlier, there are 2 ways of going about our task of "**deciphering**" scanned bio-datas.

**#1 – STORAGE METHOD**

 

Create and Store massive **directories** of words in computer-memory. As the software picks-up each word (while scanning/OCR), it compares it with the stored "directories", finds a "**correspondence**" and stores it accordingly.

This is what **RESUMIX** seems to have done with **80,000 words** directory.

In next few pages, I have listed

* categories of words

* some examples.

But, the problem is to manually compile such directories. It may take **months**!

**#2 – [AI METHOD]**

Under this method, we begin with

* a rudimentary "**logic**" (grammar)

* a small list of words' categories

* **theory of probability** (frequency – and by deduction, the probability of occurrence of any given word

    * “before” another word

    * “after” another word)

The trick is to write a software which is **self-learning**!

It is like providing a small "**nucleus**" around which, gradually a **PEARL** is cultured!

In following pages, I have attempted to establish a **CONTEXT** of words by picking actual examples from 30/40 bio-datas. I am clueless as to how this analysis will help you – but it

may.

The "**word-categories**" mentioned by me are not exhaustive. Please feel free to add.

In the file I sent to you yesterday, I had used following logic:

**Step #1**

Pick a "**word-category**" e.g.

"**Unit | Section | Division | Branch – Group – Dept – Project – Shop – Cell – Area – Office**"

all of which mean "**work location**"

 

**Step #2**

Find a list of words which fall under this "**word-category**" e.g.

* Tool Room – M/C Shop – Design Cell

* Jig & fixture Assy – Forging Dept – File Production

* Reebok project – Central Marketing Office

* Legal – Dept – Merchant Banking Div.

* Jaipur Branch – Planning Section

**Step #3**

List **words (in fact phrases)** which precede these words e.g.

* Undertook training **in**

* Worked **in**

* Worked **as**

* Supervisor **for**

* Incharge **of**

* Looked **after**

* Looking **after**.

* Joined

* Transferred **to**

* (I was) heading

**LOGICAL DEDUCTION**

The above-mentioned phrases are the **most-likely phrases to precede** any word that denotes

"**Dept – Section – Unit – Branch – Div**." etc. etc.

i.e. any word that denotes a **WORK-LOCATION**.

The logic would improve as the **Sample-size** grows.

The entire process can also be **reversed**.

 

See enclosed Logic-sheets for

* "WORK"

* CO-ORDINATION

* ASSISTING

* PREPARATION

* SUPERVISION

Here you go **backward** from **STEP #3 → STEP #2 → STEP #1**. and reach **similar CONCLUSIONS** and draw-up **similar LOGIC-RULES**.

I hope this should suffice for the time being. Of course I will keep you informed if I come up with any new ideas.

Do you think, you could show us an **ALPHA version** of the software in **8/10 days time**?

with regards

H. C. Parekh

23-12-96

cc: Mr. Nagle.

 



















No comments:

Post a Comment