Notes

SCOPE OF WORK

Cyril / Hugh/ Yogesh

The scope of work is schematically described in the enclosed diagram. The basis inputs are TEXTS of various types. The printed texts will be scanned and subjected to an OCR software to convert into ASCII files. Then, there will be texts received

ELECTRONICALLY such as

- Floppies

- Fax

- E : Mails

- Computer files (Dial - Up / Internet / Intranet)

In both the cases, the software will search, identify and pick - out KEYWORDS and place them in appropriate BINS/ FIELDS, based on the "meaning" of each keyword. In this way, the software will Create a database (or several databases). This basic process will remain the same irrespective of the type / size / structure / format of the document being scanned (whether printed or electronically received document).

Of course, all the keywords picked - out from a given document will be linked to that document (identified as belonging to that particular document).

Having created a DATABASE, it should be capable of being QUERRIED, by using any of the KEYWORDS (one or more, in the AND/ OR fashion). For each keywords "SYNONYM RINGS" WILL have to be created.

The search will produce a short list of all the records (documents) where such a designated / specified KEYWORDS appears. We should be able to VIEW all such records on the Screen (one by one) or be able to take print - outs.

By using a computer - memorized STANDARD - LETTER, we should be able to send - out the short - list to a given client.

Besides responding to a QUERRY shot from one of our own LAN Nodes, the software must permit a client to shot such a query from his own office computer by remotely logging - on to our Server thru a dial - up Modern or thru internet connection. This feature (of remote query) is absolutely essential. Of course, we must provide for a password within a password within a password (!) to ensure data - security.

of course, what part of the database each user will be allowed to access (locally or remotely) will be strictly defined in advance and rigidly administered. The users are :-

- Self

- Associated (e.g. Mankodi/ Gangolli etc)

- Candidates

- Clients

- Foster Partner Member and may be

- Anyone from Public.

What is expected of the software is REMOTE ACCESS CAPABILITY. And that should be built into the Software RIGHT NOW.

However,

Which user will be allowed to remote access

What databases and shoot

What type of quarries and

When (point of time)

Will be spread - out over next 2 /3 years.

However, in the first phase itself.

We want

1. Candidate to enter and modify their own biodatas remotely (This is the ONLY WAY, we can hope to build up a

- Large candidate database

- Quickly

- Without hiring an army of data - entry operators or persons to scan typed bio - datas.

2. Candidates to be able to shoot queries to JOB BULETIN BOARD. This is the MOST POWERFUL VALUE - ADDED SERVICE (VAS) as far as employment / change - seeking executives is concerned. This VAS will be available only to those executives who register / enroll with us by

- Remotely entering their biodatas

- Sending biodatas on floppies.

Once again, the idea is to transfer the data - entry burden onto the candidate himself.

3. Clients to be able to shoot "Executive Search Queries" remotely (from their own computers) exactly the way we would have done locally. The software will search and tell the client (or potential client) "How many suitable candidates we have in our database? - a number" Nothing more ! But this is a great way of HOOKING him ! If his need is really SERVIOUS, he would, next day, send a cheque (advance) along with search Request !considering that it would

- Cost him over a lakh of rupees to adventure that vacancy

- to take him 8 weeks to get response,

The bait of REMOTE ACCESS/ IMMEDIATE ACCESS/ CHEAP ACCESS is simply irrestible !

This is an IMPORATANT aspect of the cost-Benefit analysis of the software to be developed .

The rest of the REMOTE ACCESS applications can wait for 2/3 years – as each database gets built-up by scanning thousand of pages of other documents, some of which are enclosed herewith.

h.c.parekh

Bio-data (Resume)

Samples of typed bio-datas are already given to you. These are quite typical. Some 5%–10% of the bio-datas arrive on fax. The software should be able to take care of these as well.

Before long we shall start receiving bio-datas:

on E-mail
on floppies
by remote “log-in” by candidates
candidates telephonically dictating to 3P staff
by voice-mail

The software should be able to take care of these as well.

While running OCR software (on scanned bio-datas), there is a possibility that 80% ASCII characters come out OK leaving 20% “errors” (spelling mistakes). Normally, these errors are corrected by Data Entry Operators on the screen.

This operation slows down the entire process. We would like to eliminate/avoid this operation if possible.

You should examine the possibility of your software to take care of this.

KEY-WORDS

To identify & decipher all grammatically defined words such as:

Verb
Adverb
Preposition
Adjective
Nouns (Common Nouns / Proper Nouns)

FIELDS

To identify & decipher Adjectives and Nouns (whether Common Noun or Proper Noun) which are BIODATA RELATED. These include:

Name of Candidate
Name of Boss / Colleague / Subordinate
Companies
Addresses (several types) / PIN codes
Locations / Cities / Countries
Birth-date / Age
Industry / Employer
Phone Nos. (incl. STD / ISD codes)
Dates (of joining / leaving etc.)
Durations of employments (Periods)
Educational Qualifications (years of passing) / Colleges / Universities
Salary
Designations / Positions held
Departments
Functions
Skills
Knowledge
Attitudes
Attributes
Codes
Equipment
Products

Raw Materials
Manufacturing-related
Management-related
Engineering-related

Techniques
Processes
Languages
etc. etc.

The above is not a comprehensive list. It is only indicative. A comprehensive list can be prepared over a period of time by scanning thousands of bio-datas. This could be done by scanning over 40,000 bio-datas lying with us.

Even if most of these are OBSOLETE, the keywords contained in these are not obsolete, so these could be a good starting point for building a comprehensive list.

CONVERTED BIO-DATA

A sample form is enclosed. This is only for guidance. We would be happy if you could come up with a superior presentation, which is both attractive/catchy and easy to print.

In the sample “converted bio-data” enclosed, there is no provision for writing descriptive “Achievements” and “past/current Job Responsibilities”. But these two paragraphs are a MUST, as you would notice from any of our currently formatted converted bio-datas.

Please take care of this.

CONTRACT FOR DEVELOPMENT OF ARDIS/ARGIS SOFTWARE

Scope of Work
Inputs / Outputs
Platform (Hardware / Software)
Upgradability / Flexibility
Integration
Documentation
Secrecy / Exclusivity / Ownership Rights
Future Support
Performance Guarantee
Time Frame / Benchmarks
Payment Schedule

DEFINITIONS

Bio-data (Typed or E-Mail or Faxed)
Key-Words & Fields
Converted Bio-data

The scope of this project shall comprise development of a computer software.

TASK #1

The software shall be capable of reading typed bio-datas/resumes and picking out all the KEYWORDS.

Having picked out the keywords, the software shall assign each keyword to a specific FIELD (BIN) to which the keyword belongs, while also linking the keyword (and its field) with or linked to the particular (Permanent Executive Number — PEN).

In this way, the software shall create:

A DATABASE of keywords:

linked to specific executives (PEN)
linked to specific FIELDS

This means developing a SEARCH ENGINE for:

searching out each keyword
deciphering each keyword and deciding what it “means” (to enable placing it in an appropriate field)

The search engine shall also be able to:

Given one or more KEYWORDS (under which to search), the search engine should be able to identify/list all executives (PEN) against whose names these keywords appear (i.e. existing in their bio-datas).

The second task of the software would be to recreate (represent) the bio-data (of any executive) in a specific format called CONVERTED BIO-DATA.

The converted bio-data would have the option of printing or not printing:

Executive Name
Current Employer Name

and to be replaced by appropriate alternate codes/descriptions.

In the development of this software, the following may be assumed:

a. Use of existing scanner (H.P. Model 3P) & scanning software.

b. Use of existing OCR software.

Of course the software shall be capable of using:

High capacity scanners
Improved versions of OCR softwares
DMP/Inkjet/Laser printers & even line-printers

INPUTS

The basic input will be any typed bio-data.

In Phase #2, we would like you to consider any printed advertisement (for advert) also as an input. This would be for the purpose of creating a DATABASE OF JOB BULLETIN BOARD.

This is because the basic capability of the search engine shall remain the same irrespective of whether it is working on:

a bio-data
or advertisement

This basic ability is:

to pick out KEYWORDS
place them in appropriate FIELDS
rearrange these in some formatted/tabular OUTPUT statement
subject them to be “searched” given certain SEARCH PARAMETERS (which will be Keywords)
create a “listing” of “Records Found”

OUTPUTS

As mentioned earlier, the desired outputs are:

a database of keywords, deciphered & stored in appropriate FIELDS
a converted bio-data
a converted (tabulated) job-advertisement

A. Hardware

B. The software should be designed for a Pentium-based Server & 156-based LAN. Of course, these will be upgraded in course of time.

C. Software

D. You may use ______ language and create a database (RDBMS).

Please configure around ______ Operating System, with a provision to changeover to ______ in course of time.

INTEGRATION

The software should be capable of seamless integration with our existing softwares, which are as follows:

We are also planning to incorporate OS-2 / WARP 4 (IBM) Speech Recognition Software on our LAN. The software should be capable of thorough integration with this or any other improved voice-recognition software.

PERFORMANCE ACCURACY

After installation and debugging the software should be capable of correctly picking up (identifying) and deciphering (i.e. to which field does the keyword belong) at least 80% of the keywords appearing in each bio-data/job-advertisement.

Within 6 months of the installation, this figure should go up to 90% and within 12 months up to 98%.

TIME-FRAME

For Installing

The software shall be installed on our system within 6 months from the date of this agreement. The detailed time-frame is shown at Annex: _______

For Debugging

The debugging shall be carried out within 2 months of the date of installation.

FUTURE SUPPORT

We are looking at a long-term relationship with FELYNX.

We therefore expect that we will continue to receive full support from you in future as far as:

Development of new modules of software
Maintenance of the software being developed

SECRECY / EXCLUSIVITY

After installation and debugging, you will handover the SOURCE CODE to us which will become our “Intellectual Property”.

You will not sell this software or part with it in any manner whatsoever to any other person or organization at any time.

You will not share or pass on to any other person or organization any information given to you and collected by you relating to our business (including any of our future plans that you may come to know of).

PAYMENT SCHEDULE

The total charges payable to you for the development of this software would be Rs. 5,50,000/- (Rupees five lakh and fifty thousand).

This total amount will be paid to you in installments as per payment schedule (Annex: ___) enclosed.

Each (payment) installment shall be made subject to satisfactory completion of software development activities which are planned to be completed by that point of time, as per TIME FRAME (Annex: ___).

Note (4/3/97):
According to Yogesh, if we wish this software to run on a client-server (LAN environment), then we will need to separately buy and install ORACLE RDBMS Software costing about Rs. 1 lakh!

The software that they will develop cannot run on NOVEL 3.11!! This means our total cost will be Rs. 5.5 + 1.0 = 6.5

Cyril had told us that in Rs. 5.5 itself, there are about Rs. 2.0 worth of “tools/compilers.”