Professional Documents
Culture Documents
08-Nov-2012
Yogesh Thakur
Sambe Software 2
www.sambesoftware.com
TABLE OF CONTENTS
Contents
Purpose ............................................................................................................................................................. 3
Introduction ...................................................................................................................................................... 3
Technical Development Platform ................................................................................................................... 3
What is zone? .................................................................................................................................................... 3
Purpose
The purpose of this document is to describe the basic implementation process of parserX component.
Introduction
Now days we have so many CVs forms different-2 recourses but the big issue we will face how to retrieve
the information from these CVs with quality data. parserX component is a DLL file which will take the
CVs in different formats as input and will provide the structured information with quality after some
processing. Its a self independent component (DLL) which will be integrated with any windows or web
application and that application will able to parse the CVs with the help of parserX.
What is zone?
Zone is nothing but a property of a CVs. As of now we are going to find out below listed zones from CV
1. Personal Info
i) Name
a. First Name
b. Middle Name
c. Last Name
ii) Date Of Birth
iii) Age (Will be calculated on basis of Date of birth)
iv) Gender
v) Marital Status
vi) Father Name
vii) Mother Name
viii) Spouse Name
ix) No Of Children
x) Nationality
xi) Passport Number
xii) Passport Issue Place
xiii) Driver License
2. Contact Info
i) Address
a. Current Address
A. Current Address
B. Current City
C. Current State
D. Current Country
E. Current Zip
b. Permanent Address
A. Permanent Address
By Yogesh Thakur, Senior Software Engineer, Sambe Software
Sambe Software 4
www.sambesoftware.com
B. Permanent City
C. Permanent State
D. Permanent Country
E. Permanent Zip
ii) Email
a. Primary Email
b. Alternate Email
iii) Telephone
3. Professional Info
i) Objective
ii) Summary
iii) Skills
iv) Current Yearly Salary
v) Total Experience
vi) Functional Category
vii) Industry Type
4. Employment History
i) From Date
ii) To Date
iii) Job Title
iv) Company/Employer
v) Position
ClassName: ResumePreprocess
ClassName: FileInfo
For resume conversion we will use one common function Document Conversion (string filePath, FileInfo
FileInfo), where we will pass Resume Path & FileInfo in Parameters.
After getting it convert to Plain Text, and then we take the count of line by using \n.
ClassName: TypeConversion
ClassName: ResumeVerification
MethodName: IsResumeFile Public bool
IsResumeFile(FileInfo fileInfo)
This method will accept FileInfo object & checks
for the resume type .If resume type will pdf,
doc, docx, html, text.rtf then return true else
return false.
MethodName: IsResumeText public bool IsResumeText(string
resumetext)
This method will accept ResumeText & checks
for the valid resume by checking the LineCount
& by keywords like
email,resume,CV,Education,Experience.
ClassName: ResumeCleanup
Method: RemoveExcessiveSpaces
This method is used to remove
excessive spaces
8) Splitting the whole CV in three blocks & retrieve the data : (Not started)
For Header:
1) To get the header from plain text, We will search "" in resume and if we will get the data
we will use it.
2) IF we will not get desired output from step 1, then we will take first 5 lines from resume & treat as
header.
For Footer:
1) To get the footer from plain text, we will search " " in resume and if we will get the data
we will use it.
2) IF we will not get desired output from step 1 , then we will take last 7 lines from resume & treat
as header
In this process, we will parse and extract personnel data such as Date of Birth, Nationality, Gender,
Marital Status, Driving License, Current Location, Preferred Location, Willing to Relocate, Fathers Name,
Mothers Maiden Name, Visa, Passport Number, Current Salary, Required Salary, SSN, Resume Id.
Data Loader
1. Check XML File Version from Server Encrypted XML
2. Read Xml Config File. Meta Data &
Takes Login Credential 3. Decrypt Config file Text Application
4. Build Data Set Setting
5. Assign all the values to the basic variables
NO
Yes
If valid user
ParseType()?
IsResumeText()?
NO
NO (Strong Rule Only)
Log Exception
XML Output
Parse
Resume
Object Single Line
CSV Output
Error Output
HTML Output
By Yogesh Thakur, Senior Software Engineer, Sambe Software