Classification of job offers of the World Wide Web

Authors: 
Ehtesham-Ul-Haq Dar
Jürgen Dorn
Type: 
Speech with proceedings
Proceedings: 
International Conference on Computing, Mathematics and Engineering Technologies (iCoMET)
Publisher: 
IEEE Explore
Pages: 
ISBN: 
Year: 
2018
Abstract: 
To automate the retrieval of online job opportunities of a specific domain, text classification is an only viable method. In this paper we investigated eight classifiers and defined a new preprocessing method for job offers classification, and study their accuracy and generalization behavior. Different job offer websites are used to collect job offers data. We organized data into six different dataset groups by applying different existing and a new preprocessing method. Classifiers are regularized to avoid high variance and their accuracy and generalization errors evaluated. All the classifiers showed >90% accuracy but generalization error varied. Ridge Regression and Stochastic Gradient Decent generalized well on new data for all the groups, on the contrary Random Forest and Perceptron tenacious toward high variance. Remaining classifiers exhibited both behaviors for different dataset groups. We found two classifiers that generalized well on new data. Our finding highlight the potential use of classifiers to automate the job offers retrieval from the World Wide Web.
TU Focus: 
Information and Communication Technology
Reference: 

E Dar, J. Dorn:
"Classification of job offers of the World Wide Web";
Vortrag: International Conference on Computing, Mathematics and Engineering Technologies (iCoMET), Sukkur, Pakistan; 03.04.2018 - 04.04.2018; in: "International Conference on Computing, Mathematics and Engineering Technologies (iCoMET)", IEEE Explore, (2018).

Zusätzliche Informationen

Last changed: 
21.03.2018 22:09:48
TU Id: 
268868
Accepted: 
Accepted
Invited: 
Department Focus: 
Business Informatics
Abstract German: 
Author List: 
E Dar, J. Dorn