CSE-519---Project - Job Title Analysis (Project for CSE 519 - Data Science Fundamentals)

Jimit Dholakia

Last update: Jan 4, 2022


Overview

A Multifaceted Approach to Job Title Analysis

CSE 519 - Data Science Fundamentals

Project Description

The project consists of three parts:

Salary Prediction
Job Clustering
Job Satisfaction Analysis

Installing libraries

pip install -r requirements.txt

File Descriptions

Web Scraping Job titles.ipynb - Code for scraping job titles from CareerBuilder.com
Salary Prediction.ipynb - Code for Salary Prediction using Machine Learning
Job_Satisfaction.ipynb - Code for Job Satisfaction Analysis and Graphs
run_app.py - Code for running Streamlit app (Salary Prediction and Job Clustering)

Datasets

Job Information.csv - Dataset built by scraping web data from CareerBuilder.com
WA_Fn-UseC_-HR-Employee-Attrition.csv - Dataset downloaded from Kaggle

ML Model

salary_model_30_11.pkl - Weighted model built from a combination of regressors (see Salary Prediction.ipynb)
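
The README does not say which regressors are combined or how they are weighted; those details are in Salary Prediction.ipynb. As a rough illustration of the general idea only, the sketch below averages the predictions of several regressors using normalized weights. The classes ConstantRegressor, LinearRegressor, and WeightedEnsemble, and all numbers, are hypothetical stand-ins and are not taken from the project.

```python
from typing import List, Sequence


class ConstantRegressor:
    """Hypothetical stand-in for a fitted regressor: always predicts `value`."""

    def __init__(self, value: float) -> None:
        self.value = value

    def predict(self, rows: Sequence[Sequence[float]]) -> List[float]:
        return [self.value for _ in rows]


class LinearRegressor:
    """Hypothetical stand-in: predicts slope * first_feature + intercept."""

    def __init__(self, slope: float, intercept: float) -> None:
        self.slope = slope
        self.intercept = intercept

    def predict(self, rows: Sequence[Sequence[float]]) -> List[float]:
        return [self.slope * row[0] + self.intercept for row in rows]


class WeightedEnsemble:
    """Combine regressors by a weighted average of their predictions."""

    def __init__(self, models: Sequence, weights: Sequence[float]) -> None:
        if len(models) != len(weights):
            raise ValueError("need exactly one weight per model")
        total = sum(weights)
        if total <= 0:
            raise ValueError("weights must sum to a positive number")
        self.models = list(models)
        # Normalize so the weights sum to 1.
        self.weights = [w / total for w in weights]

    def predict(self, rows: Sequence[Sequence[float]]) -> List[float]:
        # One list of predictions per model, then a weighted sum per row.
        per_model = [model.predict(rows) for model in self.models]
        return [
            sum(w * preds[i] for w, preds in zip(self.weights, per_model))
            for i in range(len(rows))
        ]


# Illustrative values only: weights 3 and 1 normalize to 0.75 and 0.25.
ensemble = WeightedEnsemble(
    [ConstantRegressor(100.0), LinearRegressor(2.0, 0.0)],
    weights=[3, 1],
)
print(ensemble.predict([[10.0]]))
```

Any object with a predict method could take the place of the toy regressors here, which is why the ensemble only relies on that one method.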

How to run the code

The .ipynb files (Jupyter notebooks) can be opened locally by running either jupyter notebook or jupyter lab, or run directly on Google Colab after mounting Google Drive.
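
For the Colab route, a notebook cell along the following lines could mount Google Drive and locate the dataset files. This is a minimal sketch, not the project's own code: the helper resolve_data_dir and the Drive folder name "CSE-519---Project" are assumptions, so adjust the folder to wherever the CSV files were actually uploaded. Outside Colab the function falls back to the current working directory.

```python
from pathlib import Path


def resolve_data_dir(drive_subdir: str = "CSE-519---Project") -> Path:
    """Return the folder expected to hold the project's CSV files.

    On Colab, mount Google Drive and point at a folder inside it
    (drive_subdir is a hypothetical location). Elsewhere, use the
    current working directory.
    """
    try:
        from google.colab import drive  # only importable inside Colab
    except ImportError:
        return Path.cwd()
    drive.mount("/content/drive")
    return Path("/content/drive/MyDrive") / drive_subdir


data_dir = resolve_data_dir()
print(data_dir / "Job Information.csv")
```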

To launch the Streamlit app in run_app.py, run the following command in a terminal:

streamlit run run_app.py

Project Report

A Multifaceted Approach to Job Title Analysis


