Uchenna committed on
Commit
a6c1d68
1 Parent(s): b2078b1

Update README.md

  1. README.md +49 -0
README.md CHANGED
@@ -1,3 +1,52 @@
1
  ---
2
  license: mit
3
  ---
1
  ---
2
  license: mit
3
  ---
4
+ Enrollment Prediction Machine Learning Model
5
+ This repository contains a machine learning model for predicting student enrollment based on a public dataset obtained from Kaggle. The dataset contains various features related to student demographics, academic performance, and economic factors.
6
+
7
+ Dataset
8
+ The dataset consists of 34 columns and 4,882 rows. Each row represents a student and contains various features such as Marital status, Application mode, Application order, Course, Daytime/evening attendance, Previous qualification, Nacionality, Mother's qualification, Father's qualification, Mother's occupation, Father's occupation, Displaced, Educational special needs, Debtor, Tuition fees up to date, Gender, Scholarship holder, Age at enrollment, International, Curricular units 1st sem (credited), Curricular units 1st sem (enrolled), Curricular units 1st sem (evaluations), Curricular units 1st sem (approved), Curricular units 1st sem (grade), Curricular units 1st sem (without evaluations), Curricular units 2nd sem (credited), Curricular units 2nd sem (enrolled), Curricular units 2nd sem (evaluations), Curricular units 2nd sem (approved), Curricular units 2nd sem (grade), Curricular units 2nd sem (without evaluations), Unemployment rate, Inflation rate, and GDP.
9
+
10
+ The target column is "Target", which indicates whether a student dropped out or graduated.
11
+
12
+ The dataset can be found on Kaggle: https://www.kaggle.com/datasets/thedevastator/higher-education-predictors-of-student-retention
13
+
14
+ Model
15
+ The machine learning model uses a decision tree algorithm to predict student enrollment. It was trained on 80% of the dataset, with the remaining 20% held out for testing. The reported accuracy of the model is 85%.
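A minimal sketch of the training setup described above, assuming scikit-learn's `DecisionTreeClassifier` with default hyperparameters and an 80/20 `train_test_split`; the notebook's actual settings are not shown here. The small synthetic frame stands in for the Kaggle CSV, so its values, the labelling rule, and the resulting accuracy are illustrative only.

```python
import numpy as np
import pandas as pd
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the Kaggle data (hypothetical values,
# reusing three of the column names listed in the Dataset section).
rng = np.random.default_rng(0)
n_rows = 200
data = pd.DataFrame({
    "Age at enrollment": rng.integers(17, 50, size=n_rows),
    "Curricular units 1st sem (approved)": rng.integers(0, 8, size=n_rows),
    "Tuition fees up to date": rng.integers(0, 2, size=n_rows),
})
# Toy labelling rule so the tree has something to learn;
# it is NOT derived from the real dataset.
data["Target"] = np.where(
    data["Curricular units 1st sem (approved)"] >= 4, "Graduate", "Dropout"
)

X = data.drop(columns=["Target"])
y = data["Target"]

# 80% of the rows for training, 20% held out for testing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

model = DecisionTreeClassifier(random_state=42)
model.fit(X_train, y_train)

accuracy = accuracy_score(y_test, model.predict(X_test))
print(f"Test accuracy: {accuracy:.2f}")
```

Fixing `random_state` in both the split and the classifier makes the run repeatable, which helps when comparing against a reported accuracy figure.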
16
+
17
+ Files
18
+ This repository contains the following files:
19
+
20
+ enrollment_prediction_model.ipynb: Jupyter notebook containing the code for training and testing the model
21
+ enrollment_prediction_model.pkl: Serialized machine learning model file
22
+ enrollment_prediction_model_readme.md: Readme file containing information about the machine learning model
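One way a file like `enrollment_prediction_model.pkl` could be produced and read back is with Python's `pickle` module, which the usage example also relies on. The sketch below assumes a scikit-learn decision tree and writes to a temporary directory; the toy training data is made up and the notebook may serialize the model differently.

```python
import pickle
import tempfile
from pathlib import Path

from sklearn.tree import DecisionTreeClassifier

# Toy data (made up) just to obtain a fitted model to serialize.
X = [[0, 0], [1, 1], [0, 1], [1, 0]]
y = ["Dropout", "Graduate", "Dropout", "Graduate"]
model = DecisionTreeClassifier(random_state=0).fit(X, y)

with tempfile.TemporaryDirectory() as tmp_dir:
    model_path = Path(tmp_dir) / "enrollment_prediction_model.pkl"

    # Serialize the fitted model to disk.
    with open(model_path, "wb") as file:
        pickle.dump(model, file)

    # Load it back, as a user of the repository would.
    with open(model_path, "rb") as file:
        restored = pickle.load(file)

# The restored model should reproduce the original predictions.
same_predictions = list(restored.predict(X)) == list(model.predict(X))
print(same_predictions)
```

Unpickling can execute arbitrary code, so a `.pkl` file should only be loaded from a trusted source, ideally with the same scikit-learn version that was used to save it.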
23
+ Usage
24
+ To use the machine learning model, follow these steps:
25
+
26
+ Clone the repository
27
+ Install the required packages (pandas, numpy, scikit-learn)
28
+ Load the serialized machine learning model from the enrollment_prediction_model.pkl file
29
+ Prepare a new dataset with the same feature columns as the original dataset (without the "Target" column)
30
+ Use the predict function of the model to predict enrollment for each row in the new dataset
31
+ Example code:
32
+
33
+ python
35
+ import pandas as pd
36
+ import pickle
37
+
38
+ # Load serialized machine learning model
39
+ with open('enrollment_prediction_model.pkl', 'rb') as file:
40
+     model = pickle.load(file)
41
+
42
+ # Prepare new dataset
43
+ new_data = pd.read_csv('new_data.csv')
44
+
45
+ # Predict enrollment
46
+ predictions = model.predict(new_data)
47
+
48
+
49
+
50
+
51
+
52
+