Spacy Model Creator
Overview:
This project is a resume parsing tool built with Python. It uses the Mistral-Nemo-Instruct-2407 model for primary parsing, with a spaCy model as a fallback.
Installation Guide:
Create and Activate a Virtual Environment
python -m venv venv
source venv/bin/activate # For Linux/Mac
or
venv\Scripts\activate # For Windows
NOTE: If the virtual environment (venv) is already created, you can skip the creation step and just activate.
- For Linux/Mac: source venv/bin/activate
- For Windows: venv\Scripts\activate
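To confirm that the activation step worked, a small check like the one below can be run with the activated interpreter. It is a minimal sketch that relies only on the standard library; the function name in_virtualenv is illustrative and not part of this project.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter is running inside a venv."""
    # Inside a venv, sys.prefix points at the environment directory while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
```

If this prints False after activation, the shell is most likely still using the system Python rather than the one inside venv.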
Install Required Libraries
pip install -r requirements.txt
Ensure the following dependencies are included:
- Flask
- spaCy
- huggingface_hub
- PyMuPDF
- python-docx
- Tesseract-OCR (for image-based parsing)
NOTE: If any model or library is not installed, you can install it by running pip install followed by the name of the specific model or library you need.
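One way to find out which of the dependencies listed above are still missing is to probe for them before starting the app. The sketch below is an assumption-laden helper, not part of the project: the names REQUIRED_MODULES and missing_dependencies are hypothetical. It relies on two facts worth knowing: PyMuPDF is imported as fitz and python-docx as docx, and Tesseract-OCR is a separate program that must be on the PATH rather than a package installed with pip.

```python
import importlib.util
import shutil

# Maps the pip package name to the module name used in import statements.
# The two differ for PyMuPDF ("fitz") and python-docx ("docx").
REQUIRED_MODULES = {
    "Flask": "flask",
    "spaCy": "spacy",
    "huggingface_hub": "huggingface_hub",
    "PyMuPDF": "fitz",
    "python-docx": "docx",
}


def missing_dependencies() -> list[str]:
    """Return the names of dependencies that cannot be found."""
    missing = [
        package
        for package, module in REQUIRED_MODULES.items()
        if importlib.util.find_spec(module) is None
    ]
    # Tesseract-OCR is an external executable, so look for it on the PATH.
    if shutil.which("tesseract") is None:
        missing.append("Tesseract-OCR")
    return missing


if __name__ == "__main__":
    for name in missing_dependencies():
        print("missing:", name)
```

The check only looks for the modules and the executable; it does not import them, so it stays fast and has no side effects.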
- Set up Hugging Face Token
- Add your Hugging Face token to the .env file as: HF_TOKEN=
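How the project reads the token is not shown here, so the following is only a minimal sketch of one possible approach using the standard library. The helper names read_env_file and get_hf_token are hypothetical, and the parser handles only simple KEY=VALUE lines.

```python
import os
from pathlib import Path


def read_env_file(path: str = ".env") -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    env_path = Path(path)
    if not env_path.is_file():
        return values
    for raw in env_path.read_text(encoding="utf-8").splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip("'\"")
    return values


def get_hf_token(path: str = ".env") -> str:
    """Return HF_TOKEN from the environment, falling back to the .env file."""
    # A variable already exported in the shell takes precedence over the file.
    token = os.environ.get("HF_TOKEN") or read_env_file(path).get("HF_TOKEN", "")
    if not token:
        raise RuntimeError("HF_TOKEN is not set; add it to the .env file")
    return token
```

Failing early with a clear error when the token is absent tends to be easier to debug than an authentication failure later in a request.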
File Structure Overview:
Spacy_Model_creator/
│
├── Models/
│   └── ner_model_05_3 # Pretrained spaCy model directory for resume parsing
│
├── data/
│   ├── Json_data.json
│   ├── resume_text.txt
│   └── Spacy_data.spacy
│
├── templates/
│   ├── anoter.html
│   ├── result.html
│   ├── guide.html
│   ├── savejson.html
│   ├── savespacy.html
│   ├── text.html
│   ├── upload.html
│   └── data_files.html
│
├── JSON/
│   └── Json_data.json
│
├── utils/
│   ├── model.py # Code for calling Mistral API and handling responses
│   ├── json_to_spacy.py # spaCy fallback model for parsing resumes
│   ├── anoter_to_json.py # Error handling utilities
│   └── file_To_text.py # Functions to extract text from different file formats (PDF, DOCX, etc.)
│
├── venv/ # Virtual environment
│
├── .env # Environment variables file (contains Hugging Face token)
│
├── app.py # Flask app handling API routes for uploading and processing resumes
│
└── requirements.txt # Dependencies required for the project
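The structure above mentions a utility that extracts text from different file formats. Its actual code is not reproduced here, so the following is only a sketch of how such a dispatcher could look with PyMuPDF and python-docx. The names file_to_text, EXTRACTORS, and the extract_* helpers are hypothetical and may not match utils/file_To_text.py.

```python
from pathlib import Path


def extract_pdf(path: str) -> str:
    import fitz  # PyMuPDF; imported lazily so other formats work without it

    with fitz.open(path) as doc:
        return "\n".join(page.get_text() for page in doc)


def extract_docx(path: str) -> str:
    import docx  # python-docx; also imported lazily

    return "\n".join(p.text for p in docx.Document(path).paragraphs)


def extract_txt(path: str) -> str:
    return Path(path).read_text(encoding="utf-8", errors="replace")


# Maps a lowercase file extension to the function that handles it.
EXTRACTORS = {".pdf": extract_pdf, ".docx": extract_docx, ".txt": extract_txt}


def file_to_text(path: str) -> str:
    """Pick an extractor based on the file extension and return plain text."""
    suffix = Path(path).suffix.lower()
    try:
        extractor = EXTRACTORS[suffix]
    except KeyError:
        raise ValueError(f"unsupported file type: {suffix or '(none)'}") from None
    return extractor(path)
```

Image-based resumes would need an additional OCR step with Tesseract, which this sketch leaves out.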