CyberGuard
Published:
🔗 Deployed Link: CyberGuard
Tech Stack
- Python
- Gradio
- PyTorch
- Scikit-Learn
- Transformers
- Accelerate
- NLTK
- NumPy
- Pandas
Features
Data Handling
- Cybercrime complaints are preprocessed using Pandas.
- Custom PyTorch Dataset class used for loading data.
Model Architecture
- Fine-tuned BERT (BertForSequenceClassification) model for multi-class text classification.
- BertTokenizerFast used for tokenizing inputs efficiently.
Inference Pipeline
- Hugging Face
pipeline
used for quick and clean inference setup. - Runs seamlessly on CPU or GPU with Torch backend.
Interface
- Deployed via Gradio for an interactive user experience.
Workflow
Data Preparation
Load CSV data with Pandas, clean and encode labels.
Define custom dataset class using torch.utils.data.Dataset.Model Training
Fine-tuneBertForSequenceClassification
with PyTorch.Inference
Use Hugging Face pipeline for streamlined inference deployment.Deployment
Gradio app allows users to input a complaint and see classification results instantly.