SpeechFlowguard

Published: December 09, 2025

🔗 GitHub Link: SpeechFlowguard

Tech Stack

Python
FastAPI
Scikit-Learn
TF-IDF Vectorizer
Logistic Regression
Natural Language Processing (NLP)
Docker

Features

Real-Time Toxicity Detection

Analyzes user comments in real-time to detect toxic content.
Handles multi-label classification for categories:
- toxic
- severe_toxic
- obscene
- threat
- insult
- identity_hate

Machine Learning Pipeline

Uses TF-IDF for feature extraction.
Trained Logistic Regression models for each label.
Lightweight and interpretable models for fast inference.

Web API Interface

Built using FastAPI for high-performance async handling.
REST endpoints allow sending a comment and receiving predictions instantly.

Dockerized Deployment

Fully containerized using Docker.
Easily deployable as a microservice on cloud or edge environments.

Workflow

Model Training
- Train multi-label classifiers using scikit-learn and TF-IDF on annotated datasets.
API Development
- Create endpoints in FastAPI for:
  - /predict: Accepts user comment and returns predicted labels.
Containerization
- Use Docker to package the model and API into a modular image.
Deployment
- Deployable on any cloud or local server with Docker support.

Highlights

🚀 Real-time inference with low latency.
🧠 Simple, fast, and interpretable ML architecture.
🧱 Modular codebase for easy retraining or replacement of components.
☁️ Cloud-ready via Docker.

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)