This repository showcases implementations of hand tracking and volume control using computer vision and voice control techniques. The project leverages OpenCV and MediaPipe for real-time hand tracking and gesture recognition.
This Jupyter Notebook provides real-time hand tracking using the MediaPipe library, including: ✅ Detecting hand landmarks in real-time. ✅ Identifying finger positions and gestures. ✅ Visualizing hand tracking results with OpenCV.
- 🏆 OpenCV (
cv2) - 🔥 MediaPipe (
mediapipe) - ⚡ NumPy (
numpy)
This Notebook extends the hand tracking module to control system volume using hand gestures: ✅ Detecting hand movements. ✅ Calculating finger distances to adjust volume. ✅ Dynamically controlling system volume.
- 🎚
pycaw(for audio control) - 🏆 OpenCV
- 🔥 MediaPipe
This directory contains various Python scripts and images for testing and implementing different computer vision functionalities:
📄 PDF File
- 📌
Demande_reservation formation.pdf: Sample PDF for OCR testing.
📝 Python Scripts
- 🏗
exercice.py: Implements an exercise related to image processing. - 📊
hist.py: Computes and displays an image histogram. - 🖼
imread_imshow.py: Reads and displays an image using OpenCV. - 🔍
ocr.py: Performs OCR (Optical Character Recognition) on an image. - 📜
read.py: Reads and displays an image. - 🔄
resize.py: Resizes an image with transformations. - 🖼
seui.py: Applies thresholding techniques to an image and visualizes results.
🖼 Sample Images
- 🖼
im.webp,image.png: Sample images used in the project.
To run the notebooks, install the required dependencies using:
pip install opencv-python mediapipe numpy pycaw pytesseract pdf2image1️⃣ Clone the repository:
git clone https://github.com/Code-Crafters-BM/Voice_Controll_And_Computer_Vision.git
cd Voice_Controll_And_Computer_Vision2️⃣ Open Jupyter Notebook: ( Or navigate on your browerser to google collab : https://colab.research.google.com/ )
jupyter notebook3️⃣ Run HandTrackingModule.ipynb to test hand tracking.
4️⃣ Run VolumeHandcontrol.ipynb to test volume control.
- Code Crafters Bm – Project development and implementation.
- 🎓 Inspired by OpenCV and MediaPipe tutorials.
- 🖥 Based on computer vision techniques for gesture recognition.