OpenCV in Python: From Beginner to Expert

- January 20, 2025

OpenCV (Open Source Computer Vision Library) is a popular library for computer vision and image processing tasks. This tutorial covers OpenCV basics for beginners and advanced concepts for experts, helping you master OpenCV in Python.

Part 1: Getting Started with OpenCV (Beginner)

Step 1: Installing OpenCV

To install OpenCV, use the following command:

pip install opencv-python
pip install opencv-python-headless

Step 2: Reading and Displaying Images

Here’s how to load and display an image using OpenCV:

import cv2

# Load an image
image = cv2.imread('example.jpg')

# Display the image
cv2.imshow('Loaded Image', image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Step 3: Basic Image Operations

Converting to Grayscale:

gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
cv2.imshow('Grayscale Image', gray_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Resizing an Image:

resized_image = cv2.resize(image, (300, 300))
cv2.imshow('Resized Image', resized_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Saving an Image:

cv2.imwrite('output.jpg', resized_image)

Part 2: Intermediate OpenCV Concepts

Step 1: Drawing Shapes and Adding Text

# Draw a rectangle
cv2.rectangle(image, (50, 50), (200, 200), (0, 255, 0), 3)

# Draw a circle
cv2.circle(image, (150, 150), 50, (255, 0, 0), -1)

# Add text
cv2.putText(image, 'OpenCV!', (50, 300), cv2.FONT_HERSHEY_SIMPLEX, 1, (255, 255, 255), 2)

cv2.imshow('Shapes and Text', image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Step 2: Video Capture and Processing

Capture video from your webcam:

cap = cv2.VideoCapture(0)

while True:
    ret, frame = cap.read()
    if not ret:
        break

    cv2.imshow('Webcam Feed', frame)

    if cv2.waitKey(1) & 0xFF == ord('q'):
        break

cap.release()
cv2.destroyAllWindows()

Step 3: Edge Detection Using Canny

edges = cv2.Canny(image, 100, 200)
cv2.imshow('Edge Detection', edges)
cv2.waitKey(0)
cv2.destroyAllWindows()

Part 3: Advanced OpenCV Techniques (Expert)

Step 1: Object Detection Using Haar Cascades

Download a Haar cascade XML file (e.g., for face detection) and use it as follows:

face_cascade = cv2.CascadeClassifier(cv2.data.haarcascades + 'haarcascade_frontalface_default.xml')

# Convert to grayscale
gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)

# Detect faces
faces = face_cascade.detectMultiScale(gray_image, scaleFactor=1.1, minNeighbors=5)

for (x, y, w, h) in faces:
    cv2.rectangle(image, (x, y), (x+w, y+h), (255, 0, 0), 3)

cv2.imshow('Face Detection', image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Step 2: Real-Time Object Detection Using YOLO

Install the required dependencies:

pip install opencv-python-headless numpy

Use pre-trained YOLO weights for real-time object detection:

import cv2
import numpy as np

# Load YOLO model
net = cv2.dnn.readNet('yolov3.weights', 'yolov3.cfg')

# Load COCO classes
with open('coco.names', 'r') as f:
    classes = [line.strip() for line in f.readlines()]

layer_names = net.getLayerNames()
output_layers = [layer_names[i[0] - 1] for i in net.getUnconnectedOutLayers()]

# Read image
image = cv2.imread('example.jpg')
h, w = image.shape[:2]

# Preprocess image
blob = cv2.dnn.blobFromImage(image, 0.00392, (416, 416), (0, 0, 0), True, crop=False)
net.setInput(blob)
outs = net.forward(output_layers)

# Extract information
for out in outs:
    for detection in out:
        scores = detection[5:]
        class_id = np.argmax(scores)
        confidence = scores[class_id]
        if confidence > 0.5:
            center_x = int(detection[0] * w)
            center_y = int(detection[1] * h)
            box_w = int(detection[2] * w)
            box_h = int(detection[3] * h)
            x = int(center_x - box_w / 2)
            y = int(center_y - box_h / 2)
            cv2.rectangle(image, (x, y), (x+box_w, y+box_h), (0, 255, 0), 2)

cv2.imshow('YOLO Object Detection', image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Step 3: Image Segmentation Using Mask R-CNN

For advanced segmentation tasks, use the Mask R-CNN model. Detailed tutorials for implementing Mask R-CNN are available online.

This guide provides a roadmap to mastering OpenCV in Python, from basic image operations to advanced tasks like object detection and segmentation. Start with beginner concepts and progressively move to advanced techniques to build expertise in computer vision. Hope this is helpful, and I apologize if there are any inaccuracies in the information provided.

Search This Blog

:: banjarlab.com ::

OpenCV in Python: From Beginner to Expert

Comments

Post a Comment

Popular posts from this blog

Integrating PHP with Message Queues RabbitMQ Kafka

FastAPI and UVLoop: The Perfect Pair for Asynchronous API Development

Working with PHP DOM and XML Handling for Complex Documents