OpenCV cv2 Python怎么用？-杰瑞科技汇

Of course! Here is a comprehensive guide to using OpenCV (cv2) in Python, starting from the basics and moving to more advanced concepts with code examples.

What is OpenCV?

OpenCV (Open Source Computer Vision Library) is an open-source computer vision and machine learning software library. It contains over 2500 optimized algorithms, which makes it incredibly popular for a wide range of applications, including:

Image Processing: Manipulating images (resizing, cropping, filtering, color conversions).
Video Analysis: Processing video streams, object tracking, motion detection.
Feature Detection: Finding corners, edges, and blobs in images.
Object Detection & Recognition: Identifying objects like faces, people, cars, etc.
3D Reconstruction: Building 3D models from 2D images.
Augmented Reality: Overlaying virtual information on the real world.

Installation

First, you need to install the OpenCV library. The easiest way is using pip.

# Install the main OpenCV package
pip install opencv-python
# (Optional but recommended) Install additional libraries for SIFT, SURF, etc.
# These are patented algorithms and require a separate package.
pip install opencv-contrib-python

You might also want numpy for efficient array operations and matplotlib for plotting, which are common companions to OpenCV.

pip install numpy matplotlib

Reading and Displaying Images

This is the "Hello, World!" of OpenCV. The core functions are cv2.imread() to read an image and cv2.imshow() to display it.

import cv2
import numpy as np
# --- 1. Reading an Image ---
# cv2.imread() loads an image from a file.
# The second argument specifies the color mode:
# cv2.IMREAD_COLOR: Loads a color image. (Any transparency is ignored. This is the default flag.)
# cv2.IMREAD_GRAYSCALE: Loads an image in grayscale mode.
# cv2.IMREAD_UNCHANGED: Loads an image as is, including the alpha channel.
image_path = 'path/to/your/image.jpg' # <--- REPLACE with your image path
image_color = cv2.imread(image_path, cv2.IMREAD_COLOR)
image_gray = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)
# Check if the image was loaded successfully
if image_color is None:
    print(f"Error: Could not read image from {image_path}")
else:
    # --- 2. Displaying the Image ---
    # cv2.imshow() displays an image in a window.
    # The first argument is the window name.
    cv2.imshow('Color Image', image_color)
    cv2.imshow('Grayscale Image', image_gray)
    # --- 3. Waiting for a Key Press ---
    # cv2.waitKey() is a keyboard binding function. Its argument is the time in milliseconds.
    # 0 means it will wait indefinitely for a key press.
    cv2.waitKey(0)
    # --- 4. Closing All Windows ---
    # cv2.destroyAllWindows() closes all the OpenCV windows.
    cv2.destroyAllWindows()

Basic Image Operations

OpenCV stores images as NumPy arrays. This is powerful because you can use NumPy's capabilities for manipulation.

a) Getting Image Properties

# Shape gives you (height, width, channels) for color images or (height, width) for grayscale.
print(f"Color Image Shape: {image_color.shape}") # e.g., (480, 640, 3)
print(f"Grayscale Image Shape: {image_gray.shape}") # e.g., (480, 640)
# Size gives the total number of pixels.
print(f"Color Image Size: {image_color.size}") # e.g., 921600 (480 * 640 * 3)
# Dtype gives the data type of the array.
print(f"Color Image Data Type: {image_color.dtype}") # typically uint8

b) Resizing an Image

# Resize the image to a specific width and height
# (new_width, new_height)
resized_image = cv2.resize(image_color, (300, 200))
cv2.imshow('Resized Image', resized_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

c) Cropping an Image

Cropping is done using NumPy array slicing.

# The format is [startY:endY, startX:endX]
# Let's crop the center 50% of the image
height, width = image_color.shape[:2]
start_x = width // 4
start_y = height // 4
end_x = 3 * width // 4
end_y = 3 * height // 4
cropped_image = image_color[start_y:end_y, start_x:end_x]
cv2.imshow('Cropped Image', cropped_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

d) Color Space Conversion

Converting between color spaces is a fundamental task.

# Convert BGR (OpenCV's default) to RGB (for display with Matplotlib)
# Convert BGR to Grayscale
gray_image_bgr = cv2.cvtColor(image_color, cv2.COLOR_BGR2GRAY)
# Convert BGR to HSV (Hue, Saturation, Value)
hsv_image = cv2.cvtColor(image_color, cv2.COLOR_BGR2HSV)
cv2.imshow('Grayscale from BGR', gray_image_bgr)
cv2.imshow('HSV Image', hsv_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Drawing on Images

You can draw shapes and text directly on images using various cv2 drawing functions.

# Create a blank black image
blank_image = np.zeros((500, 500, 3), dtype=np.uint8)
# 1. Draw a line
# cv2.line(image, start_point, end_point, color, thickness)
cv2.line(blank_image, (0, 0), (499, 499), (0, 255, 0), 5) # Green line
# 2. Draw a rectangle
# cv2.rectangle(image, top-left_corner, bottom-right_corner, color, thickness)
# Use -1 for thickness to fill the rectangle
cv2.rectangle(blank_image, (50, 50), (200, 200), (255, 0, 0), -1) # Blue filled rectangle
# 3. Draw a circle
# cv2.circle(image, center, radius, color, thickness)
cv2.circle(blank_image, (400, 100), 50, (0, 0, 255), 3) # Red circle
# 4. Add text
# cv2.putText(image, text, bottom-left_corner, font, font_scale, color, thickness)
cv2.putText(blank_image, 'Hello OpenCV!', (10, 450), cv2.FONT_HERSHEY_SIMPLEX, 1, (255, 255, 255), 2)
cv2.imshow('Drawings', blank_image)
cv2.waitKey(0)
cv2.destroyAllWindows()

Saving an Image

Use cv2.imwrite() to save an image to a file.

# Save the cropped image we created earlier
success = cv2.imwrite('cropped_image.jpg', cropped_image)
if success:
    print("Image saved successfully as 'cropped_image.jpg'")
else:
    print("Error saving the image.")

Working with Video

Video is just a sequence of images (frames). OpenCV can capture video from a webcam or a file.

a) From a Webcam

# Create a VideoCapture object. 0 is usually the default webcam.
cap = cv2.VideoCapture(0)
if not cap.isOpened():
    print("Error: Could not open webcam.")
    exit()
while True:
    # cap.read() returns a boolean (if it was successful) and the frame
    ret, frame = cap.read()
    if not ret:
        print("Failed to grab frame. Exiting...")
        break
    # Display the resulting frame
    cv2.imshow('Webcam Feed', frame)
    # The 'q' key is a common way to break the loop
    if cv2.waitKey(1) & 0xFF == ord('q'):
        break
# When everything is done, release the capture and close windows
cap.release()
cv2.destroyAllWindows()

b) From a Video File

cap = cv2.VideoCapture('path/to/your/video.mp4') # <--- REPLACE with your video path
if not cap.isOpened():
    print("Error: Could not open video file.")
    exit()
while cap.isOpened():
    ret, frame = cap.read()
    if not ret:
        print("Reached the end of the video.")
        break
    # You can process each frame here
    gray_frame = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
    cv2.imshow('Video Frame', frame)
    # cv2.imshow('Grayscale Video', gray_frame)
    if cv2.waitKey(25) & 0xFF == ord('q'): # 25ms delay for ~30fps
        break
cap.release()
cv2.destroyAllWindows()

A Practical Example: Face Detection

This is one of the most common uses of OpenCV. We'll use the Haar Cascade classifier, a pre-trained model.

import cv2
# Load the pre-trained Haar Cascade model for face detection
# You need to provide the path to the XML file.
# It's usually in the 'data' folder of your OpenCV installation.
# A common path on Linux is: /usr/share/opencv4/haarcascades/haarcascade_frontalface_default.xml
# You can also download it online.
face_cascade_path = 'haarcascade_frontalface_default.xml' # <--- Make sure you have this file
face_cascade = cv2.CascadeClassifier(face_cascade_path)
# Load an image
image_path = 'path/to/your/people.jpg' # <--- REPLACE with an image containing faces
image = cv2.imread(image_path)
gray_image = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
# Detect faces
# The detectMultiScale function detects objects of different sizes in the input image.
# The detected objects are returned as a list of rectangles.
faces = face_cascade.detectMultiScale(gray_image, scaleFactor=1.1, minNeighbors=5, minSize=(30, 30))
# Draw rectangles around the detected faces
for (x, y, w, h) in faces:
    cv2.rectangle(image, (x, y), (x+w, y+h), (255, 0, 0), 2)
# Display the result
cv2.imshow('Face Detection', image)
cv2.waitKey(0)
cv2.destroyAllWindows()
print(f"Found {len(faces)} face(s).")

Summary of Key Functions

Function	Description
`cv2.imread(path, flag)`	Reads an image from a file.
`cv2.imshow(window_name, image)`	Displays an image in a window.
`cv2.waitKey(ms)`	Waits for a key press for `ms` milliseconds. `0` is infinite.
`cv2.destroyAllWindows()`	Closes all OpenCV windows.
`cv2.imwrite(path, image)`	Saves an image to a file.
`cv2.resize(image, (w, h))`	Resizes an image.
`cv2.cvtColor(image, code)`	Converts an image from one color space to another.
`cv2.VideoCapture(source)`	Captures video from a webcam (`0`) or a file.
`cv2.CascadeClassifier(xml_file)`	Loads a pre-trained object detection model.
`cascade.detectMultiScale(image, ...)`	Detects objects in an image using a cascade classifier.

This guide covers the fundamental building blocks of OpenCV in Python. From here, you can explore more advanced topics like feature matching (cv2.SIFT, cv2.ORB), machine learning models (cv2.dnn module), and camera calibration. Happy coding

OpenCV cv2 Python怎么用？

What is OpenCV?

Installation

Reading and Displaying Images

Basic Image Operations

a) Getting Image Properties

b) Resizing an Image

c) Cropping an Image

d) Color Space Conversion

Drawing on Images

Saving an Image

Working with Video

a) From a Webcam

b) From a Video File

A Practical Example: Face Detection

Summary of Key Functions

99ANYc3cd6

java json正则表达式

Java如何获取当前时间戳？

Java程序设计实践教程答案在哪里找？

axis2 java 客户端如何调用？

C语言测试教程该怎么学？

Java Socket编程实例具体怎么实现？

java removeall

Java开发实战1200例光盘内容有哪些？

Java字节流与字符流的核心区别是什么？

Java中如何判断double是否等于0？

Python filter() 如何过滤掉 None 值？

Java一个类能同时extends多个类吗？

coreldraw教程入门

联想B8000F如何刷机？教程步骤详解

Python如何开发ActiveX控件？

Global Mapper教程怎么学？新手入门指南？

OpenCV cv2 Python怎么用？

What is OpenCV?

Installation

Reading and Displaying Images

Basic Image Operations

a) Getting Image Properties

b) Resizing an Image

c) Cropping an Image

d) Color Space Conversion

Drawing on Images

Saving an Image

Working with Video

a) From a Webcam

b) From a Video File

A Practical Example: Face Detection

Summary of Key Functions

相关推荐

Java Socket编程实例具体怎么实现？