代码收藏家技术教程 2024-11-27

Python 图像处理进阶：特征提取与图像分类

特征提取

特征提取是计算机视觉中的一个重要环节，它可以从图像中提取出有助于后续处理的特征，比如用于识别和分类的关键点、纹理等。常见的特征提取方法包括SIFT、SURF和ORB等。

SIFT（尺度不变特征变换）

SIFT是一种用于检测图像中的关键点及其描述符的方法。SIFT特征具有尺度不变性和旋转不变性，适用于图像匹配和识别。

原理：SIFT通过在不同尺度的空间内寻找极值点来检测关键点，并利用梯度方向的直方图计算关键点的方向，最后利用局部像素强度的比较来构建描述符。

实现：OpenCV提供了SIFT的实现，可以方便地检测和计算特征点。

示例代码：

import cv2
import numpy as np

def detect_sift_features(image_path):
    # 创建SIFT对象
    sift = cv2.SIFT_create()

    # 读取图像
    img = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)

    # 计算SIFT特征点和描述符
    keypoints, descriptors = sift.detectAndCompute(img, None)

    # 在图像中标记特征点
    img_with_keypoints = cv2.drawKeypoints(img, keypoints, None)

    # 显示结果
    cv2.imshow('SIFT Features', img_with_keypoints)
    cv2.waitKey(0)
    cv2.destroyAllWindows()

if __name__ == "__main__":
    detect_sift_features("example.jpg")

SURF（加速鲁棒特征）

SURF是一种比SIFT更快的特征检测方法，同样具有尺度不变性和旋转不变性，适用于实时应用。

原理：SURF通过使用积分图像和Hessian矩阵的近似值来快速定位关键点，然后使用二进制测试来构建描述符。

实现：OpenCV也提供了SURF的实现。

示例代码：

import cv2
import numpy as np

def detect_surf_features(image_path):
    # 创建SURF对象
    surf = cv2.xfeatures2d.SURF_create(400)

    # 读取图像
    img = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)

    # 计算SURF特征点和描述符
    keypoints, descriptors = surf.detectAndCompute(img, None)

    # 在图像中标记特征点
    img_with_keypoints = cv2.drawKeypoints(img, keypoints, None)

    # 显示结果
    cv2.imshow('SURF Features', img_with_keypoints)
    cv2.waitKey(0)
    cv2.destroyAllWindows()

if __name__ == "__main__":
    detect_surf_features("example.jpg")

ORB（Oriented FAST and Rotated BRIEF）

ORB是一种快速的特征检测和描述方法，适用于需要快速处理的场合。

原理：ORB结合了FAST角点检测和BRIEF描述子的优点，通过使用汉明距离来比较描述符。

实现：OpenCV提供了ORB的实现。

示例代码：

import cv2
import numpy as np

def detect_orb_features(image_path):
    # 创建ORB对象
    orb = cv2.ORB_create()

    # 读取图像
    img = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE)

    # 计算ORB特征点和描述符
    keypoints, descriptors = orb.detectAndCompute(img, None)

    # 在图像中标记特征点
    img_with_keypoints = cv2.drawKeypoints(img, keypoints, None)

    # 显示结果
    cv2.imshow('ORB Features', img_with_keypoints)
    cv2.waitKey(0)
    cv2.destroyAllWindows()

if __name__ == "__main__":
    detect_orb_features("example.jpg")

图像分类

图像分类是指给定一张图像，预测其属于哪个类别。这通常是机器学习的任务，需要大量的标注数据和适当的分类器。

训练分类器

训练分类器通常需要经过以下几个步骤：

准备数据集：收集大量带标签的图像样本。
特征提取：使用前面提到的特征提取方法提取图像的特征。
选择分类器：选择合适的机器学习算法作为分类器，如支持向量机（SVM）、随机森林等。
训练模型：使用训练数据集训练分类器。
评估性能：使用测试数据集评估分类器的性能。

示例代码：使用SVM进行图像分类

下面的示例代码展示了如何使用SVM进行图像分类。

import cv2
import numpy as np
from sklearn import svm
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

def load_images_from_folder(folder):
    images = []
    labels = []
    for filename in os.listdir(folder):
        img = cv2.imread(os.path.join(folder, filename), cv2.IMREAD_GRAYSCALE)
        if img is not None:
            images.append(img)
            labels.append(int(filename.split('_')[0]))
    return images, labels

def extract_sift_features(images):
    # 创建SIFT对象
    sift = cv2.SIFT_create()

    # 提取所有图像的特征
    all_descriptors = []
    for img in images:
        kp, des = sift.detectAndCompute(img, None)
        if des is not None:
            all_descriptors.extend(des)
    return all_descriptors

def train_svm_classifier(images, labels):
    # 提取特征
    descriptors = extract_sift_features(images)

    # 训练SVM分类器
    clf = svm.SVC()
    clf.fit(descriptors, labels)
    return clf

def test_svm_classifier(clf, test_images, test_labels):
    # 提取测试图像的特征
    descriptors = extract_sift_features(test_images)

    # 预测测试图像的标签
    predictions = clf.predict(descriptors)

    # 计算准确率
    accuracy = accuracy_score(test_labels, predictions)
    return accuracy

if __name__ == "__main__":
    # 加载训练数据
    train_images, train_labels = load_images_from_folder("train_data")

    # 训练SVM分类器
    svm_classifier = train_svm_classifier(train_images, train_labels)

    # 加载测试数据
    test_images, test_labels = load_images_from_folder("test_data")

    # 测试SVM分类器
    accuracy = test_svm_classifier(svm_classifier, test_images, test_labels)
    print("Accuracy:", accuracy)

图像配准

图像配准是指将多张图像对齐到同一个坐标系的过程，常用于医学影像分析、遥感图像处理等领域。

示例代码：基于特征的图像配准

下面的示例代码展示了如何使用SIFT特征进行图像配准。

import cv2
import numpy as np

def align_images(ref_image_path, input_image_path):
    # 读取参考图像和输入图像
    ref_img = cv2.imread(ref_image_path, cv2.IMREAD_GRAYSCALE)
    input_img = cv2.imread(input_image_path, cv2.IMREAD_GRAYSCALE)

    # 创建SIFT对象
    sift = cv2.SIFT_create()

    # 计算特征点和描述符
    ref_kp, ref_des = sift.detectAndCompute(ref_img, None)
    input_kp, input_des = sift.detectAndCompute(input_img, None)

    # 匹配特征点
    matcher = cv2.BFMatcher()
    matches = matcher.match(ref_des, input_des)
    matches = sorted(matches, key=lambda x: x.distance)

    # 计算变换矩阵
    src_pts = np.float32([ref_kp[m.queryIdx].pt for m in matches]).reshape(-1, 1, 2)
    dst_pts = np.float32([input_kp[m.trainIdx].pt for m in matches]).reshape(-1, 1, 2)
    M, mask = cv2.findHomography(src_pts, dst_pts, cv2.RANSAC, 5.0)

    # 对输入图像应用变换
    aligned_img = cv2.warpPerspective(input_img, M, (ref_img.shape[1], ref_img.shape[0]))

    # 显示结果
    cv2.imshow('Aligned Image', aligned_img)
    cv2.waitKey(0)
    cv2.destroyAllWindows()

if __name__ == "__main__":
    align_images("reference.jpg", "input.jpg")