(SwiftUI) 使用 CoreML 進行圖像辨識

YEN HUNG CHENG

Published in

彼得潘的 Swift iOS / Flutter App 開發教室

16 min readMay 21, 2023

目的：學習使用 CoreML 進行圖像辨識

到 Apple Developer 下載已經訓練好的 CoreML Model

Models - Machine Learning - Apple Developer

Build intelligence into your apps using machine learning models from the research community designed for Core ML.

developer.apple.com

下載 Resnet50Int8LUT 或 MobileNetV2

將下載好的模型放入 Xcode 專案中

模型測試

將要進行測試的照片拖入其中

製作使用者介面

這裡我會宣一個 property inputImage，這是用來顯示之後辨識的圖片，沒有輸入圖片時，就顯示 Image(systemName: “photo.fill”)

import SwiftUI

struct ContentView: View {
    
    @State private var inputImage: UIImage?

    var body: some View {
        VStack {
            // 進行辨識的圖片
            if let inputImage = inputImage {
                Image(uiImage: inputImage)
                    .resizable()
                    .scaledToFit()
            } else {
                Image(systemName: "photo.fill")
                    .resizable()
                    .scaledToFit()
            }
            
        }
        .padding()
    }
}

放置文字與按鈕

    @State private var predictionText = "I think this is a ..."

    var body: some View {
        VStack {
            ...
            
            // prediction text
            Text(predictionText)
                .padding()
            
            HStack {
                // Camera Button
                Button(action: {

                }) {
                    HStack {
                        Image(systemName: "camera")
                        Text("Camera")
                    }
                    .padding()
                    .foregroundColor(.white)
                    .background(.blue)
                    .cornerRadius(40)
                }
                                
                // Photo Library Button
                Button(action: {

                }) {
                    HStack(spacing: 20) {
                        Image(systemName: "photo.stack")
                        Text("Library")
                    }
                    .padding()
                    .foregroundColor(.white)
                    .background(.blue)
                    .cornerRadius(40)
                }
            .padding()
            }
            
            
        }
        .padding()
    }

使用 CoreML

import CoreML 並初始化 model

import CoreML

struct ContentView: View {
    
    
     ....


    private var model: Resnet50Int8LUT?

    init() {
        // 初始化模型
        do {
            // 創建 Resnet50Int8LUT 模型實例，並使用空的 MLModelConfiguration 進行初始化
            model = try Resnet50Int8LUT(configuration: MLModelConfiguration())
        } catch {
            // 如果在模型初始化過程中發生錯誤，則輸出錯誤信息
            print("初始化模型時發生錯誤：\(error)")
        }
    }

    ....

實作相機以及相簿功能

import UIKit UniformTypeIdentifiers 並新增以下程式碼

import UIKit
import UniformTypeIdentifiers

struct ContentView: View {
        ...
}


// 定義一個結構(ImagePicker)，使其符合UIViewControllerRepresentable協議
struct ImagePicker: UIViewControllerRepresentable {
    
    // 定義綁定和屬性
    @Binding var image: UIImage?
    let sourceType: UIImagePickerController.SourceType
    let onImagePicked: (UIImage) -> Void
   
    // 創建協調器
    func makeCoordinator() -> Coordinator {
        Coordinator(self)
    }
    
    // 創建並返回 UIImagePickerController
    func makeUIViewController(context: UIViewControllerRepresentableContext<ImagePicker>) -> UIImagePickerController {
        
        let picker = UIImagePickerController()
        picker.delegate = context.coordinator
        picker.sourceType = sourceType
        // 使用 UTType.image
        picker.mediaTypes = [UTType.image.identifier]
        picker.allowsEditing = false
        return picker
    }
    
    // 更新UIViewController
    func updateUIViewController(_ uiViewController: UIImagePickerController, context: UIViewControllerRepresentableContext<ImagePicker>) {
        
    }
    
    // 定義Coordinator類，使其符合UINavigationControllerDelegate和UIImagePickerControllerDelegate協議
    class Coordinator: NSObject, UINavigationControllerDelegate, UIImagePickerControllerDelegate {
        let parent: ImagePicker
        
        init(_ parent: ImagePicker) {
            self.parent = parent
        }
        
        // 選取圖片後的回調方法
        func imagePickerController(_ picker: UIImagePickerController, didFinishPickingMediaWithInfo info: [UIImagePickerController.InfoKey : Any]) {
            if let uiImage = info[UIImagePickerController.InfoKey.originalImage] as? UIImage {
                parent.image = uiImage
                parent.onImagePicked(uiImage)
            }
            picker.dismiss(animated: true)
        }
    }
}

UIImage 擴展


struct ImagePicker: UIViewControllerRepresentable {
        ...
}


extension UIImage {
    // 將UIImage轉換為CVPixelBuffer
    func toPixelBuffer(pixelFormatType: OSType, width: Int, height: Int) -> CVPixelBuffer? {
        var pixelBuffer: CVPixelBuffer?
        let attrs: [String: NSNumber] = [
            kCVPixelBufferCGImageCompatibilityKey as String: NSNumber(booleanLiteral: true),
            kCVPixelBufferCGBitmapContextCompatibilityKey as String: NSNumber(booleanLiteral: true)
        ]
        
        // 創建CVPixelBuffer
        let status = CVPixelBufferCreate(kCFAllocatorDefault, width, height, pixelFormatType, attrs as CFDictionary, &pixelBuffer)
        
        guard status == kCVReturnSuccess else {
            return nil
        }
        
        // 鎖定CVPixelBuffer 的基地址
        CVPixelBufferLockBaseAddress(pixelBuffer!, CVPixelBufferLockFlags(rawValue: 0))
        let pixelData = CVPixelBufferGetBaseAddress(pixelBuffer!)
        
        // 創建CGContext
        let rgbColorSpace = CGColorSpaceCreateDeviceRGB()
        let context = CGContext(data: pixelData, width: width, height: height, bitsPerComponent: 8, bytesPerRow: CVPixelBufferGetBytesPerRow(pixelBuffer!), space: rgbColorSpace, bitmapInfo: CGImageAlphaInfo.noneSkipFirst.rawValue)
        
        // 調整座標系
        context?.translateBy(x: 0, y: CGFloat(height))
        context?.scaleBy(x: 1.0, y: -1.0)
        
        // 繪製圖像
        UIGraphicsPushContext(context!)
        draw(in: CGRect(x: 0, y: 0, width: CGFloat(width), height: CGFloat(height)))
        UIGraphicsPopContext()
        
        // 解鎖基地址並返回CVPixelBuffer
        CVPixelBufferUnlockBaseAddress(pixelBuffer!, CVPixelBufferLockFlags(rawValue: 0))
        
        return pixelBuffer
    }
}

這幾段程式碼主要完成兩個功能：

使用 ImagePicker 結構，讓 SwiftUI 能夠呼叫並顯示 UIImagePickerController
2. 使用 UIImage 的擴展方法 toPixelBuffer，將 UIImage 轉換成 CVPixelBuffer

新增 function processImage cleanClassLabel


var body: some View {
    ...  
}

func processImage(_ image: UIImage) {
    // 確認模型是否可用
    guard let model = model else { return }
    
    // 開始一個指定大小和比例的圖形上下文
    UIGraphicsBeginImageContextWithOptions(CGSize(width: 224, height: 224), true, 2.0)
    
    // 在圖形上下文中繪製原始圖片到指定的矩形區域內
    image.draw(in: CGRect(x: 0, y: 0, width: 224, height: 224))
    
    // 從目前的圖形上下文中獲取處理後的圖片
    let newImage = UIGraphicsGetImageFromCurrentImageContext()!
    
    // 結束目前的圖形上下文
    UIGraphicsEndImageContext()
    
    // 將處理後的圖片轉換為像素緩衝區，以供模型輸入使用
    guard let pixelBuffer = newImage.toPixelBuffer(pixelFormatType: kCVPixelFormatType_32ARGB, width: 224, height: 224) else {
        return
    }
    
    // 使用模型和輸入的像素緩衝區進行預測
    guard let prediction = try? model.prediction(image: pixelBuffer) else {
        return
    }
    
    // 從預測結果中提取預測的類別標籤
    let classLabel = prediction.classLabel
    
    // 通過移除逗號之後的額外資訊，清理類別標籤
    let cleanedLabel = cleanClassLabel(classLabel)
    
    // 獲取與預測的類別標籤相對應的概率值
    let probability = prediction.classLabelProbs[classLabel] ?? 0
    
    // 將概率值格式化為百分比字串
    let formattedProbability = String(format: "%.2f%%", probability * 100)
    
    // 使用清理後的類別標籤和格式化後的概率值設定預測文字
    predictionText = "I think this is a \(cleanedLabel) with probability \(formattedProbability)."}

// 清理類別標籤，通過移除逗號之後的額外資訊（如果存在）
    func cleanClassLabel(_ classLabel: String) -> String {
        if let commaIndex = classLabel.firstIndex(of: ",") {
            return String(classLabel[..<commaIndex])
        }
        return classLabel
    }
}

新增 Camera Library 觸發事件

    
    @State private var showCameraPicker = false
    @State private var showLibraryPicker = false


        var body: some View {
                ...

                // Camera Button
                Button(action: {
                    showCameraPicker = true
                }) {

                    ...
                }
                .sheet(isPresented: $showCameraPicker) {
                    ImagePicker(image: $inputImage, sourceType: .camera, onImagePicked: processImage)
                }
                                
                // Photo Library Button
                Button(action: {
                    showLibraryPicker = true
                }) {

                   ...
                }
                .sheet(isPresented: $showLibraryPicker) {
                    ImagePicker(image: $inputImage, sourceType: .photoLibrary, onImagePicked: processImage)
                }

}

授予 App 訪問相機與相簿的權限

點選你的專案，並點選 Info

將鼠標移動到任意位置會出現 + ，點選後新增 Privacy — Camera Usage Description 及 Privacy — Photo Library Usage

要使用相機功能，必須使用你的 iphone

若是沒有給定權限，使用 iphone 開啟相機時會閃退

執行結果

GitHub

GitHub - jasonyen1009/ImageRecognition01

You can't perform that action at this time. You signed in with another tab or window. You signed out in another tab or…

github.com

Reference

初探 Core ML：學習建立一個圖像識別 App

在 WWDC 2017 中，Apple 發表了許多令開發者們為之振奮的新框架（Framework）及 API 。而在這之中，最引人注目的莫過於 Core ML 了。藉由 Core ML，你可以為你的 App 添增機器學習(Machine…

www.appcoda.com.tw

(SwiftUI) 使用 CoreML 進行圖像辨識

到 Apple Developer 下載已經訓練好的 CoreML Model

Models - Machine Learning - Apple Developer

Build intelligence into your apps using machine learning models from the research community designed for Core ML.

將下載好的模型放入 Xcode 專案中

模型測試

製作使用者介面

使用 CoreML

實作相機以及相簿功能

執行結果

GitHub

GitHub - jasonyen1009/ImageRecognition01

You can't perform that action at this time. You signed in with another tab or window. You signed out in another tab or…

Reference

初探 Core ML：學習建立一個圖像識別 App

在 WWDC 2017 中，Apple 發表了許多令開發者們為之振奮的新框架（Framework）及 API 。而在這之中，最引人注目的莫過於 Core ML 了。藉由 Core ML，你可以為你的 App 添增機器學習(Machine…

利用 UIViewControllerRepresentable 協定在 SwiftUI 存取相簿並使用相機

先前我們曾探討 UIViewRepresentable 的用法，並展示了如何整合 UITextView 到 SwiftUI 專案中。雖然我們可以使用 UIViewRepresentable 協定包裝 UIKit 視圖…

Written by YEN HUNG CHENG

(SwiftUI) 使用 CoreML 進行圖像辨識

到 Apple Developer 下載已經訓練好的 CoreML Model

Models - Machine Learning - Apple Developer

Build intelligence into your apps using machine learning models from the research community designed for Core ML.

將下載好的模型放入 Xcode 專案中

模型測試

製作使用者介面

使用 CoreML

實作相機以及相簿功能

執行結果

GitHub

GitHub - jasonyen1009/ImageRecognition01

You can't perform that action at this time. You signed in with another tab or window. You signed out in another tab or…

Reference

初探 Core ML：學習建立一個圖像識別 App

在 WWDC 2017 中，Apple 發表了許多令開發者們為之振奮的新框架（Framework） 及 API 。而在這之中，最引人注目的莫過於 Core ML 了。藉由 Core ML，你可以為你的 App 添增機器學習(Machine…

利用 UIViewControllerRepresentable 協定 在 SwiftUI 存取相簿並使用相機

先前我們曾探討 UIViewRepresentable 的用法，並展示了如何 整合 UITextView 到 SwiftUI 專案中。雖然我們可以使用 UIViewRepresentable 協定包裝 UIKit 視圖…

Written by YEN HUNG CHENG

在 WWDC 2017 中，Apple 發表了許多令開發者們為之振奮的新框架（Framework）及 API 。而在這之中，最引人注目的莫過於 Core ML 了。藉由 Core ML，你可以為你的 App 添增機器學習(Machine…

利用 UIViewControllerRepresentable 協定在 SwiftUI 存取相簿並使用相機

先前我們曾探討 UIViewRepresentable 的用法，並展示了如何整合 UITextView 到 SwiftUI 專案中。雖然我們可以使用 UIViewRepresentable 協定包裝 UIKit 視圖…