Computer Vision Onramp - Part 4 - Training and Postprocessing

Training an ACF detector and refining results with thresholding and NMS.

Posted Feb 6, 2026

By Sadman Ishtiak


Detect and Count Objects

Object detection training algorithms fall into two categories:

Classical machine learning (e.g., ACF): faster to train, requires less data, and detects one class at a time. Works well when objects are seen from consistent viewpoints.
Deep learning (e.g., YOLO): more accurate and handles more variability and multiple classes, but requires more data and a GPU.

ACF (Aggregated Channel Features) is a good fit for turtles because their appearance does not change significantly. Train a detector on the labeled images, then run it on a frame:

  
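% Train an ACF detector from images with labeled bounding boxes.
detector = trainACFObjectDetector(imsWithBoxLabels, NumStages=5);
% Run the detector on one frame; each row of bbox is [x y width height].
[bbox, score] = detect(detector, frame);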

If the detector produces false positives (for example, labeling a rock as a turtle), increase NumStages (e.g., to 20). More stages generally improve accuracy at the cost of longer training time.

Remove low-confidence detections. Plot a histogram of the scores to choose a threshold (30 in this example), then keep only the detections that score above it:

  
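histogram(score, 10)          % inspect the score distribution using 10 bins
bbox = bbox(score > 30, :);   % filter bbox first, while score is still unfiltered
score = score(score > 30);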
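
To make the logical indexing concrete, here is a minimal sketch on made-up data. The boxes, the scores, and the variable names `scoreThreshold` and `keep` are illustrative assumptions, not values from the turtle video.

```matlab
% Hypothetical detections: each row of bbox is [x y width height].
bbox  = [10 10 50 40; 200 120 60 45; 400 300 55 50; 90 310 20 15];
score = [72; 12; 45; 28];

scoreThreshold = 30;            % assumed value, as if read off the histogram
keep = score > scoreThreshold;  % logical mask with one entry per detection

% Index both arrays with the same mask so rows and scores stay aligned.
bbox  = bbox(keep, :);
score = score(keep);

disp(bbox)
disp(score)
```

Computing the mask once also removes the order dependence of the two-line version: if score were filtered first, the mask applied to bbox would be built from the already-shortened score vector.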

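Several overlapping boxes can identify the same object. Use selectStrongestBbox, which performs non-maximum suppression (NMS), to keep only the highest-scoring box in each overlapping group: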

  
[selectedBbox, selectedScore] = selectStrongestBbox(bbox, score, OverlapThreshold=0.1);
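
For intuition about what NMS does, the following is a rough sketch of a greedy version written in plain MATLAB on made-up boxes. It is not the toolbox implementation: it assumes the overlap is measured as intersection area divided by union area, and selectStrongestBbox may differ in details such as tie-breaking. The data and the names `suppressed`, `selected`, and `unionArea` are hypothetical.

```matlab
% Hypothetical detections: each row is [x y width height].
bbox  = [100 100 50 50; 105 102 50 50; 300 200 40 40; 98 97 52 52];
score = [60; 85; 40; 55];
overlapThreshold = 0.1;

[~, order] = sort(score, 'descend');   % visit the strongest boxes first
suppressed = false(size(score));
selected = [];

for i = 1:numel(order)
    a = order(i);
    if suppressed(a), continue; end
    selected(end+1) = a; %#ok<AGROW>   % keep the strongest remaining box
    for j = i+1:numel(order)
        b = order(j);
        if suppressed(b), continue; end
        % Width and height of the intersection rectangle (negative if disjoint).
        iw = min(bbox(a,1)+bbox(a,3), bbox(b,1)+bbox(b,3)) - max(bbox(a,1), bbox(b,1));
        ih = min(bbox(a,2)+bbox(a,4), bbox(b,2)+bbox(b,4)) - max(bbox(a,2), bbox(b,2));
        inter = max(iw, 0) * max(ih, 0);
        unionArea = bbox(a,3)*bbox(a,4) + bbox(b,3)*bbox(b,4) - inter;
        % Drop the weaker box when it overlaps the kept box too much.
        if inter / unionArea > overlapThreshold
            suppressed(b) = true;
        end
    end
end

selectedBbox  = bbox(selected, :);
selectedScore = score(selected);
disp(selectedBbox)
```

Here the three boxes clustered near (100, 100) collapse to the one with the highest score, while the isolated box at (300, 200) survives, so two boxes remain.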

  
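% Count the remaining boxes and overlay the total on the frame.
numBoxes = size(selectedBbox, 1);
str = numBoxes + " turtle(s) detected";   % number + string concatenates into a string
imgCounted = insertText(frame, [250 550], str);
imshow(imgCounted)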

Use a while loop with hasFrame to process a video stream:

  
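% turtleVideo is a VideoReader object.
while hasFrame(turtleVideo)
    frame = readFrame(turtleVideo);
    % ... detect, postprocess, and count ...
    imshow(imgCounted)   % annotated frame produced by the steps above
    drawnow              % refresh the figure on every iteration
end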
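
As a sketch of how the per-frame steps might fit inside that loop, the function below chains detection, thresholding, NMS, and counting. It assumes the Computer Vision Toolbox, a detector already trained with trainACFObjectDetector, and a VideoReader object. The function name `countTurtlesInVideo`, the `keep` mask, and the empty-detection guard are illustrative additions; the threshold of 30, the overlap threshold of 0.1, and the text position [250 550] are simply reused from the single-frame example.

```matlab
function countTurtlesInVideo(detector, turtleVideo)
% detector:    trained ACF object detector (assumed to exist already)
% turtleVideo: VideoReader object for the video to process

scoreThreshold   = 30;    % reused from the single-frame example
overlapThreshold = 0.1;   % reused from the single-frame example

while hasFrame(turtleVideo)
    frame = readFrame(turtleVideo);

    % Detect candidate objects in the current frame.
    [bbox, score] = detect(detector, frame);

    % Keep only the confident detections.
    keep  = score > scoreThreshold;
    bbox  = bbox(keep, :);
    score = score(keep);

    % Merge overlapping boxes; skipped when nothing was detected.
    if ~isempty(bbox)
        bbox = selectStrongestBbox(bbox, score, OverlapThreshold=overlapThreshold);
    end

    % Count the boxes and overlay the total on the frame.
    numBoxes   = size(bbox, 1);
    label      = sprintf('%d turtle(s) detected', numBoxes);
    imgCounted = insertText(frame, [250 550], label);

    imshow(imgCounted)
    drawnow
end
end
```

The text position assumes frames at least as large as the one in the single-frame example; for smaller videos it would need adjusting.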

This post is licensed under CC BY 4.0 by the author.