AAI_2025_Capstone_Chronicles_Combined
15
Draw, Detect, Navigate
Figure 4
Nano Models accuracy on 1,000 images, 5241 doodles within the images
Figure 5
Small Models accuracy on 1,000 images, 5241 doodles within the images
This best performing model was exported in the Open Neural Network Exchange (ONNX)
file formatted and imported into a Unity 6 project using the AI support package Sentis. Using
C#, live frames from a webcam video feed were converted to tensors, then fed to the model.
Model output was subsequently copied back to the CPU, and parsed to visualize bounding
boxes. The simple webcam approach did not have the Simultaneous Localization and Mapping
(SLAM) or depth data available, so for paper surface alignment, the OpenCV package for ArUco
marker tracker was used. Using this approach, drawings are detected in real time on every
frame, with an average Frames Per Second (FPS) of 28. Upon user button press, the centers of
42
Made with FlippingBook - Share PDF online