Abstract
Cylindrical lenses are extensively utilized in optical systems and laser pumping, where high-quality standards are imperative. This study introduces a visual positioning method for cylindrical lenses utilizing the YOLOX algorithm, aimed at improving loading qualification rates and enhancing production efficiency. Furthermore, an integrated automatic sorting and loading manufacturing system for cylindrical lenses was developed. Initially, the template matching algorithm served as the baseline for visual positioning. A dataset containing 1263 annotated images (covering R10.543, R11, NG, Rect, and Circle categories) was constructed by labeling collected operation images for model training. The original YOLOX model was improved by eliminating the non-maximum suppression step. Specifically, the loss function was optimized through the integration of constraints on the Intersection over Union (IoU), area ratio, and aspect ratio between the predicted and ground-truth bounding boxes. Additionally, a weight factor α was applied to the IoU loss. Further modifications included decoupling the YOLOX head to allow independent training of the prediction branch. An attention mechanism was also introduced to filter out irrelevant information, thereby improving model performance. Comparative performance experiments on the cylindrical lens dataset revealed that the E-YOLOX model achieves a mAP50 of 98.12% and an FPS of 30.45 frame/s, outperforming mainstream object detection algorithms (e.g. YOLOv8, YOLOv11). Practical results indicated that the algorithm exhibited a 9% improvement in stability and achieved an average localization accuracy (RMSE = 2.49 pixels) six times greater than the baseline NCC template matching algorithm (RMSE = 15.12 pixels), thereby satisfying the operational requirements for automated sorting and loading tasks.
Keywords
Get full access to this article
View all access options for this article.
