We are thrilled to announce the results of the OpenCV Spatial AI Competition, sponsored by Intel. We believe the people who won the competition are not just talented AI engineers but also trailblazers who are leading the way in making the world a better place. Congratulations to all of them!
Selecting the best of the best was never easy, so we decided on two winners each for the Gold, Silver, and Bronze categories.
Our Six Grand Prize Winners

- Vision System for visually impaired: $3,000
- Universal Hand Control: $3,000
- Parcel Classification & Dimensioning: $2,000
- Real Time Perception for Autonomous Forklifts: $2,000
- At Home Workout Assistant: $1,000
- Automatic Lawn Mower Navigation: $1,000
We were overwhelmed to see so many submissions from Artificial Intelligence and Machine Learning experts. We are glad to introduce the finest projects submitted by the participants below:
Vision System for visually impaired
Team: Jagadish Mahendran
Mission
The focus here is to provide a reliable smart perception system that assists blind and visually impaired people in safely moving through a variety of indoor and outdoor environments, using an OAK-D sensor.
Solution
In this project, the team developed a comprehensive vision system for visually impaired people, covering indoor and outdoor navigation along with scene understanding. The system is simple and fashionable, and is not noticeable as an assistive device. Common challenges such as traffic signs, hanging obstacles, crosswalk detection, moving obstacles, elevation changes (e.g., staircase detection, curb entry/exits), and localization are addressed using advanced perception tasks that can run on low computing power. A convenient, user-friendly voice interface allows users to control and interact with the system. After conducting hours of testing in downtown Monrovia, California, and neighboring areas, we are confident that this project addresses common challenges faced by visually impaired people.
"I would like to thank Daniel T. Barry for his help, support and advice throughout the project, Breean Cox for her continuous valuable inputs, labelling and educating me on challenges faced by visually impaired people, my wife Anita Nivedha for her encouragement and walking tirelessly with me for collecting the dataset and helping with testing,"
Notes from Author
Motivation for the project
Back in 2013, as I started my Master's in Artificial Intelligence, the idea of developing a visual assistance system first occurred to me. I had even shared a proposal with a professor. At the time, smart sensors were not feasible to use, and deep learning techniques and edge AI were not yet mainstream in computer vision, which made it difficult to make progress on this project. I have been an AI Engineer for the past 5 years. Earlier this year, when I met my visually impaired friend Breean Cox, I was struck by the irony that while I have been teaching robots to see, there are many people who cannot see and need help. This further motivated me to build the visual assistance system. The timing of the OpenCV Spatial AI Competition could not have been better; it was the perfect channel for me to build this system and bring the idea to life.
Note: We are working on an updated research paper that will give more insight into our mission and the solution we created. We will publish it here on the blog very soon.
Universal Hand Control
Team: Pierre Mangeot
Mission
Solution
Today's technologies can estimate hand pose (neural networks) or measure the position of the hand in space (depth cameras). A device like the OAK-D even offers the possibility of combining the two techniques in one device. Using a hand to control connected devices (smart speakers, smart plugs, TVs, … an ever-growing market) or to interact with applications running on computers, without touching a keyboard or a mouse, becomes possible. More than that, with Universal Hand Control, it becomes straightforward.
Universal Hand Control is a framework (dedicated hardware + Python library) that makes it easy to integrate the « hand controlling » capability into programs.
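To make the idea concrete, here is a minimal sketch (our own illustration, not the author's code) of the core geometry: back-projecting a detected 2D hand keypoint through a depth map to a 3D position, and using that position to trigger a device action. The camera intrinsics, the millimeter depth units, the zone coordinates, and the toggle_lamp callback are all assumptions for illustration.

```python
import numpy as np

# Hypothetical camera intrinsics (focal lengths and principal point, in pixels).
FX, FY, CX, CY = 860.0, 860.0, 640.0, 360.0

def keypoint_to_3d(u, v, depth_map):
    """Back-project a 2D hand keypoint (u, v) to a 3D point in meters,
    using the depth value sampled at that pixel."""
    z = depth_map[int(v), int(u)] / 1000.0  # depth map assumed in millimeters
    x = (u - CX) * z / FX
    y = (v - CY) * z / FY
    return np.array([x, y, z])

def in_zone(point, center, radius=0.15):
    """True if the 3D hand position falls inside a spherical trigger zone."""
    return np.linalg.norm(point - center) < radius

# Example: a zone in front of the camera that toggles a hypothetical smart plug.
LAMP_ZONE = np.array([0.0, -0.2, 0.8])

def on_frame(hand_uv, depth_map, toggle_lamp):
    """Called once per frame with the detected hand keypoint, if any."""
    hand_3d = keypoint_to_3d(*hand_uv, depth_map)
    if in_zone(hand_3d, LAMP_ZONE):
        toggle_lamp()  # user-supplied callback, e.g. a smart-plug API call
```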
Currently, the OAK-D is used as an RGBD sensor and the models run on the host. Once the Gen2 pipeline is available, the models will be able to run on the device itself. Details and status of the Gen2 pipeline can be followed on GitHub.
We are grateful to PINTO's Model Zoo repository, which helped us tune the results of the trained models efficiently while developing this project.
Link to Model Zoo : https://github.com/PINTO0309/PINTO_model_zoo
Parcel Classification & Dimensioning
Team: Abhijeet Bhatikar, Daphne Tsatsoulis, Nils Heyman-Dewitte, William Diggin
Mission
- Many cargo companies do not measure their cargo accurately at scale using the newest technology. At best, they describe their cargo with width, height, and depth. In many cases, a manual measuring tape is used to make these guesstimates.
- Even when they do measure their cargo at a more granular level, they almost always fit cuboids on top of irregularly shaped objects. For example, a cylindrical barrel's dimensions are recorded as the smallest cuboid that fits the barrel (width x length x height).
Solution
The team has built an end-to-end proof of concept for measuring and loading packages into cargo containers. The solution leverages the DepthAI USB3 (OAK-D) camera to determine a package's shape accurately, then calculates the width, length, and height of packages for cargo shipment. All of this can be managed by a software tool (3D CONTAINER STACKING SOFTWARE) that finds the optimal arrangement of a batch of packages given the container's size.
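As a rough illustration of the dimensioning step, here is a minimal sketch (our assumption of the approach, not the team's code) that estimates a package's dimensions from a 3D point cloud by aligning it to its principal axes and measuring the extents. The synthetic box cloud stands in for real OAK-D depth data.

```python
import numpy as np

def package_dimensions(points):
    """Estimate the dimensions of a package from its 3D point cloud
    (N x 3 array, meters) by fitting an orientation-aligned bounding box:
    align the cloud to its principal axes, then measure each extent."""
    centered = points - points.mean(axis=0)
    # Principal axes of the cloud via SVD of the centered points.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    aligned = centered @ vt.T
    extents = aligned.max(axis=0) - aligned.min(axis=0)
    return np.sort(extents)[::-1]  # longest first: length, width, height

# Example with a synthetic 0.4 x 0.3 x 0.2 m box.
rng = np.random.default_rng(0)
cloud = rng.uniform([0, 0, 0], [0.4, 0.3, 0.2], size=(5000, 3))
print(package_dimensions(cloud))  # approximately [0.4, 0.3, 0.2]
```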
Real Time Perception for Autonomous Forklifts
Team: Kunal Tyagi, Kota Mogami, Francesco Savarese, Bhuvanchandra DV
Mission
The weakness of a modern autonomous forklift is that any part of its technology can fail due to the lack of a clear view while loading or unloading cargo. Often, a technology breakdown in an autonomous forklift involves a long waiting period.
Solution
The team has come up with an impressive solution to address this issue: Real-Time Perception for Autonomous Forklifts. It enhances the functionality of advanced sensors as well as vision and geo-guidance technology. Many people outside the industry do not know that autonomous vehicles have already taken on a significant part of the logistics work process. Autonomous forklifts load, unload, and transport goods within the warehouse area, connecting to one another and forming flexible conveyor belts. Thus, it is imperative to have a solution that enhances the overall performance of autonomous forklifts, avoids frequent breakdowns, and lets them work at full potential.
"Special thanks to Ayush Gaud, Luong Pham, Kousuke Yunoki, and Yu Okamoto
for helping with the project."Notes from Author
At Home Workout Assistant
Team: Daniel Rodrigues Perazzo, Gustavo Camargo Rocha Lima, Natalia Souza Soares, Victor Gouveia de Menezes Lyra
Mission
We observed that with the closure of gyms and public spaces this year, doing physical activities while maintaining social distancing became a challenge. However, exercising alone can also be complicated and even dangerous, due to the risk of muscle injuries.
Solution
The team has developed a Motion Analysis for Home Gym System (MAHGS) to assist users during at-home workouts. The solution estimates the person's 3D human pose and analyzes the skeleton's movements, returning appropriate feedback to help the user perform an exercise correctly.
The system is composed of two main modules:
- 3D Human Pose Estimation – Developed using the neural inference capabilities of the OAK-D and the DepthAI library. The human pose estimation module runs on a PC/Raspberry Pi 4 connected to the OAK-D, since the OAK-D does not yet have WiFi or Bluetooth.
- Motion Analysis – Implemented using a technique called Ikapp. The movement analysis module runs on a smartphone, communicating with the pose estimation module over TCP/IP using the ZeroMQ framework (see the sketch after this list).
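For a flavor of the inter-device communication, here is a minimal ZeroMQ sketch (our assumption, not the team's actual code) that streams pose keypoints from the PC to the phone app over TCP using the PUB/SUB pattern. The port number and the JSON message format are illustrative; both ends are shown in one script for runnability.

```python
import json
import time
import zmq

ctx = zmq.Context()

# PC side: publish 3D pose keypoints over TCP (port 5555 is an assumption).
pub = ctx.socket(zmq.PUB)
pub.bind("tcp://*:5555")

# Phone side: subscribe to the pose stream (runs on the device in practice).
sub = ctx.socket(zmq.SUB)
sub.connect("tcp://localhost:5555")
sub.setsockopt_string(zmq.SUBSCRIBE, "")
time.sleep(0.2)  # give the PUB/SUB connection time to establish

# One illustrative frame: a list of [x, y, z] joints from the pose estimator.
pub.send_string(json.dumps({"pose": [[0.1, 0.5, 1.2], [0.2, 0.4, 1.2]]}))

frame = json.loads(sub.recv_string())
print(frame["pose"])  # the analysis module would score this movement
```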
Automatic Lawn Mower Navigation
Team: Jan Lukas Augustin
Mission
The project aims to stop automatic lawn mowers from:
- Killing small animals such as hedgehogs.
- Hurting children, cats, dogs, etc.
- Driving into molehills and crashing into “unwired” obstacles such as trees.
Solution
- Neural inference for object detection on the Intel Movidius Myriad X, using the 4K RGB camera.
- Point cloud classification based on both mono cameras' disparity/depth streams.
- Disparity image classification based on the disparity and rectified-right streams.
- Motion estimation using the rectified-right stream.
Three models handle the classification (a sketch of the first follows this list):
- Point Cloud: Elliptic Envelope for outlier detection.
- Disparity: Support Vector Machine.
- Objects: MobileNet-SSD for object detection.
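As a rough illustration of the point-cloud detector, here is a minimal scikit-learn sketch (our assumption of the approach, not the team's code) that fits an Elliptic Envelope on height samples from flat, obstacle-free lawn and flags out-of-distribution points such as molehills. The one-dimensional height feature is a simplification of real point-cloud features.

```python
import numpy as np
from sklearn.covariance import EllipticEnvelope

# Train on z-heights (meters) sampled from flat, obstacle-free lawn.
rng = np.random.default_rng(0)
flat_lawn = rng.normal(0.0, 0.01, size=(2000, 1))

# Fit a robust Gaussian envelope; ~1% of training points treated as noise.
detector = EllipticEnvelope(contamination=0.01).fit(flat_lawn)

# Classify points from a new frame: a 0.12 m bump should be flagged.
new_frame = np.array([[0.005], [0.002], [0.12]])
labels = detector.predict(new_frame)  # +1 = inlier (lawn), -1 = outlier
print(labels)  # -> [ 1  1 -1]
```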
Contest Details
We started this competition back in July this year, and unexpectedly it became so popular that we received more than 235 submissions in just a few days. Initially, we shortlisted 32 winners in the Phase 1 results, and we extended our deadline after seeing so many high-quality submissions that declaring the best of the best became very challenging.
Finally, we congratulate all the winners, as well as every participant who took part in the contest. We have more contests coming up, the next in January 2021 with 10X the rewards. To know more about the forthcoming contests, please click here.
We are thankful to Intel for its technology leadership in edge AI and for sponsoring this competition.
OpenCV