Generate saliency maps from RGB and depth images
Merge PDFs and convert to images
Detect objects in images and videos using YOLOv5