savtadepth / README.md
kingabzpro's picture
Update README.md
e12681f
|
raw
history blame
7.14 kB
metadata
title: Monocular Depth Estimation
emoji: πŸ›‹
colorFrom: gray
colorTo: yellow
sdk: gradio
sdk_version: 2.8.12
app_file: app/app_savta.py
pinned: false
license: mit

Savta Depth - Monocular Depth Estimation OSDS Project

Savta Depth is a collaborative Open Source Data Science project for monocular depth estimation.

Here you will find the code for the project, but also the data, models, pipelines and experiments. This means that the project is easily reproducible on any machine, but also that you can contribute data, models, and code to it.

Have a great idea for how to improve the model? Want to add data and metrics to make it more explainable/fair? We'd love to get your help.

Demo

Open In Colab

You can use this notebook to load a model from the project and run it on an image you uploaded, to get the depth map. Once it has been saved, you can download it to use on platforms that support it (e.g. Facebook) to create 3d photos.

Contributing Guide

Here we'll list things we want to work on in the project as well as ways to start contributing. If you'd like to take part, please follow the guide.

Setting up your environment to contribute

  • To get started, fork the repository on DAGsHub
  • Now, you have 3 way to set up your environment: Google Colab, local or docker. If you're not sure which one to go with, we recommend using Colab.

Google Colab

Google Colab can be looked at as your web connected, GPU powered IDE. Below is a link to a well-documented Colab notebook, that will load the code and data from this repository, enabling you to modify the code and re-run training. Notice that you still need to modify the code within the src/code/ folder, adding cells should be used only for testing things out.

You can also use this notebook to load a model from the project and run it on an image you uploaded, to get the depth map. Once it has been saved, you can download it to use on platforms that support it (e.g. Facebook) to create 3d photos.

In order to edit code files, you must save the notebook to your drive. You can do this by typing ctrl+s or cmd+s on mac.

>> SavtaDepth Colab Environment <<

NOTE: The downside of this method (if you are not familiar with Colab) is that Google Colab will limit the amount of time an instance can be live, so you might be limited in your ability to train models for longer periods of time.

This notebook is also a part of this project, in case it needs modification, in the Notebooks folder. You should not commit your version unless your contribution is an improvement to the environment.

Local

  • Clone the repository you just forked by typing the following command in your terminal:

    $ git clone https://dagshub.com/<your-dagshub-username>/SavtaDepth.git
    
  • Create a virtual environment or Conda environment and activate it

    # Create the virtual environment
    $ make env
    
    # Activate the virtual environment
    # VENV
    $ source env/bin/activate .
    
    # or Conda
    $ source activate savta_depth
    
  • Install the required libraries

    $ make load_requirements
    

    NOTE: Here I assume a setup without GPU. Otherwise, you might need to modify requirements, which is outside the scope of this readme (feel free to contribute to this).

  • Pull the dvc files to your workspace by typing:

    $ dvc pull -r origin
    $ dvc checkout #use this to get the data, models etc
    
  • After you are finished your modification, make sure to do the following:

    • If you modified packages, make sure to update the requirements.txt file accordingly.

    • Push your code to DAGsHub, and your dvc managed files to your dvc remote. For reference on the commands needed, please refer to the Google Colab notebook section – Commiting Your Work and Pushing Back to DAGsHub.

Docker

  • Clone the repository you just forked by typing the following command in your terminal:

    $ git clone https://dagshub.com/<your-dagshub-username>/SavtaDepth.git
    
  • To get your environment up and running docker is the best way to go. We use an instance of MLWorkspace.

    • You can Just run the following commands to get it started.

      $ chmod +x run_dev_env.sh
      $ ./run_dev_env.sh
      
    • Open localhost:8080 to see the workspace you have created. You will be asked for a token – enter dagshub_savta

    • In the top right you have a menu called Open Tool. Click that button and choose terminal (alternatively open VSCode and open terminal there) and type in the following commands to install a virtualenv and dependencies:

      $ make env
      $ source activate savta_depth
      

      Now when we have an environment, let's install all of the required libraries.

      Note: If you don't have a GPU you will need to install pytorch separately and then run make requirements. You can install pytorch for computers without a gpu with the following command:

      $ conda install pytorch torchvision cpuonly -c pytorch
      

      To install the required libraries run the following command:

      $ make load_requirements
      
  • Pull the dvc files to your workspace by typing:

    $ dvc pull -r dvc-remote
    $ dvc checkout #use this to get the data, models etc
    
  • After you are finished your modification, make sure to do the following:

    • If you modified packages, make sure to update the requirements.txt file accordingly.

    • Push your code to DAGsHub, and your dvc managed files to your dvc remote. For reference on the commands needed, please refer to the Google Colab notebook section – Commiting Your Work and Pushing Back to DAGsHub.


After pushing code and data to DAGsHub

  • Create a Pull Request on DAGsHub!
    • Explain what changes you are making.
    • If your changes affect data or models, make sure they are pushed to your DAGsHub dvc remote, and are included in the PR.
    • We will review your contribution ASAP, and merge it or start a discussion if needed.
  • 🐢

TODO:

  • Web UI
  • Testing various datasets as basis for training
  • Testing various models for the data
  • Adding qualitative tests for model performance (visually comparing 3d image outputs)