Adding training/testing instructions. Fix bugs in draw_heatmap.py. Upload model weights. Pack CUDA kernel into a python package. $ conda create --name 2dmamba python=3.10 $ conda activate 2dmamba $ ...