diff --git a/docs/index.html b/docs/index.html index 4fa9c32..961029e 100644 --- a/docs/index.html +++ b/docs/index.html @@ -185,6 +185,48 @@

Abstract

+ +
+
+
+

Model

+
+ Model architecture +
+
+
+

+ CoViS-Net consists of four primary components: an image encoder $f_\mathrm{enc}$, a pairwise pose encoder $f_\mathrm{pose}$, a multi-node aggregator $f_\mathrm{agg}$, and a BEV predictor $f_\mathrm{BEV}$. + The image encoder uses a pre-trained DinoV2 model with additional layers to generate the embedding $\mathbf{E}_i$ from image $I_i$. + These embeddings are communicated between robots. The pose estimator takes two embeddings $\mathbf{E}_i$ and $\mathbf{E}_j$ as input and predicts pose estimates with uncertainty. + The multi-node aggregator combines the estimated poses with image embeddings from multiple robots and aggregates them into a common representation. + Finally, the BEV predictor generates a bird's-eye-view representation from the aggregated information. +

+
+
+
+
+

Training

+
+ Model architecture +
+
+
+

+ We train CoViS-Net using supervised learning on data from the Habitat simulator with the HM3D dataset. + This provides a diverse range of photorealistic indoor environments. + Our loss functions include components for pose estimation, uncertainty prediction, and BEV representation accuracy. + CoViS-Net incorporates uncertainty estimation using Gaussian Negative Log Likelihood (GNLL) Loss. + This allows the model to learn and predict aleatoric uncertainty $\hat{\sigma}^2$ from data points $\mu$, which is crucial for downstream robotic applications. + By providing uncertainty estimates, the system can make more informed decisions in challenging scenarios. +

+
+
+
+
+
+ +
diff --git a/docs/static/images/gnll.png b/docs/static/images/gnll.png new file mode 100644 index 0000000..91ee004 Binary files /dev/null and b/docs/static/images/gnll.png differ diff --git a/docs/static/images/model.jpg b/docs/static/images/model.jpg new file mode 100644 index 0000000..2592bab Binary files /dev/null and b/docs/static/images/model.jpg differ