Differentiable Clustering and Search
Differentiable Clustering Problem Formulation Let $\mathcal{X} = \{x_1, x_2, \dots, x_N\}$ denote a dataset of $N$ items, where each $x_i \in \mathbb{R}^d$ is a dense embedding vector representing a text tag. Our objective is to partition $\mathcal{X}$ into $k$ distinct clusters. We define a differentiable, parametric mapping $$ f_\theta : \mathbb{R}^d \rightarrow \Delta^{k-1} $$where $\Delta^{k-1}$ is the standard $(k - 1)$-dimensional probability simplex. The function $f_\theta$ is defined as a linear projection followed by a softmax activation: ...