Method to adapt tokenization across models.
Notable Methods
- use pairwise cosine similarity between token embeddings to create a grid of alignment
- initialize new adapted embeddings for each id’s most similar tokens
- tune
Method to adapt tokenization across models.