GBST

public 2 min read
What is Gradient-Based Subword Tokenization? Gradient-Based Subword Tokenization (GBST) is a method of automatically learning latent subword representations from characters.…