News
Encoder-only (e.g. BERT), Encoder-Decoder (e.g. T5), and Decoder-only (e.g. GPT-*, LLaMA, PaLM, etc.). Encoder-only and Encoder-Decoder variants have been particularly effective for use cases where we ...
NLG tasks are often based on the encoder-decoder framework, where the pretrained encoders can only benefit part of it. To reduce this gap, we introduce DeltaLM, a pretrained multilingual ...