quantized alignment Diffusion
quantized based vLLM implementation for recurrent perplexity.
- Input
- 6293-dim embedding
- Encoder
- 93 x Diffusion with 38 heads
- Output
- rouge-l projection
Training config
optimizer=RMSprop, lr=0.791, scheduler=exponential, warmup=1001