Discover research papers with code implementations and reproducible results
We propose a new simple network architecture, the Transformer, based solely on attention mechanisms...
We present high-quality image synthesis results using diffusion probabilistic models...
We introduce BERT, designed to pretrain deep bidirectional representations from unlabeled text...
Connect your papers with code implementations for better reproducibility