iTransformer: Inverted Transformers Are Effective for Time Series Forecasting
LoRA: Low-Rank Adaptation of Large Language Models
On the Integration of Self-Attention and Convolution
Unified Training of Universal Time Series Forecasting Transformers
CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification
Multimodal Representation Learning by Alternating Unimodal Adaptation
Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting