News & Publications
2024
- Data Processing for the OpenGPT-X Model Family
  Oct 11, 2024
- Towards Multilingual LLM Evaluation for European Languages
  Oct 11, 2024
- Progress Report: Towards European LLMs
  Sep 30, 2024
- Performance and Power: Systematic Evaluation of AI Workloads on Accelerators with CARAML
  Sep 19, 2024
- OpenGPT-X Team Releases Its European LLM Leaderboard
  Jul 12, 2024
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
  Jul 10, 2024
- Symmetric Dot-Product Attention for Efficient Training of BERT Language Models
  Jun 10, 2024
- ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler
  Mar 26, 2024
- Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions?
  Feb 21, 2024
2023
- SC23 WHPC Workshop Paper: OpenGPT-X – Novel Architecture Exploration
  Dec 12, 2023
- OpenGPT-X: Novel Architecture Exploration
  Nov 12, 2023
- Tokenizer Choice For LLM Training: Negligible or Crucial?
  Oct 12, 2023
- ISC23 Project Poster: OpenGPT-X – Training Large Language Models on HPC Systems
  May 26, 2023
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learning
  Jan 23, 2023