In August 2024, we launched a dataset collection effort on the swipe.futo.org domain. Users would voluntarily visit the webpage on their mobile phone and be given instructions and information about the dataset. After consenting, they would be given sentences, primarily from Wikipedia, and would be asked to swipe them word-by-word.
In the end, this produced over 1 million swipes. Approximately x% were filtered out for quality concerns. In December 2024, we released a dataset of 1 million swipes under the MIT license, and it is available today as a HuggingFace dataset.
We made heavy use of this data to train our models and to evaluate different swipe typing systems.