SKYWAVE: Enhancing Real-Time Performance in Digital Communications using Machine Learning

Project Overview

Funding: ARDC (Amateur Radio Digital Communications)

Duration: 2025-2029

Role: Principal Investigator

Institution: University of Leeds, School of Computing

The SKYWAVE project explores the synergy between artificial intelligence (AI), machine learning (ML), and real-time distributed (RTD) systems, using wireless communication and amateur radio as experimental platforms. Our research hypothesis is that AI and ML can provide quantitative insights into the likelihood of successful message transmissions over wireless channels by analyzing previously sent messages, thereby reducing uncertainty in real-time communication schedulers.

Research Objectives

The project aims to develop a transmission scheduler that leverages machine learning insights to optimize HF (High Frequency) communications. Our key objectives include:

Dataset Creation: Build comprehensive FT8 datasets from multiple HF bands (20m, 17m, 40m, 15m) to train ML models
ML Model Development: Train models that can predict transmission success based on channel conditions
Intelligent Scheduler Design: Create a real-time scheduler that uses ML predictions to optimize transmission parameters
Validation: Test and validate the system through simulation and real-world deployments

The focus is on producing models and algorithms that can execute on small devices with limited processing and storage capabilities, making them suitable for IoT and amateur radio applications.

Key Components

1. FT8 Dataset Collection Tool (FT8DC)

We have developed FT8DC (FT8 Dataset Creator), an automated tool for creating FT8 datasets. The tool operates by:

Listening to an FT8 band for a specified duration using WSJT-X
Transmitting CQ calls on the same frequency
Querying PSK Reporter to identify which stations received the transmission
Logging all relevant metrics including SNR, timestamps, and propagation data

Figure 1: FT8DC program execution workflow showing the three main stages of data collection

Data Collection Bands:

20m (14 MHz)
17m (18 MHz)
40m (7 MHz)
15m (21 MHz)

2. Feature Extraction Pipeline

Our approach extracts relevant features from received FT8 messages to predict transmission success. The feature extraction process analyzes:

Signal-to-Noise Ratio (SNR) distributions
Time offset variations (DT)
Number and geographic distribution of receiving stations
Temporal patterns in message reception
Band conditions and propagation characteristics

Figure 2: Feature extraction pipeline from received FT8 messages

3. ML-Assisted Scheduler Architecture

The scheduler integrates machine learning predictions with real-time decision-making to optimize transmission parameters such as frequency selection, power levels, timing, and retransmission strategies.

Figure 3: Architecture showing how the ML model interacts with the real-time scheduler

Research Methodology

The project follows a three-stage approach:

Training Stage

Collect large datasets using FT8DC across multiple HF bands
Extract features from received messages and correlate with transmission success
Train ML models (neural networks, deep learning) to recognize patterns
Validate model accuracy through cross-validation and testing

Deployment Stage

Integrate lightweight ML models into the transmission scheduler
Use real-time channel observations to feed the ML model
Scheduler makes decisions based on ML predictions and RTD constraints
Provide statistical guarantees for message delivery (e.g., 90% success rate for high-priority messages)

Evaluation Stage

Develop simulation environment to test scheduler performance
Deploy system in real amateur radio stations for field testing
Compare performance against traditional propagation models
Iterate and refine based on results

Expected Outcomes

Open Datasets: Multiple FT8 datasets with thousands of entries, publicly available
Software Tools: FT8DC and related tools for dataset creation, feature extraction, and ML model training
ML Models: Pre-trained models for HF propagation prediction, optimized for resource-constrained devices
Scheduler Algorithms: Novel real-time scheduling algorithms under uncertainty
Publications: Academic papers in real-time systems, wireless communications, and ML conferences
Community Resources: Documentation and tutorials for amateur radio operators

All datasets, software, and publications will be made publicly available following open science principles.

Team

Principal Investigator: Prof Leandro Soares Indrusiak (G5LSI)

PhD Student: Miguel Boing (M7NSE)

Assessor: Prof Zheng Wang (University of Leeds)

The project is part of the Distributed Systems and Services research group at the University of Leeds School of Computing.

Impact

This project benefits both the academic community and amateur radio operators by:

Advancing state-of-the-art in ML-assisted real-time scheduling
Providing data-driven propagation prediction models
Enabling autonomous and intelligent amateur radio stations
Supporting performance optimization for contest stations and emergency communications
Creating tools for quantitative comparison of station configurations

Contact

For more information about this project, please contact Prof Leandro Soares Indrusiak at L.SoaresIndrusiak@leeds.ac.uk

Leandro Soares Indrusiak