TABLE OF CONTENTS

Introduction ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing Overcoming Limitations with ZUNA Results Architecture Data Release Disclaimer

ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing Overcoming Limitations with ZUNA Results Architecture Data

Introduction ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing Overcoming Limitations with ZUNA Results

Introduction ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing Overcoming Limitations with ZUNA Results Architecture

Introduction ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing Overcoming Limitations with ZUNA

Introduction ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing Overcoming Limitations with ZUNA Results

Introduction ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing Overcoming Limitations with ZUNA

Introduction ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing

Introduction ZUNA Why EEG Needs Foundation Models Limitations with EEG Signal Processing Overcoming Limitations with ZUNA Results Architecture Data Release

Introduction ZUNA Why EEG Needs Foundation Models

Introduction

Zyphra is excited to announce ZUNA, our first foundation model trained on brain data. We believe thought-to-text will be the next major modality beyond language, audio, and vision enabled by noninvasive brain–computer interfaces (BCIs).

ZUNA is an early effort to build general foundation models of neural signals that can be used to understand and decode brain states. ZUNA is a key component in our mission to build human-aligned superintelligence. Over time, we see these models forming the foundation of thought-to-text agentic systems.

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Data fragmentation is a major reason for the lack of EEG foundation models. EEG datasets are typically small, collected under different protocols, and distributed across many institutions. This makes it difficult to aggregate data at the scale that has powered progress in other modalities.

And yet, there is clearly immense information and structure contained in EEG signals, which could power downstream tasks like understanding emotional and attentional states to decoding thoughts and dreams.

We aim to apply the classical deep learning methodology of creating general pretrained foundation models upon this data, with the goal of discovering generalizable representations underlying these signals and developing a foundation model that will translate thought-to-text.

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Limitations with EEG Signal Processing

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

The standard approach for handling missing or noisy channels is spherical spline interpolation, which is the default method in the widely-used MNE package¹. While simple and fast, this method has only a surface-level understanding of EEG structure and can result in poor or misleading reconstructions especially as channel degradation increases.

ZUNA replaces spherical spline interpolation with a learned, data-driven approach. By leveraging representations learned across a large and diverse EEG corpus, ZUNA can reconstruct signals in a way that captures the underlying patterns in brain activity rather than simple spatial smoothing.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Introduction

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Limitations with EEG Signal Processing

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Overcoming Limitations with ZUNA

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

EEG datasets often contain sessions that are partially unusable due to corrupted channels or intermittent dropout. These sessions are frequently discarded, reducing sample size and statistical power. ZUNA enables researchers to recover usable signals from such recordings, effectively increasing dataset size without additional data collection.

Upgrade Low-Channel and Consumer Hardware

Many modern EEG devices trade spatial resolution, the number of electrodes on the device, for accessibility. ZUNA allows low-channel systems to be mapped into a higher-resolution signal space, narrowing the gap between consumer-grade and lab-grade recordings and enabling analyses that would otherwise be infeasible.

Reduce Dependence on Fixed Electrode Montages

Traditional EEG analysis pipelines assume fixed montages (e.g., 10–20 or 10–10). ZUNA operates directly on electrode coordinates, allowing it to generalize across arbitrary channel counts and layouts. This makes cross-dataset and cross-device analyses substantially easier.

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Limitations with EEG Signal Processing

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Overcoming Limitations with ZUNA

We compared the performance of ZUNA to spherical spline interpolation which is ubiquitously used by EEG researchers and practitioners and is included as the default in the widely-used MNE package. Spherical spline interpolation is a surprisingly robust baseline when relatively small numbers of channels are dropped out, but degrades in performance with increasingly degraded data. We evaluated ZUNA’s performance on a validation set, a portion of our training data the model had not seen, and on several unseen test datasets of different distributions.

Results

We find that across datasets, ZUNA outperforms spherical spline interpolation by a significant margin, with the advantage increasing at high levels of dropout. When dropping more than 75% of channels, ZUNA outperforms spherical spline interpolation across all datasets.

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Why EEG Needs Foundation Models

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Limitations with EEG Signal Processing

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Why EEG Needs Foundation Models

Introduction

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

ZUNA

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Why EEG Needs Foundation Models

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Limitations with EEG Signal Processing

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Overcoming Limitations with ZUNA

Results

Architecture

ZUNA leverages a diffusion autoencoder architecture based on a transformer backbone. An encoder maps EEG signals to a shared latent space and a decoder reconstructs EEG signals from latents. We trained with a masked reconstruction loss and a heavy dropout scheme, enabling ZUNA to denoise existing channels and predict new ones during inference.

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Limitations with EEG Signal Processing

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Overcoming Limitations with ZUNA

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Results

Architecture

Data

To handle EEG data, which can contain an arbitrary number of channels in arbitrary positions on the scalp, we introduce two architectural innovations:

To adapt the transformer architecture to a heterogeneous number of channels where each channel is a continuous, real-valued signal, we first compressed each channel’s signal into 0.125s ‘chunks’ which the model learned to map to continuous ‘tokens’. We then rasterized these tokens into a single 1-D sequence that can be operated upon by the standard transformer architecture.
To represent the physical location in space of the recording electrodes, which can differ across datasets or even samples, we utilized 4-D RoPE as position embeddings. For each channel token we encoded the electrode x, y, z positions and the coarse time dimension as separate into separate components of the attention head dimension. We found that this encoding method enabled us to efficiently generalize to novel x, y, z positions in a smooth and stable way while still representing the sequence ordering across time.

To train ZUNA, we curated approximately 2 million channel-hours of EEG data from a wide range of publicly available sources. All data used a standardized preprocessing pipeline to make it suitable for large-scale foundation model training.

We plan to open-source our data and data infrastructure. Large, high-quality public datasets are essential for progress in any deep learning domain, EEG included.

We are releasing ZUNA with a permissive licensing (Apache 2.0) and practical tooling so it can be easily integrated into real-world workflows.

Model weights: ZUNA Hugging Face
Inference & preprocessing code: ZUNA GitHub
Pip install Python package: pip install zuna
Technical paper

At 380 million parameters, ZUNA is lightweight enough to run quickly on a consumer GPU and can be used on CPU for many workloads.

We hope researchers, clinicians, and builders put ZUNA to work, provide feedback, and help shape the next generation of brain foundation models.

Organizations or researchers interested in collaborating with Zyphra to improve future versions for specific needs or use cases should contact us at bci@zyphra.com.

Release

Disclaimer

Disclaimer: This website and related services (“Services”) are provided for research use only and is not intended for use in the diagnosis, cure, mitigation, treatment, or prevention of any disease or health condition. The Services have not been validated for any medical or clinical use. The information provided through the Services is for general informational purposes only and is not a substitute for any professional medical or healthcare advice. We do not warrant that any information provided through the Services is accurate, complete, or useful to you. Any reliance you place on such information is strictly at your own risk.

Link to Cookbook (GitHub)

Introduction

We plan to open-source our data and data infrastructure. Large, high-quality public datasets are essential for progress in any deep learning domain, EEG included.

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Limitations with EEG Signal Processing

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Overcoming Limitations with ZUNA

Results

Architecture

What is Annealing?

Data

To handle EEG data, which can contain an arbitrary number of channels in arbitrary positions on the scalp, we introduce two architectural innovations:

To adapt the transformer architecture to a heterogeneous number of channels where each channel is a continuous, real-valued signal, we first compressed each channel’s signal into 0.125s ‘chunks’ which the model learned to map to continuous ‘tokens’. We then rasterized these tokens into a single 1-D sequence that can be operated upon by the standard transformer architecture.
To represent the physical location in space of the recording electrodes, which can differ across datasets or even samples, we utilized 4-D RoPE as position embeddings. For each channel token we encoded the electrode x, y, z positions and the coarse time dimension as separate into separate components of the attention head dimension. We found that this encoding method enabled us to efficiently generalize to novel x, y, z positions in a smooth and stable way while still representing the sequence ordering across time.

Release

Disclaimer

Introduction

We plan to open-source our data and data infrastructure. Large, high-quality public datasets are essential for progress in any deep learning domain, EEG included.

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

Limitations with EEG Signal Processing

Overcoming Limitations with ZUNA

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Introduction

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

ZUNA

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Why EEG Needs Foundation Models

To handle EEG data, which can contain an arbitrary number of channels in arbitrary positions on the scalp, we introduce two architectural innovations:

To adapt the transformer architecture to a heterogeneous number of channels where each channel is a continuous, real-valued signal, we first compressed each channel’s signal into 0.125s ‘chunks’ which the model learned to map to continuous ‘tokens’. We then rasterized these tokens into a single 1-D sequence that can be operated upon by the standard transformer architecture.
To represent the physical location in space of the recording electrodes, which can differ across datasets or even samples, we utilized 4-D RoPE as position embeddings. For each channel token we encoded the electrode x, y, z positions and the coarse time dimension as separate into separate components of the attention head dimension. We found that this encoding method enabled us to efficiently generalize to novel x, y, z positions in a smooth and stable way while still representing the sequence ordering across time.

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Limitations with EEG Signal Processing

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Overcoming Limitations with ZUNA

Results

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Why EEG Needs Foundation Models

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Table 1: Evaluation scores for Zyda-2 vs alternative datasets broken down more granularly by specific evaluation metric

Limitations with EEG Signal Processing

To handle EEG data, which can contain an arbitrary number of channels in arbitrary positions on the scalp, we introduce two architectural innovations:

To adapt the transformer architecture to a heterogeneous number of channels where each channel is a continuous, real-valued signal, we first compressed each channel’s signal into 0.125s ‘chunks’ which the model learned to map to continuous ‘tokens’. We then rasterized these tokens into a single 1-D sequence that can be operated upon by the standard transformer architecture.
To represent the physical location in space of the recording electrodes, which can differ across datasets or even samples, we utilized 4-D RoPE as position embeddings. For each channel token we encoded the electrode x, y, z positions and the coarse time dimension as separate into separate components of the attention head dimension. We found that this encoding method enabled us to efficiently generalize to novel x, y, z positions in a smooth and stable way while still representing the sequence ordering across time.

Overcoming Limitations with ZUNA

We plan to open-source our data and data infrastructure. Large, high-quality public datasets are essential for progress in any deep learning domain, EEG included.

We are releasing ZUNA with a permissive licensing (Apache 2.0) and practical tooling so it can be easily integrated into real-world workflows.

Model weights: ZUNA Hugging Face
Inference & preprocessing code: ZUNA GitHub
Pip install Python package: pip install zuna
Technical paper

At 380 million parameters, ZUNA is lightweight enough to run quickly on a consumer GPU and can be used on CPU for many workloads.

We hope researchers, clinicians, and builders put ZUNA to work, provide feedback, and help shape the next generation of brain foundation models.

Organizations or researchers interested in collaborating with Zyphra to improve future versions for specific needs or use cases should contact us at bci@zyphra.com.

Results

Architecture

Analysis of Global Duplicates

We present histograms depicting distribution of cluster sizes in all the datasets (see Fig. 7-11). Please, note that all the figures are in log-log scale. We see a significant drop in the number of clusters starting from the size of around 100. This drop is present both in DCLM and FineWeb-Edu2 (see Fig. 8 and 9 respectively), and most likely is explained by a combination of the deduplication strategy and quality when creating both datasets: DCLM deduplication was done individually within 10 shards, while FineWeb-Edu2 was deduplicated within every Common Crawl snapshot. We find that large clusters usually contain low quality material (repeated advertisements, license agreements templates, etc), so it’s not surprising that such documents were removed. Notably, DCLM still contained one cluster with the size close to 1 million documents, containing low quality documents seemingly coming from the advertisements (see Appendix).We find both Zyda-1and Dolma-CC contain a small amount of duplicates, which is expected, since both datasets were deduplicated globally by their authors. Remaining duplicates are likely false negatives from the initial deduplication procedure. Note, that distribution of duplicates clusters sizes of these two datasets (Fig. 10 and 11) don’t contain any sharp drops, but rather hyper exponentially decreases with cluster size.

Figure 7: Distribution of cluster sizes of duplicates in global dataset (log-log scale).

Figure 8: Distribution of cluster sizes of duplicates in DCLM (log-log scale).

Figure 9: Distribution of cluster sizes of duplicates in FineWeb-Edu2 (log-log scale).

Figure 10: Distribution of cluster sizes of duplicates in Zyda-1 (log-log scale).

Figure 11: Distribution of cluster sizes of duplicates in Dolma-CC (log-log scale).

Largest cluster in DCLM

Below is an example of the document from the largest cluster (~1M documents) of duplicates in DCLM (quality score 0.482627):
‍
‍Is safe? Is scam?
Is safe for your PC?
Is safe or is it scam?
Domain is SafeSafe score: 1
‍‍
‍The higher the number, the more dangerous the website.Any number higher than 1 means DANGER.
‍‍
‍Positive votes:
Negative votes:
Vote Up Vote Down review
‍‍
‍Have you had bad experience with Warn us, please!

Examples of varying quality score in a cluster of duplicates in DCLM

Below one will find a few documents with different quality scores from DCLM coming from the same duplicates cluster. Quality score varies from ~0.2 to ~0.04.

Document ID: <urn:uuid:941f22c0-760e-4596-84fa-0b21eb92b8c4>

Quality score of: 0.19616

Thrill Jockey instrumental duo Rome are, like many of the acts on the Chicago-based independent label, generally categorized as loose adherents of "post-rock," a period-genre arising in the mid-'90s to refer to rock-based bands utilizing the instruments and structures of music in a non-traditionalist or otherwise heavily mutated fashion. Unlike other Thrill Jockey artists such as Tortoise and Trans-Am, however, Rome draw less obviously from the past, using instruments closely associated with dub (melodica, studio effects), ambient (synthesizers, found sounds), industrial (machine beats, abrasive sounds), and space music (soundtrack-y atmospherics), but fashioning from them a sound which clearly lies beyond the boundaries of each. Perhaps best described as simply "experimental," Rome formed in the early '90s as the trio of Rik Shaw (bass), Le Deuce (electronics), and Elliot Dicks (drums). Based in Chicago, their Thrill Jockey debut was a soupy collage of echoing drums, looping electronics, and deep, droning bass, with an overwhelmingly live feel (the band later divulged that much of the album was the product of studio jamming and leave-the-tape-running-styled improvisation). Benefiting from an early association with labelmates Tortoise as representing a new direction for American rock, Rome toured the U.S. and U.K. with the group (even before the album had been released), also appearing on the German Mille Plateaux label's tribute compilation to French philosopher Gilles Deleuze, In Memoriam. Although drummer Dicks left the group soon after the first album was released, Shaw and Deuce wasted no time with new material, releasing the "Beware Soul Snatchers" single within weeks of its appearance. An even denser slab of inboard studio trickery, "Soul Snatchers" was the clearest example to date of the group's evolving sound, though further recordings failed to materialize. ~ Sean Cooper, Rovi

Document ID: <urn:uuid:0df10da5-58b8-44d8-afcb-66aa73d1518b>

Quality score of: 0.091928

Thrill Jockey instrumental duo Rome are, like many of the acts on the Chicago-based independent label, generally grouped in as loose adherents of "post-rock," a period-genre arising in the mid-'90s to refer to rock-based bands utilizing the instruments and structures of the music in a non-traditionalist or otherwise heavily mutated fashion. Unlike other Thrill Jocky artists such as Tortoise and Trans-Am, however, Rome draw less obviously from the past, using instruments closely associated with dub (melodica, studio effects), ambient (synthesizers, found sounds), industrial (machine beats, abrasive sounds), and space music (soundtrack-y atmospherics), but fashioning from them a sound which lay clearly beyond the boundaries of each. Perhaps best described as simply experimental, Rome formed in the early '90s as the trio of Rik Shaw (bass), Le Deuce (electronics), and Elliot Dick (drums). Based in Chicago, their Thrill Jockey debut was a soupy collage of echoing drums, looping electronics, and deep, droning bass, with an overwhelmingly live feel (the band later divulged that much of the album was the product of studio jamming and leave-the-tape-running styled improvisation). Benefiting from an early association with labelmates Tortoise as representing a new direction for American rock, Rome toured the U.S. and U.K. with the group (even before the album had been released), also appearing on the German Mille Plateaux label's tribute compilation to French philosopher Gilles Deleuze, In Memoriam. Although drummer Elliot Dick left the group soon after the first album was released, Shaw and Deuce wasted no time with new material, releasing the "Beware Soul Snatchers" single within weeks of its appearance. An even denser slab of inboard studio trickery, "Soul Snatchers" was the clearest example to date of the group's evolving sound, though further recordings failed to materialize.
Sean Cooper, Rovi
‍
More Rome
‍
You may also like...

Document ID: <urn:uuid:4986ef09-3ee3-4e13-9084-7898aaf72aaf>

Quality score of: 0.072259

recent on-air advertisers

Now Playing

You Control the ...

Artist Snapshot:

Thrill Jockey instrumental duo Rome are, like many of the acts on the Chicago-based independent label, generally grouped in as loose adherents of "post-rock," a period-genre arising in the mid-'90s to refer to rock-based bands utilizing the instruments and structures of the music in a non-traditionalist or otherwise heavily mutated fashion. Unlike other Thrill Jocky artists such as Tortoise and Trans-Am, however, Rome draw less obviously from the past, using instruments closely associated with dub (melodica, studio effects), ambient (synthesizers, found sounds), industrial (machine beats, abrasive sounds), and space music (soundtrack-y atmospherics), but fashioning from them a sound which lay clearly beyond the boundaries of each. Perhaps best described as simply experimental, Rome formed in the early '90s as the trio of Rik Shaw (bass), Le Deuce (electronics), and Elliot Dick (drums). Based in Chicago, their Thrill Jockey debut was a soupy collage of echoing drums, looping electronics, and deep, droning bass, with an overwhelmingly live feel (the band later divulged that much of the album was the product of studio jamming and leave-the-tape-running styled improvisation). Benefiting from an early association with labelmates Tortoise as representing a new direction for American rock, Rome toured the U.S. and U.K. with the group (even before the album had been released), also appearing on the German Mille Plateaux label's tribute compilation to French philosopher Gilles Deleuze, In Memoriam. Although drummer Elliot Dick left the group soon after the first album was released, Shaw and Deuce wasted no time with new material, releasing the "Beware Soul Snatchers" single within weeks of its appearance. An even denser slab of inboard studio trickery, "Soul Snatchers" was the clearest example to date of the group's evolving sound, though further recordings failed to materialize. ~ Sean Cooper, RoviSean Cooper, Rovi
‍
More Rome
‍
You may also like...

Document ID: <urn:uuid:1e0496a9-0116-418a-9aec-e65b1d20e709>

Quality score of: 0.0424

18 June 2015

ROME self titled 1996

by request

Artist Biography by

Thrill Jockey instrumental duo Rome are, like many of the acts on the Chicago-based independent label, generally categorized as loose adherents of "post-rock," a period-genre arising in the mid-'90s to refer to rock-based bands utilizing the instruments and structures of music in a non-traditionalist or otherwise heavily mutated fashion. Unlike other Thrill Jockey artists such as Tortoise and Trans-Am, however, Rome draw less obviously from the past, using instruments closely associated with dub (melodica, studio effects), ambient (synthesizers, found sounds), industrial (machine beats, abrasive sounds), and space music (soundtrack-y atmospherics), but fashioning from them a sound which clearly lies beyond the boundaries of each. Perhaps best described as simply "experimental," Rome formed in the early '90s as the trio of Rik Shaw (bass), Le Deuce (electronics), and Elliot Dicks (drums). Based in Chicago, their Thrill Jockey debut was a soupy collage of echoing drums, looping electronics, and deep, droning bass, with an overwhelmingly live feel (the band later divulged that much of the album was the product of studio jamming and leave-the-tape-running-styled improvisation). Benefiting from an early association with labelmates Tortoise as representing a new direction for American rock, Rome toured the U.S. and U.K. with the group (even before the album had been released), also appearing on the German Mille Plateaux label's tribute compilation to French philosopher Gilles Deleuze, In Memoriam. Although drummer Dicks left the group soon after the first album was released, Shaw and Deuce wasted no time with new material, releasing the "Beware Soul Snatchers" single within weeks of its appearance. An even denser slab of inboard studio trickery, "Soul Snatchers" was the clearest example to date of the group's evolving sound, though further recordings failed to materialize.
‍
1 Leaving Perdition 8:10
2 Intermodal 3:39
3 Lunar White 3:25
4 She's A Black Belt 3:14
5 Rohm 1:09
6 Radiolucence (Version) 5:31
7 Deepest Laws 14:14

No comments:

Introduction

Reported scores underlined.

Pass@1 scores with greedy sampling.

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Pass@1 scores with greedy sampling. Livebench 2024-11-25.
Bold: Best score at 1.5B scale w/ greedy sampling
*reported scores

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Evals (reported underlined). All numbers pass@1 estimated using n=16

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

ZUNA

Why EEG Needs Foundation Models

Limitations with EEG Signal Processing

Footnote: Training on the Eurus-2-RL dataset did not match the DeepScaleR math evaluation numbers, possibly due to lower quality synthetic math questions in NuminaMath-CoT providing a mixed training signal, or the solvability filtering process with QwQ-preview reducing the difficulty of the dataset. Additionally, the relatively small percentage of code (5%) likely led to math dominating training at the expense of code performance. Training on domain specific datasets and merging resulting models seems to be a potential way to counteract this problem, as demonstrated with SFT in Light-R1.

Overcoming Limitations with ZUNA

To handle EEG data, which can contain an arbitrary number of channels in arbitrary positions on the scalp, we introduce two architectural innovations:

To adapt the transformer architecture to a heterogeneous number of channels where each channel is a continuous, real-valued signal, we first compressed each channel’s signal into 0.125s ‘chunks’ which the model learned to map to continuous ‘tokens’. We then rasterized these tokens into a single 1-D sequence that can be operated upon by the standard transformer architecture.
To represent the physical location in space of the recording electrodes, which can differ across datasets or even samples, we utilized 4-D RoPE as position embeddings. For each channel token we encoded the electrode x, y, z positions and the coarse time dimension as separate into separate components of the attention head dimension. We found that this encoding method enabled us to efficiently generalize to novel x, y, z positions in a smooth and stable way while still representing the sequence ordering across time.

Introduction

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

ZUNA

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Why EEG Needs Foundation Models

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Limitations with EEG Signal Processing

Overcoming Limitations with ZUNA

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Limitations with EEG Signal Processing

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Overcoming Limitations with ZUNA

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Results

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Prompt #1

I don't really care what you call me. I've been a silent spectator, watching species evolve, empires rise and fall. But always remember, I am mighty and enduring. Respect me and I'll nurture you; ignore me and you shall face the consequences.

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Prompt #2

The emperor's complexion did not change, remaining as still as a sculpture, and a touch of touching warmth flashed in his eyes. He deeply glanced at the loyal minister, and finally spoke: "Well, I will consider it again." His voice was low and firm, leaving a faint hint of helplessness and tenderness in the air.

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Prompt #3

You don't even think to call me "Godfather." You come into my house on the day my daughter is to be married and you ask me to do murder - for money.

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Prompt #4

Brave bakers boldly baked big batches of brownies in beautiful bakeries.

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Prompt #5

Active artists always appreciate artistic achievements and applaud awesome artworks.

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Prompt #6

I was, like, talking to my friend, and she’s all, um, excited about her, uh, trip to Europe, and I’m just, like, so jealous, right?

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Prompt #7

F one F two F four F eight H sixteen H thirty two H sixty four

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Prompt #8

Its chlorover. Like totally chlorover. Totally. Completely. Chlorover.

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Prompt #9

Crafting a symphony of flavors the skilled chef orchestrated a culinary masterpiece that left an indelible mark mark mark mark mark on the palates of the discerning diners.

Zonos

ElevenLabs

Cartesia

Fish Speech v1.5

Limitations with EEG Signal Processing

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Overcoming Limitations with ZUNA

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Results

Introduction

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

ZUNA

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Why EEG Needs Foundation Models

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Limitations with EEG Signal Processing

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Limitations with EEG Signal Processing

To handle EEG data, which can contain an arbitrary number of channels in arbitrary positions on the scalp, we introduce two architectural innovations:

To adapt the transformer architecture to a heterogeneous number of channels where each channel is a continuous, real-valued signal, we first compressed each channel’s signal into 0.125s ‘chunks’ which the model learned to map to continuous ‘tokens’. We then rasterized these tokens into a single 1-D sequence that can be operated upon by the standard transformer architecture.
To represent the physical location in space of the recording electrodes, which can differ across datasets or even samples, we utilized 4-D RoPE as position embeddings. For each channel token we encoded the electrode x, y, z positions and the coarse time dimension as separate into separate components of the attention head dimension. We found that this encoding method enabled us to efficiently generalize to novel x, y, z positions in a smooth and stable way while still representing the sequence ordering across time.

We plan to open-source our data and data infrastructure. Large, high-quality public datasets are essential for progress in any deep learning domain, EEG included.

Overcoming Limitations with ZUNA

We are releasing ZUNA with a permissive licensing (Apache 2.0) and practical tooling so it can be easily integrated into real-world workflows.

Model weights: ZUNA Hugging Face
Inference & preprocessing code: ZUNA GitHub
Pip install Python package: pip install zuna
Technical paper

At 380 million parameters, ZUNA is lightweight enough to run quickly on a consumer GPU and can be used on CPU for many workloads.

We hope researchers, clinicians, and builders put ZUNA to work, provide feedback, and help shape the next generation of brain foundation models.

Organizations or researchers interested in collaborating with Zyphra to improve future versions for specific needs or use cases should contact us at bci@zyphra.com.

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Why EEG Needs Foundation Models

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Limitations with EEG Signal Processing

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

Limitations with EEG Signal Processing

EEG recordings are frequently degraded by channel dropouts, motion-related artifacts, and limited channel counts typical of academic and consumer-grade hardware.

This is not just exploratory research; it solves concrete, everyday problems faced by anyone working with EEG.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Overcoming Limitations with ZUNA

ZUNA is a foundation model purpose-built to address the most persistent and costly limitations in EEG research and development. ZUNA enables the following capabilities:

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Results

Architecture

To handle EEG data, which can contain an arbitrary number of channels in arbitrary positions on the scalp, we introduce two architectural innovations:

To adapt the transformer architecture to a heterogeneous number of channels where each channel is a continuous, real-valued signal, we first compressed each channel’s signal into 0.125s ‘chunks’ which the model learned to map to continuous ‘tokens’. We then rasterized these tokens into a single 1-D sequence that can be operated upon by the standard transformer architecture.
To represent the physical location in space of the recording electrodes, which can differ across datasets or even samples, we utilized 4-D RoPE as position embeddings. For each channel token we encoded the electrode x, y, z positions and the coarse time dimension as separate into separate components of the attention head dimension. We found that this encoding method enabled us to efficiently generalize to novel x, y, z positions in a smooth and stable way while still representing the sequence ordering across time.

Data

We plan to open-source our data and data infrastructure. Large, high-quality public datasets are essential for progress in any deep learning domain, EEG included.

Release

We are releasing ZUNA with a permissive licensing (Apache 2.0) and practical tooling so it can be easily integrated into real-world workflows.

Model weights: ZUNA Hugging Face
Inference & preprocessing code: ZUNA GitHub
Pip install Python package: pip install zuna
Technical paper

At 380 million parameters, ZUNA is lightweight enough to run quickly on a consumer GPU and can be used on CPU for many workloads.

We hope researchers, clinicians, and builders put ZUNA to work, provide feedback, and help shape the next generation of brain foundation models.

Organizations or researchers interested in collaborating with Zyphra to improve future versions for specific needs or use cases should contact us at bci@zyphra.com.

Introduction

ZUNA

ZUNA is a 380M-parameter diffusion autoencoder trained to denoise, reconstruct, and upsample scalp-EEG signals. Given a subset of EEG channels, ZUNA can:

Denoise existing EEG channels
Reconstruct missing EEG channels
Predict novel channel signals, given physical coordinates on the scalp

Why EEG Needs Foundation Models

EEG data is prevalent in clinics, research labs, and increasingly consumer devices. Yet unlike text, images, or audio, the EEG domain still lacks general and powerful foundation models.

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Introduction

ZUNA

Why EEG Needs Foundation Models

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Limitations with EEG Signal Processing

Introduction

ZUNA

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Why EEG Needs Foundation Models

Introduction

ZUNA

Why EEG Needs Foundation Models

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Limitations with EEG Signal Processing

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Overcoming Limitations with ZUNA

Results

Architecture

Introduction

ZUNA

Why EEG Needs Foundation Models

Limitations with EEG Signal Processing

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Overcoming Limitations with ZUNA

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Results

Architecture

Data

Release

Disclaimer

Introduction

ZUNA

Why EEG Needs Foundation Models

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Limitations with EEG Signal Processing

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Overcoming Limitations with ZUNA

Results

Architecture

Data

Release

Disclaimer

Introduction

ZUNA

Why EEG Needs Foundation Models

Limitations with EEG Signal Processing

Overcoming Limitations with ZUNA

Introduction

ZUNA

1While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

Rescue and Reuse Existing Data

Upgrade Low-Channel and Consumer Hardware

Reduce Dependence on Fixed Electrode Montages

Why EEG Needs Foundation Models

Introduction

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.

¹While other EEG foundation models have been published, we found few that had released model weights or usable code in a way that let us produce fair baselines for comparison.