**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

Do you use #neuropixels or #highdensity probes? Are your recordings filling up your hard drives?
We got you covered!

In the first preprint from
@AllenInstitute
for Neural Dynamics, we looked at ways to reduce the footprint of #ephys data.

https://www.biorxiv.org/content/10.1101/2023.05.22.541700v2

(1/n)

**Alessio Buccino** @buccino_alessio@qoto.org · 2023-06-05T11:27:40Z

Alessio Buccino @buccino_alessio@qoto.org

We developed a framework based on @zarr_dev to benchmark lossless and lossy compression of #Neuropixels and similar data. The benchmark datasets included NP1 and NP2 recordings, available on Registry of Open Data on
@AWS

https://registry.opendata.aws/allen-nd-ephys-compression/

(2/n)

Allen Institute for Neural Dynamics - Extracellular Electrophysiology Compression Benchmark - Registry of Open Data on AWS

registry.opendata.aws

June 5, 2023 at 11:27 AM · · · ·

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

We started with #LosslessCompression. Across a range of general-purpose (GP) compressors, we found that #Zstandard with
@Blosc2 achieves the best compromise between compression ratio and decompression speed!

NP1: compressed size ~36%
NP2: compressed size ~52%

(3/n)

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

#ephys and #audio signals are very similar! We therefore added #flac and #wavpack to the game...and they performed even better than GP codecs!

With #wavpack:
NP1: compressed size ~28%
NP2: compressed size ~44%

(4/n)

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

We then investigated two #LossyCompression strategies: bit truncation and WavPack Hybrid mode. Lossy compression can dramatically boost compression performance, but we must first assess how it affects downstream analysis (i.e., spike sorting).

(5/n)

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

Using simulated data with known ground truth spike times, we used #Kilosort 2.5 to evaluate spike sorting performance. WavPack Hybrid does not affect spike sorting accuracy, even at maximum compression levels (~14% file size).

(6/n)

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

We repeated spike sorting on experimental data, this time counting the number of units passing or failing quality control (QC). Again, we observed minimal changes in the results when using WavPack Hybrid.

(7/n)

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

Waveform shapes are also important for downstream analysis, e.g., cell-type classification. On simulated data, we found that WavPack Hybrid nicely preserves three commonly used waveform features.

(8/n)

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

WavPack Hybrid is promising, but we observed subtle differences in spike trains before and after compression. We need better methods for comparing spike sorting results to make sure we’re not losing any critical info. Until then, we’ll be using lossless compression.

(9/n)

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

Do you want to apply compression right away? All compression options can be readily deployed with a few lines using
@spikeinterface
, so you can easily try them out of your own data! This will also make it easy to benchmark new compression methods as they become available.

(10/n)

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

At @AllenInstitute
for Neural Dynamics we value fairness and reproducibility in science. All figures of the manuscript can be reproduced with
@codeocean:

https://codeocean.com/capsule/3822095/tree/v1

(11/n)

Code Ocean

codeocean.com

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

**Alessio Buccino** @buccino_alessio@qoto.org · Jun 5, 2023

Jun 5, 2023

Alessio Buccino @buccino_alessio@qoto.org

Finally, kudos to all co-authors!
Olivier Winter, David Bryant, David Feng, @svoboda314 and Josh Siegle, and thanks to
@alleninstitute for sponsoring this work!

(12/n)

**Chris Rodgers** @chrisXrodgers@neuromatch.social · Jun 5, 2023

**Chris Rodgers** @chrisXrodgers@neuromatch.social · Jun 5, 2023

Jun 5, 2023

Chris Rodgers @chrisXrodgers@neuromatch.social

@buccino_alessio I always wondered if ephys and (human) audio signals are similar for some deep reason or just coincidence. I guess they are both roughly 1/f with about 20 kHz bandwidth

Trending now

Resources

Developers

What is Mastodon?

qoto.org

More…