Do you use #neuropixels or #highdensity probes? Are your recordings filling up your hard drives?
We got you covered!
In the first preprint from
@AllenInstitute
for Neural Dynamics, we looked at ways to reduce the footprint of #ephys data.
https://www.biorxiv.org/content/10.1101/2023.05.22.541700v2
(1/n)
We developed a framework based on @zarr_dev to benchmark lossless and lossy compression of #Neuropixels and similar data. The benchmark datasets included NP1 and NP2 recordings, available on Registry of Open Data on
@AWS
https://registry.opendata.aws/allen-nd-ephys-compression/
(2/n)
We started with #LosslessCompression. Across a range of general-purpose (GP) compressors, we found that #Zstandard with
@Blosc2 achieves the best compromise between compression ratio and decompression speed!
NP1: compressed size ~36%
NP2: compressed size ~52%
(3/n)
We then investigated two #LossyCompression strategies: bit truncation and WavPack Hybrid mode. Lossy compression can dramatically boost compression performance, but we must first assess how it affects downstream analysis (i.e., spike sorting).
(5/n)
Using simulated data with known ground truth spike times, we used #Kilosort 2.5 to evaluate spike sorting performance. WavPack Hybrid does not affect spike sorting accuracy, even at maximum compression levels (~14% file size).
(6/n)
Finally, kudos to all co-authors!
Olivier Winter, David Bryant, David Feng, @svoboda314 and Josh Siegle, and thanks to
@alleninstitute for sponsoring this work!
(12/n)