
lisa.datautils: Speed up the conversion from strings to bytes

Use the pandas categorical dtype as an intermediate type to deduplicate the strings before decoding them. Since decoding is expensive, decoding each unique value only once is faster, and the result also uses less memory because the strings are deduplicated.
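
A minimal sketch of the idea, not the actual `lisa.datautils` code: it assumes the input is a pandas `Series` of `bytes` objects to be decoded into `str` (the title and description name opposite directions, so the direction is a guess). The helper name `decode_via_categorical` and the `encoding` parameter are hypothetical.

```python
import pandas as pd


def decode_via_categorical(series: pd.Series, encoding: str = "utf-8") -> pd.Series:
    """Hypothetical helper: decode a Series of bytes, once per unique value."""
    # Converting to the categorical dtype deduplicates the values: each
    # distinct bytes object is stored once in `.cat.categories`, and the
    # rows only hold small integer codes pointing at those categories.
    cat = series.astype("category")

    # Decode each unique value exactly once instead of once per row.
    decoded = [value.decode(encoding) for value in cat.cat.categories]

    # Swap the bytes categories for their decoded counterparts. The
    # per-row codes are left untouched, so the result stays deduplicated.
    return cat.cat.rename_categories(decoded)


raw = pd.Series([b"foo", b"bar", b"foo", b"foo"])
result = decode_via_categorical(raw)
```

The result is left as a categorical `Series` here, which is what keeps the memory footprint low. Whether the real change keeps the categorical dtype or converts back to plain objects is not stated in the description.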
