Hey guys, this is my github repo. Glad it's received some interest - I figured HN might be the culprit when it suddenly jumped ~100 stars despite not working on the code base since last year. I prototyped this out of personal curiosity last year and moved on abruptly so there's a lot of gaps I still need to close and knobs that need to be optimized. But if people genuinely find "deterministic genomics workloads on edge devices" proposal useful, I'll begin refining the code tonight and try to make it as useful as possible. If you have any particular bioinformatics tasks or use cases that you want to be feasible on edge devices, lmk and I'll work on integrating new capabilities. Always happy to be helpful
There has been a bit of a 'trend' to rewrite common bioinformatics/comp-bio into faster languages (Rust) via LLMs, OP's repo seems to be an early example.
Seqera Labs has a bit of a manifesto: https://rewrites.bio/
Heng Li has an overview here too: https://lh3.github.io/2026/04/17/the-ai-rewrite-dilemma
IMHO it's... OK? Bioinformatics code quality is generally poor, untrained biologists writing functioning code that is poor in scoping, but works. (Unguided) LLMs write on that level, too, so not much harm done.
This is interesting; thanks for sharing! I have been curious about the adoption of Rust in computational biology. I know that the folks at Saint Jude's [1] are also using Rust for their 'omics research.
Those are all the tests for alignment. They don't even check the alignment is correct. Just that there are no errors. This is a joke: https://github.com/logannye/rosalind/blob/main/tests/alignme...
Looks like total slop to me. All code in one commit, then a bunch of commits polishing the Readme.
No release, no updates in half a year.
Looking at the commenting pattern, it seems like AI unfortunately
bioinformaticians have been making these useless bioinformatic-toolkit-in-my-favorite-programming-language repos for years
> A deterministic genomics engine with a compact memory footprint.
Uhh... are there stochastic genomics pipelines?
I would love to hear about what the sacrifices are, but this project really looks amazing.
Didn't see a publication or preprint for this - is there one?
[flagged]
[dead]
Realistically, without data from a large testset that compares this thoroughly to Samtools (and others?), I wouldn't touch this.
Note to the OP: specify a focus please? short, long, mega-long read and bacterial, human, small plant or large plant genome? Alignment heuristics and performance differ significantly across those axes.