If the focus is performance, why use a separate process and have to deal with data serialization overhead?
Why not a typical shared library that can be loaded in python, R, Julia, etc., and run on large data sets without even a memory copy?
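For concreteness, the zero-copy shared-library approach might look roughly like this in Python via ctypes — a minimal sketch using libc's `memchr` as a stand-in for a real analytics library (the library scans the NumPy buffer in place; nothing is serialized or copied):

```python
import ctypes
import numpy as np

# On POSIX, CDLL(None) exposes the process's global symbols, including libc.
libc = ctypes.CDLL(None)
# memchr(const void *s, int c, size_t n) scans the buffer in place.
libc.memchr.restype = ctypes.c_void_p
libc.memchr.argtypes = [ctypes.c_void_p, ctypes.c_int, ctypes.c_size_t]

a = np.zeros(1_000_000, dtype=np.uint8)
a[123_456] = 7

# Pass the raw address of the NumPy buffer: no serialization, no memory copy.
hit = libc.memchr(a.ctypes.data, 7, a.size)
offset = hit - a.ctypes.data  # byte offset where libc found the value
```

A real engine shipped as a shared library works the same way: the host language hands over a pointer and the library computes directly on that memory.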
Perhaps because the performance is good enough, and this approach is much simpler and more portable across platforms than shipping shared libraries.
This way you don't even need Python, R, Julia, etc., but can connect directly to your backend systems, which are presumably written in a fast language. If Python is in your call stack, you already don't care about absolute performance.