logoalt Hacker News

rudhdb773btoday at 4:00 AM2 repliesview on HN

If the focus is performance, why use a separate process and have to deal with data serialization overhead?

Why not a typical shared library that can be loaded in python, R, Julia, etc., and run on large data sets without even a memory copy?


Replies

bob001today at 6:15 AM

This lets you not even need Python, r, Julia, etc but directly connect to your backend systems that are presumably in a fast language. If Python is in your call stack then you already don’t care about absolute performance.

show 1 reply
sriram_malhartoday at 4:06 AM

Perhaps because the performance is good enough and this approach is much simpler and portable than shared libraries across platforms.

show 1 reply