sigmoid.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A social space for people researching, working with, or just interested in AI!

Server stats:

575
active users

#datafusion

0 posts0 participants0 posts today

I wrote a blog post on how to write user defined functions in Apache DataFusion. This includes how you can write Rust backed Python functions that operate at full native speed with zero copy operations of the data structures. Switching from pure python functions to these types of UDFs can lead to 10x speed improvements.

datafusion.apache.org/blog/202

Apache DataFusion Project News & Blog · Comparing approaches to User Defined Functions in Apache DataFusion using Python<!–

@dom @maxroser @ourworldindata

If you folks are not yet familiar with dynamicland.org's work take a look at this work (focus on 24-25mins) from @bret (in 2018)

I can see this approach having multiple synergies for the work you do.

For example in helping engage people with data and help them explore it (the point made at 25m), as well as provide a powerful way for academics and professionals to work with, integrate (#DataFusion) and understand the meaning of data.

youtu.be/cErKuEHWCpM

I recently worked on an update to the Apache DataFusion project. The goal of this project is to provide a fast, modular approach to building large scale data processing.

The update adds in significant improvements to the python interface, to the point where I would now recommend giving it a try to people who haven't used it or had success.

I wrote up a blog post about the changes, hosted on the Apache DataFusion site: datafusion.apache.org/blog/202

Apache DataFusion Project News & Blog · Apache DataFusion Python 40.1.0 Released, Significant usability updates<!–

I put in an open source PR today!

I've been playing around with Apache #DataFusion and wanted to see how it performed against a real life non-trivial problem I have. I ran into a blocker in that one of the basic functions that is exposed in the rust code underneath isn't exposed in python, so I just wrote it myself and put up a PR. I've never really been much of an open source contributor, so looking forward to seeing how this goes.

Realized that I botched my #introduction (#neuhier halt)

Doing a #PhD in #biostatistics in #switzerland, focusing on experimental #design, small sample #statistics and #datafusion in #animalresearch.

Managing a #grassroots #thinktank called #Reatch! #Research. #Think. #Change.

Engaged in #scicomm, #sciencedialogue, #science and #politics relations.

Interested in #sts, #epistemology of #statistics and the use of #statistics and #science in #democracy and #democratic discourse.