sigmoid.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A social space for people researching, working with, or just interested in AI!

#bigdata

8 posts · 8 participants · 0 posts today

Self-censorship because of Drumpf

This is how far things have gone. A renowned IT expert, a US citizen, is deleting all of his antisocial and social media presences so that he can enter the USA without being hassled! Marc Pesce normally lives and works abroad a great deal, especially in Australia, where he holds a visiting professorship. Now he wanted to travel to the USA again. To spare himself trouble at border control, he "cleaned up" his entire online presence.

pc-fluesterer.info/wordpress/2


Criminal organization Twi-X

In France, at least, Twi-X has been charged as a criminal organization (the term used there: organized gang). The two allegations are

manipulation of public opinion (for the purpose of influencing elections) and
data collection.

Mind you, this is a criminal investigation, not a toothless tiger like the DMA or DSA.

pc-fluesterer.info/wordpress/2


Can anyone recommend good reads on how astronomical #bigdata is being managed, given the massive volumes coming in from, e.g. radio telescopes? Looking for mid-range accessibility, with some technical depth, but not specific to any one telescope or pipeline - more surveys/summaries of the kinds of architectural patterns / platforms, storage & retrieval tools. TIA & please boost for reach!

Months after Elon Musk's DOGE crusade to wipe it out, LTO tape storage is bigger than ever. It is tempting to write off tape storage as a relic of a bygone era, but its recent performance says otherwise. With 176.5 exabytes shipped in 2024 (up 15.4% year over year), LTO tape is carving out a strategic niche in the surge of AI and ML workloads.

TL;DR
⚙️ Record 176.5 EB shipped in 2024
🔐 Tape offers cost‑effective, long‑term durability
📈 Fueled by AI/ML archival demand
🚀 LTO roadmap plans up to 576 TB per cartridge

tomshardware.com/pc-components
#tapestorage #AI #ML #datainfrastructure #security #privacy #cloud #infosec #cybersecurity #datastorage #bigdata #data #storagemanagement #storage

Tom's Hardware · Months after Elon Musk's DOGE crusade to wipe it out, LTO tape storage is bigger than ever — a record 176.5 exabytes shipped in 2024, the fourth consecutive year of growth · By Mark Tyson

The first category Netflix is showing me today is “US TV Shows Dubbed in German.”

I’m in the French part of Switzerland, and 99% of what I watch on Netflix I watch in its original language, even if it’s not one that I speak, with English subtitles.

But tell me again how big data is the future and gives companies power.

Twi-X misuses users' sensitive data

Several NGOs have filed complaints with various supervisory authorities, alleging that Twi-X violates the GDPR and the DSA. The latter prohibits using information about a person's political views or health for targeted advertising […]

pc-fluesterer.info/wordpress/2


PsyPost: Secret changes to major U.S. health datasets raise alarms. “A new study in the medical journal The Lancet reports that more than 100 United States government health datasets were altered this spring without any public notice. The investigation shows that nearly half of the files examined underwent wording changes while leaving the official change logs blank. The authors warn that […]

https://rbfirehose.com/2025/07/19/psypost-secret-changes-to-major-u-s-health-datasets-raise-alarms/


The Conversation: Vanishing data in the U.S. undermines good public policy, with global implications. “As researchers focused on data management (Kristi) and behavioural sciences (Albert) and whose work tackles the significance of research with open access data, we have been concerned about how the data sets that scholars around the world rely on have been vanishing from U.S. government […]

https://rbfirehose.com/2025/07/19/the-conversation-vanishing-data-in-the-u-s-undermines-good-public-policy-with-global-implications/


🗣️ Announcing Python-Blosc2 3.6.1

Unlock new levels of data manipulation with Blosc2! 🚀

We've introduced a major improvement: powerful fancy indexing and orthogonal indexing for Blosc2 arrays.

We've tamed the complexity of fancy indexing to make it intuitive, efficient, and consistent with NumPy's behavior. 💪

Read all about it on our blog! 📝 blosc.org/posts/blosc2-fancy-i

Compress Better, Compute Bigger!
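
For readers unfamiliar with the two indexing styles the Blosc2 announcement mentions, here is a minimal sketch of the difference. It uses plain NumPy only, since the post says Blosc2 aims to be consistent with NumPy's behavior; it does not show Blosc2's own API, and the array and index names are made up for illustration.

```python
import numpy as np

# A small 3x4 array with values 0..11 (illustrative data only).
a = np.arange(12).reshape(3, 4)
rows = [0, 2]
cols = [1, 3]

# Fancy (pointwise) indexing: the index lists are paired element by element,
# so this selects a[0, 1] and a[2, 3].
fancy = a[rows, cols]

# Orthogonal (outer) indexing: every selected row is combined with every
# selected column. In NumPy, np.ix_ builds the open mesh that expresses this.
orthogonal = a[np.ix_(rows, cols)]

print(fancy.shape)       # one value per (row, col) pair
print(orthogonal.shape)  # a len(rows) x len(cols) block
```

The same index lists give a 1-D result in the fancy case and a 2-D block in the orthogonal case, which is the source of much of the confusion the announcement alludes to.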

📣 Call for Papers!

The #DigitalClassicistSeminar Berlin, organized by @berliner_antike_kolleg together with DAI, @BBAW, @freieuniversitaet and @HumboldtUni, goes into its next round from October '25 to February '26.

Early-career researchers in particular are invited to submit contributions on digital methods, the challenges of #BigData, #Visualisierungen (visualizations) and much more by 🗓️ 16 August.

👉 The full #CfP is available here: digital-classicist.bbaw.de/cfp


What does it take to maintain one of the world's largest repositories of free, structured knowledge?

Read this interview about the challenges Wikidata faces and how the team handles massive scale and constant updates, all while remaining open source:
bigdatawire.com/2025/07/10/sca #OpenData #KnowledgeGraphs #BigData #SemanticWeb

BigDATAwire · Scaling the Knowledge Graph Behind Wikipedia · As the fifth most popular website on the Internet, keeping Wikipedia running smoothly is no small feat. The free encyclopedia hosts more than 65 million […]

🚀 Blosc2 supports memory-mapped files for super-efficient data access! 🚀

✨ Why memory-mapping?
1️⃣ No system call overhead for each read/write
2️⃣ Data goes straight from page cache to user space—much faster than traditional I/O!

👉 github.com/Blosc/python-blosc2

Join our tutorial at @EuroSciPy 2025, where we'll dive deep into these techniques and share more expert tips for maximizing data throughput. See you there!

#DataScience #Performance #BigData 🚀💾
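
As a rough illustration of the memory-mapping idea in the Blosc2 post above, the sketch below maps a file into memory and reads a small slice of it. It uses NumPy's `np.memmap` as a stand-in, because the post does not show Blosc2's own memory-mapping API; the file name, array size, and slice bounds are arbitrary choices for this example.

```python
import os
import tempfile

import numpy as np

# Write one million 64-bit integers to a temporary file (illustrative data).
path = os.path.join(tempfile.mkdtemp(), "example.bin")
np.arange(1_000_000, dtype=np.int64).tofile(path)

# Map the file read-only. Nothing is read eagerly: the OS pages data in
# on demand when the mapped memory is actually touched.
mm = np.memmap(path, dtype=np.int64, mode="r", shape=(1_000_000,))

# Slicing touches only the pages that back this range; copying into a
# regular array detaches the result from the mapping.
window = np.array(mm[500_000:500_010])
total = int(window.sum())
print(window, total)
```

Only the pages backing the requested slice need to be brought in, and no explicit `read()` call is issued per access, which is the benefit the post's list describes.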