sigmoid.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
A social space for people researching, working with, or just interested in AI!

Server stats:

537
active users

#lucene

0 posts0 participants0 posts today
Dotan Horovits ✈️ #OSSummit KR<p>Devoxx Poland is just a couple of days away!<br>Join my talk Wednesday at the Data &amp; AI track to learn about the <a href="https://fosstodon.org/tags/OpenSearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSearch</span></a> project, and how it can provide you search, analytics, observability and vector database capabilities, all <a href="https://fosstodon.org/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <span class="h-card" translate="no"><a href="https://social.lfx.dev/@linuxfoundation" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>linuxfoundation</span></a></span> <br>👉 <a href="https://devoxx.pl/talk-details/?id=8605" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">devoxx.pl/talk-details/?id=860</span><span class="invisible">5</span></a></p><p><a href="https://fosstodon.org/tags/data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>data</span></a> <a href="https://fosstodon.org/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a> <a href="https://fosstodon.org/tags/developers" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>developers</span></a> <a href="https://fosstodon.org/tags/search" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>search</span></a> <a href="https://fosstodon.org/tags/analytics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>analytics</span></a> <a href="https://fosstodon.org/tags/vectordb" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vectordb</span></a> <a href="https://fosstodon.org/tags/observability" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>observability</span></a> <a href="https://fosstodon.org/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> <a href="https://fosstodon.org/tags/devoxx" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>devoxx</span></a> <a href="https://fosstodon.org/tags/devoxxpl" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>devoxxpl</span></a> <a href="https://fosstodon.org/tags/DevoxxPoland" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DevoxxPoland</span></a></p>
Dotan Horovits ✈️ #OSSummit KR<p><a href="https://fosstodon.org/tags/OpenSearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSearch</span></a> 3.0 is out! 🍾 🥳 <br>After 3 years of 2.x, it's time for the next leap, which brings major upgrades to performance, data management, <a href="https://fosstodon.org/tags/vectorDB" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vectorDB</span></a> functionality, and much more.<br>📈 Upgrade to Apache <a href="https://fosstodon.org/tags/Lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Lucene</span></a> 10 and <a href="https://fosstodon.org/tags/JDK" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>JDK</span></a> 21+<br>📈 Pull-based ingestion for streaming data, with support for Apache <a href="https://fosstodon.org/tags/Kafka" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Kafka</span></a> and Amazon <a href="https://fosstodon.org/tags/Kinesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Kinesis</span></a><br>📈 Power agentic <a href="https://fosstodon.org/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> with native <a href="https://fosstodon.org/tags/MCP" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MCP</span></a> support<br>📈 Investigate logs with expanded PPL query tools, backed by Apache <a href="https://fosstodon.org/tags/Calcite" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Calcite</span></a></p><p>Check out <span class="h-card" translate="no"><a href="https://fosstodon.org/@OpenSearchProject" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>OpenSearchProject</span></a></span> blog:<br><a href="https://opensearch.org/blog/unveiling-opensearch-3-0/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">opensearch.org/blog/unveiling-</span><span class="invisible">opensearch-3-0/</span></a></p>
Philipp Krenn<p>nvidia GTC is coming to the bay area next week. we'll be there with a<br>* talk about bringing <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> to the GPU<br>* a "guess that prompt" meetup between galileo + UnstructuredIO + elastic. join us to outsmart AI ;)<br><a href="https://lu.ma/guess-that-prompt" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">lu.ma/guess-that-prompt</span><span class="invisible"></span></a></p>
Philipp Krenn<p><a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> 9 end of life cleanup 💥<br><a href="https://github.com/apache/lucene/pull/13882" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/apache/lucene/pull/</span><span class="invisible">13882</span></a></p>
Nurhak Kaya<p>Just blogged about "How to sort items by a <a href="https://umbracocommunity.social/tags/custom" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>custom</span></a> date in an <a href="https://umbracocommunity.social/tags/Umbraco" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Umbraco</span></a> v13+ <a href="https://umbracocommunity.social/tags/Examine" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Examine</span></a> Index".</p><p><a href="https://www.nurhakkaya.com/2024/10/how-to-sort-items-by-custom-date-in.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">nurhakkaya.com/2024/10/how-to-</span><span class="invisible">sort-items-by-custom-date-in.html</span></a></p><p><a href="https://umbracocommunity.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> <a href="https://umbracocommunity.social/tags/search" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>search</span></a> <a href="https://umbracocommunity.social/tags/umbraco" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>umbraco</span></a> <a href="https://umbracocommunity.social/tags/examine" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>examine</span></a> <a href="https://umbracocommunity.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://umbracocommunity.social/tags/hacktoberfest" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>hacktoberfest</span></a> <a href="https://umbracocommunity.social/tags/contribution" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>contribution</span></a></p>
Philipp Krenn<p><a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> 10 is out: <a href="https://lucene.apache.org/core/corenews.html#apache-lucenetm-1000-available" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">lucene.apache.org/core/corenew</span><span class="invisible">s.html#apache-lucenetm-1000-available</span></a><br>and the <a href="https://mastodon.social/tags/elasticsearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>elasticsearch</span></a> upgrade is already kicking off, starting the countdown for version 9: <a href="https://github.com/elastic/elasticsearch/pull/114741" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/elastic/elasticsear</span><span class="invisible">ch/pull/114741</span></a></p>
Charlie Hull<p>Some people blog about K8s, I prefer to blog about K9. <a href="https://opensourceconnections.com/blog/2024/10/10/dogfights-in-open-source-search-solr-opensearch-and-elasticsearch/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">opensourceconnections.com/blog</span><span class="invisible">/2024/10/10/dogfights-in-open-source-search-solr-opensearch-and-elasticsearch/</span></a> <a href="https://hachyderm.io/tags/solr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>solr</span></a> <a href="https://hachyderm.io/tags/elasticsearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>elasticsearch</span></a> <a href="https://hachyderm.io/tags/opensearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensearch</span></a> <a href="https://hachyderm.io/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a> <a href="https://hachyderm.io/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a></p>
Dalatangi<p><span class="h-card" translate="no"><a href="https://cosocial.ca/@timbray" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>timbray</span></a></span> The dk.brics.automaton Java library comes to my mind immediately. Very minimalistic, incredibly fast and efficient (C-like code actually) and only Junit test dependencies. </p><p><a href="https://github.com/cs-au-dk/dk.brics.automaton" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/cs-au-dk/dk.brics.a</span><span class="invisible">utomaton</span></a></p><p><a href="https://www.brics.dk/automaton/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="">brics.dk/automaton/</span><span class="invisible"></span></a></p><p>Plus, it is widely used in e.g. <a href="https://fosstodon.org/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> and via this in things like <a href="https://fosstodon.org/tags/solr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>solr</span></a> or <a href="https://fosstodon.org/tags/ElasticSearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ElasticSearch</span></a></p>
Philipp Krenn<p>8️⃣.0️⃣ approximate kNN search based on HNSW with float vectors on the _knn endpoint. this like many other (but not all) changes is based on improvements in <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> 3/18</p>
Erik C. Thauvin<p>Elasticsearch is Open Source, Again</p><p><a href="https://mastodon.social/tags/amazon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>amazon</span></a> <a href="https://mastodon.social/tags/apache" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>apache</span></a> <a href="https://mastodon.social/tags/java" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>java</span></a> <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> <a href="https://mastodon.social/tags/opensource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensource</span></a></p><p><a href="https://www.elastic.co/blog/elasticsearch-is-open-source-again?utm_medium=erik.in&amp;utm_source=mastodon" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">elastic.co/blog/elasticsearch-</span><span class="invisible">is-open-source-again?utm_medium=erik.in&amp;utm_source=mastodon</span></a></p>
dada<p>Vous sauriez s'il existe une option dans une Lucene Query de Grafana de passer une variable de majuscule à minuscule ou inversement ?</p><p>J'ai une variable Grafana que le user doit renseigner en majuscule mais qui doit être exploitée en minuscule dans des graphiques...</p><p><a href="https://diaspodon.fr/tags/aide" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>aide</span></a> <a href="https://diaspodon.fr/tags/grafana" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>grafana</span></a> <a href="https://diaspodon.fr/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a></p>
Philipp Krenn<p>approximate kNN search:<br>* good estimate<br>* you can control speed vs precision through the num_candidates setting (basically overfetching on the approximation for getting very close to exact kNN)<br>* <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> uses HNSW: think of it as highways, roads &amp; streets 🛣️ 3/9</p>
Philipp Krenn<p>great validation on all the progress <a href="https://mastodon.social/tags/elasticsearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>elasticsearch</span></a> and <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> have made as a vector database (besides all the other features and improvements)<br>guess we'll "just" have to do a lot more shouting about it 📣<br><a href="https://reddit.com/r/elasticsearch/comments/1d3xtds/is_elastic_search_better_than_chromadb/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">reddit.com/r/elasticsearch/com</span><span class="invisible">ments/1d3xtds/is_elastic_search_better_than_chromadb/</span></a></p>
Dave Mackey<p>Any friends (or potential friends :)) in <a href="https://hachyderm.io/tags/stockholm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stockholm</span></a> ? I'll be in Stockholm and have some availability to meet up. Interested in connecting in general as well as with folks in <a href="https://hachyderm.io/tags/search" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>search</span></a>, <a href="https://hachyderm.io/tags/informationretrieval" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>informationretrieval</span></a>, <a href="https://hachyderm.io/tags/knowledgemanagement" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>knowledgemanagement</span></a>, <a href="https://hachyderm.io/tags/library" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>library</span></a> sciences.</p><p><a href="https://hachyderm.io/tags/sweden" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sweden</span></a> <a href="https://hachyderm.io/tags/elastic" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>elastic</span></a> <a href="https://hachyderm.io/tags/ElasticSearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ElasticSearch</span></a> <a href="https://hachyderm.io/tags/opensearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>opensearch</span></a> <a href="https://hachyderm.io/tags/solr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>solr</span></a> <a href="https://hachyderm.io/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> <a href="https://hachyderm.io/tags/ir" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ir</span></a></p>
Philipp Krenn<p>pgvector 0.7 is out: <a href="https://www.postgresql.org/about/news/pgvector-070-released-2852/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">postgresql.org/about/news/pgve</span><span class="invisible">ctor-070-released-2852/</span></a><br>1. never forget about <a href="https://mastodon.social/tags/postgresql" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>postgresql</span></a><br>2. fascinating what the "right" quantization / granularity per vector dimension is for each system. postgres picks 4 and 2 byte plus 1 bit. whereas <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> / <a href="https://mastodon.social/tags/elasticsearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>elasticsearch</span></a> are going for 4 and 1 byte plus 4 bit. who can pick the right tradeoffs here?</p>
Alexander Reelsen<p>Zulia Search Engine: Another Lucene based distributed search engine, that is written in Java. Using MongoDB in cluster mode for metadata sharing/node management it seems. Seems to be tailored for a certain use-case due to not having a lot of analyzers exposed. Communicating via GRPC.</p><p>Did not spot anything regarding distributed error handling and documentation is relatively sparse.Lucene is here to stay and going nowhere. <a href="https://mastodon.social/tags/search" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>search</span></a> <a href="https://mastodon.social/tags/java" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>java</span></a> <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a></p><p><a href="https://github.com/zuliaio/zuliasearch" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/zuliaio/zuliasearch</span><span class="invisible"></span></a></p>
Marc R. Hoffmann<p>Is there really cheese in Java?</p><p>The <a href="https://javabubble.social/tags/javaalmanac" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>javaalmanac</span></a> now has full text search 🥳</p><p>Search JavaDoc, JEPs, language and JVM specs across versions.</p><p><a href="https://javaalmanac.io/find/?q=cheese" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">javaalmanac.io/find/?q=cheese</span><span class="invisible"></span></a></p><p><a href="https://javabubble.social/tags/openjdk" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openjdk</span></a> <a href="https://javabubble.social/tags/java" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>java</span></a> <a href="https://javabubble.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a></p>
Philipp Krenn<p>there are a lot of vector similarities: dot product, euclidean, max inner product, hamming distance,...<br>how to support more in <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> while keeping it tested and benchmarked? plugins — PR in progress with two different groups: <a href="https://github.com/apache/lucene/pull/13288" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/apache/lucene/pull/</span><span class="invisible">13288</span></a><br>* core: currently including dot product, euclidean, and max inner product<br>* lucene codecs (similar to compression extensions): starting with hamming distance<br>also cleaning up with generics and maybe allowing more native code optimizations later on</p>
Philipp Krenn<p><a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> 10 is starting to shape up: <a href="https://lists.apache.org/thread/4bhnkkvvodxxgrpj4yqm5yrgj0ppc59r" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">lists.apache.org/thread/4bhnkk</span><span class="invisible">vvodxxgrpj4yqm5yrgj0ppc59r</span></a> 🙌</p>
Philipp Krenn<p>searching HNSW in parallel on <a href="https://mastodon.social/tags/lucene" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>lucene</span></a> 9.10 and then <a href="https://mastodon.social/tags/elasticsearch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>elasticsearch</span></a> got a great speedup for all your (dense) vector fields: <a href="https://github.com/apache/lucene/pull/12962" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/apache/lucene/pull/</span><span class="invisible">12962</span></a><br>merging to a single segment will still be faster but the difference is getting much smaller. moar smart optimizations :)</p>