Reranking text documents with Ollama and Qwen3 Embedding model - in Golang:
https://www.glukhov.org/post/2025/06/reranking-with-ollama-qwen3-embedding-golang/
#ollama #embedding #reranking #golang #ai #llm
Reranking text documents with Ollama and Qwen3 Embedding model - in Golang:
https://www.glukhov.org/post/2025/06/reranking-with-ollama-qwen3-embedding-golang/
#ollama #embedding #reranking #golang #ai #llm
Qwen3 Embedding & Reranker Models on Ollama: State-of-the-Art Performance
https://www.glukhov.org/post/2025/06/qwen3-embedding-qwen3-reranker-on-ollama/
#Qwen3 #Embedding #Reranker #LLM #AI #ollama
#Development #Techniques
Introducing php-node · How to seamlessly blend PHP with Node.js https://ilo.im/164g4x
_____
#Programming #Coding #Embedding #NodeJS #PHP #WordPress #CMS #WebDev #Frontend #Backend
https://blog.gslin.org/archives/2025/06/09/12444/mariadb-11-8-lts/
MariaDB 11.8 LTS
#2038 #2106 #database #embedding #hnsw #mariadb #mysql #problem #rdbms #signed #timestamp #unsigned #vector #year
[備忘録] Google Colabで30行!Qwen3-Embedding-0.6Bで日本語テキスト類似度計算
https://qiita.com/Tadataka_Takahashi/items/4ff6e114db134746c835?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items
'Variance-Aware Estimation of Kernel Mean Embedding', by Geoffrey Wolfer, Pierre Alquier.
http://jmlr.org/papers/v26/23-0161.html
#embeddings #embedding #empirical
Photographer Asks Supreme Court to Decide if Embedded Instagram Posts Infringe Copyright https://petapixel.com/2025/04/09/photographer-asks-supreme-court-to-decide-if-embedded-instagram-posts-infringe-copyright/ #copyrightinfringement #embedfeature #supremecourt #Technology #embedding #instagram #lawsuit #News #Law
How to speak in ways AI bots won’t understand.
今朝毎朝ボット
https://youtube.com/watch?v=F4KQ8wBt1Qg
#ai #llm #machinelearning #embedding #context
SOTA Code Retrieval with Efficient Code Embedding Models — https://www.qodo.ai/blog/qodo-embed-1-code-embedding-code-retreival/
#HackerNews #SOTA #Code #Retrieval #Code #Embedding #AI #Technology #Machine #Learning
GitHub - lancedb/lancedb: Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps! https://github.com/lancedb/lancedb #persistence #OpenSource #embedding #database #GitHub #search #vector #ai
初めてのAI開発!ワクワクしながら作った問い合わせ対応チャットボット
https://qiita.com/SatoRyota_zvc/items/c5d647f5174ca8136bcb?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items
I’m excited to share my newest blog post, "Don't sure cosine similarity carelessly"
https://p.migdal.pl/blog/2025/01/dont-use-cosine-similarity
We often rely on cosine similarity to compare embeddings—it's like “duct tape” for vector comparisons. But just like duct tape, it can quietly mask deeper problems. Sometimes, embeddings pick up a “wrong kind” of similarity, matching questions to questions instead of questions to answers or getting thrown off by formatting quirks and typos rather than the text's real meaning.
In my post, I discuss what can go wrong with off-the-shelf cosine similarity and share practical alternatives. If you’ve ever wondered why your retrieval system returns oddly matched items or how to refine your embeddings for more meaningful results, this is for you!
`
I want to thank Max Salamonowicz and Grzegorz Kossakowski for their feedback after my flash talk at the Warsaw AI Breakfast, Rafał Małanij for inviting me to give a talk at the Python Summit, and for all the curious questions at the conference, and LinkedIn.
Damn, this is really cool, but I wish it had a big “pre-requisites” in the readme with “NVIDIA” in it #AI #RAG #Embedding #Documents #Ollama https://github.com/TilmanGriesel/chipper
Efficient #Deep Pre-trained Sentence #Embedding Model for #Similarity Search #by Khushboo Taneja, Jyoti Vashishtha and Saroj Ratnoo
Encoder only model that's a direct drop-in replacement for existing BERT models
- First major upgrade to BERT-style models in six years
- Significantly reduced processing costs for large-scale applications
- Enables longer document processing without chunking
- Better performance in retrieval tasks
- Suitable for consumer-grade GPU deployment
#llm #ai #embedding
https://huggingface.co/blog/modernbert
SQLite's Use Of Tcl (2017): I had no idea the database was originally written to be used as a TCL extension. Explains a lot of good things.
https://www.tcl.tk/community/tcl2017/assets/talk93/Paper.html
#via:lobsters #programming #embedding #sqlite #tcl #+
Fine-tuning #embedding models clarifies enterprise semantics, business metrics, and ranking relevance prior to users issuing prompts.
https://thenewstack.io/the-secret-sauce-for-vector-search-training-embedding-models/
Jina Al just released Jina ColBERT v2, a Multilingual Late Interaction Retriever for #Embedding and #Reranking. The new model supports 89 languages with superior retrieval performance, user-controlled output dimensions, and 8192 token-length.
#Development #Pitfalls
YouTube embeds are bananas heavy · Lighter ways to add YouTube videos on your website https://ilo.im/15zdd6
_____
#Video #Youtube #Embedding #WebComponent #ProgressiveEnhancement #WebPerf #WebDev #Frontend #HTML #JavaScript
Gave https://ollama.com/avr/sfr-embedding-mistral a spin but took way to long (+3hours) to generate 5K embeddings on my m3 pro (32gb).. #llm #embedding #ollama