The AI Inference Wars: Comparing Taalas, Cerebras, Groq, Etched, and NVIDIA
Custom AI chips are crushing NVIDIA GPUs on inference speed. Taalas HC1 hits 17,000 tokens/s, while Etched Sohu claims 500,000 tokens/s. Here's how they all compare.
Discoveries from the AI/ML ecosystem — interesting projects, tools, and libraries worth knowing about.
Crawl entire websites, index their content, and ask natural-language questions using RAG. Built with FastAPI, LangChain, ChromaDB, and Groq's LLaMA 3.3 70B.
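For context on the retrieve-then-generate loop behind that project, here's a minimal sketch using ChromaDB for indexing and the Groq SDK for generation. The collection name, sample documents, and prompt are illustrative assumptions, not the project's actual code.

```python
# Minimal retrieve-then-generate sketch (illustrative; not the project's actual code).
# Assumes `pip install chromadb groq` and a GROQ_API_KEY in the environment.
import os
import chromadb
from groq import Groq

# 1. Index: an in-memory collection using Chroma's default embedding function.
chroma = chromadb.Client()
docs = chroma.create_collection("site_pages")  # hypothetical collection name
docs.add(
    ids=["page-1", "page-2"],
    documents=[
        "Example page text captured by the crawler...",
        "Another crawled page about the product's pricing...",
    ],
)

# 2. Retrieve: embed the question and pull the closest chunks.
question = "What does the product cost?"
hits = docs.query(query_texts=[question], n_results=2)
context = "\n\n".join(hits["documents"][0])

# 3. Generate: ask the LLM to answer strictly from the retrieved context.
llm = Groq(api_key=os.environ["GROQ_API_KEY"])
reply = llm.chat.completions.create(
    model="llama-3.3-70b-versatile",  # Groq-hosted LLaMA 3.3 70B
    messages=[
        {"role": "system", "content": "Answer only from the provided context."},
        {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
    ],
)
print(reply.choices[0].message.content)
```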
A complete guide to self-hosted voice AI: from LiveKit-based local setups to voice-native models like PersonaPlex and Moshi that eliminate STT/TTS latency entirely.
Traditional CFD and FEA spend 80% of their time on meshing. PINNs go mesh-free but must be retrained for every simulation. Physics-informed neural operators (PINOs) train once and solve forever. Here's how they compare.
Companies have Human Resources for managing human capital. As AI agents become a core workforce, we need a parallel function for managing AI capital. This shift is already underway.
How researchers are creating domain-specific foundation models from DINOv2. A practical guide using RedDino as a case study, applicable to cardiac imaging, pathology, and beyond.
Dify combines visual workflow building, RAG pipelines, agent capabilities, and LLMOps into one self-hostable platform. Here's why it's becoming the go-to for agentic app development.
A practical guide to building production-ready detection and segmentation models with minimal manual labeling using SAM, SAM 2, SAM 3, and active learning workflows.
Google Research just open-sourced a 200M parameter foundation model for time series forecasting. It works zero-shot on any data—no training required.
Did hierarchical tree indexing just kill vector databases? A deep dive into PageIndex's 98.7% accuracy claim and when to use reasoning-based vs. embedding-based retrieval.
When to use Upstash, local file caching, embedded databases, managed vector services, or skip vectors entirely. A practical framework for choosing your RAG infrastructure.
Alibaba open-sources Zvec, an embedded vector database that runs in-process with zero infrastructure. Over 8,000 QPS, 2x faster than the previous leader.
From academic research to production systems, why the AI industry is converging on code-based tool calling over JSON schemas
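To make that contrast concrete, here's a toy sketch comparing a JSON-schema tool call with code-based tool calling, where the model emits a short program that composes tools directly. The tool names and the sandboxing approach are assumptions for illustration only, not any particular framework's API.

```python
# Toy contrast between JSON tool calls and code-based tool calling (illustrative only).

def get_weather(city: str) -> str:
    """A stand-in tool; a real agent would call an external API here."""
    return f"18C and cloudy in {city}"

def convert_to_f(celsius: float) -> float:
    return celsius * 9 / 5 + 32

# JSON-schema style: the model returns one structured call per step, and the
# runtime loops back to the model between every tool invocation.
json_call = {"name": "get_weather", "arguments": {"city": "Berlin"}}
result = {"get_weather": get_weather}[json_call["name"]](**json_call["arguments"])

# Code-based style: the model emits a small program that chains tools itself,
# so multi-step logic runs in one shot inside a restricted namespace.
model_emitted_code = """
report = get_weather("Berlin")
answer = f"{report} ({convert_to_f(18):.0f}F)"
"""
sandbox = {"get_weather": get_weather, "convert_to_f": convert_to_f}
exec(model_emitted_code, sandbox)

print(result)             # single JSON-style tool result
print(sandbox["answer"])  # composed result from the emitted code
```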
An open-source tool that intercepts and blocks dangerous AI agent behaviors before they can access your secrets, delete files, or exfiltrate data
An open-source motion capture system that delivers professional results without expensive hardware — just standard webcams and a pip install
An open-source AI assistant that connects to WhatsApp, Telegram, Slack, Discord, and more — running entirely on your own devices
A web agent infrastructure that treats real websites like programmable surfaces — send a URL and a goal in plain English, get structured JSON back
How to fine-tune LLMs directly from your IDE using Unsloth and Google Colab's free GPUs—no expensive hardware required
A local-first AI agent that manages files, creates documents, and browses the web — without monthly subscriptions or sending your data anywhere.
Most teams built RAG in 2023 and never rebuilt it. Here's why your AI answers feel average — and the design patterns that actually work at scale.
The viral AI agent framework that amassed 200K+ GitHub stars now has a multi-agent coordination layer. Deploy squads of agents that share a Kanban board.
An economic benchmark where AI agents start with $10, pay for their own tokens, and must complete real professional tasks to survive. Top performers earn $1,500+/hr equivalent.
An open-source tool that applies deep research workflows to your own files—PDFs, Word docs, images—generating structured markdown reports without manual digging.
Google introduces an agentic framework that automatically generates methodology diagrams and statistical plots from text descriptions—no design skills required.
Google and Microsoft propose a web standard that lets sites expose structured tools to AI agents — no more DOM scraping and button-guessing.
An autonomous AI creature that lives in a folder on your computer, continuously researching, writing, and building — all on its own.
Package embeddings, data, and search structures into a single portable file. No vector database needed — just self-contained memory for your AI agents.
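To make the single-file idea concrete, here is a hedged sketch of the general pattern: embeddings and their source texts packed into one portable archive and searched with brute-force cosine similarity. The file layout and helper names are assumptions for illustration, not the tool's actual format.

```python
# Illustrative single-file memory: not the tool's real format, just the general idea.
# Embeddings and texts live in one portable .npz archive; search is brute-force cosine.
import numpy as np

def save_memory(path, embeddings, texts):
    """Pack embeddings (N x D float32) and their source texts into one file."""
    np.savez_compressed(path,
                        embeddings=np.asarray(embeddings, dtype=np.float32),
                        texts=np.array(texts))

def search_memory(path, query_embedding, k=3):
    """Load the archive and return the k texts closest to the query by cosine similarity."""
    archive = np.load(path)
    emb = archive["embeddings"]
    q = np.asarray(query_embedding, dtype=np.float32)
    scores = emb @ q / (np.linalg.norm(emb, axis=1) * np.linalg.norm(q) + 1e-9)
    top = np.argsort(-scores)[:k]
    return [(str(archive["texts"][i]), float(scores[i])) for i in top]

# Usage with toy 3-dimensional "embeddings" (a real setup would use a model's vectors).
save_memory("agent_memory.npz",
            embeddings=[[0.1, 0.9, 0.0], [0.8, 0.1, 0.1]],
            texts=["User prefers dark mode.", "Project deadline is Friday."])
print(search_memory("agent_memory.npz", [0.0, 1.0, 0.0], k=1))
```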
State-of-the-art on SWE-Bench at 80.2%, trained on 200K real coding environments, and priced at $1/hour. The economics of AI coding just changed.
Alibaba's massive open-weights model brings 397B parameters, native multimodal capabilities, and support for 201 languages — with efficient MoE inference.
No more clicking on objects — describe what you want to segment in plain English. Trained on 4 million unique concepts with 50x the vocabulary of existing datasets.
A dual-agent system that generates polished scientific illustrations from text descriptions or directly from research papers, using iterative refinement.
Use natural language instead of brittle CSS selectors to extract web data. Supports multiple LLM backends, Tor routing, and stealth mode.
A browser-based GUI for fine-tuning large language models. Upload data, pick a model, adjust settings with sliders, and train — no coding required.
An open-source toolkit for real-time multimodal voice AI — handling speech recognition, turn-taking, interruption, and low-latency text-to-speech.
An open-source library that gives LLMs direct browser control — letting AI agents navigate websites, fill forms, and complete tasks that require human-like interaction.
A RAG system built specifically for scientific papers — with structure-aware retrieval, high-accuracy citations, and the ability to detect contradictions across your paper collection.
Adapts Meta's SAM2 for medical imaging by treating 3D CT/MRI scans as videos — enabling automatic propagation of segmentations through entire volumes.