Reka Flash, open source 21B model comparable to QWQ 32B
Qwen/QwQ-32B · Hugging Face
Mac Studio 2025
NVIDIA's GeForce RTX 4090 With 96GB VRAM Reportedly Exists; The GPU May Enter Mass Production Soon, Targeting AI Workloads
Chain of Draft: Thinking Faster by Writing Less
Atom of Thoughts (AOT): lifts gpt-4o-mini to 80.6% F1 on HotpotQA, surpassing o3-mini and DeepSeek-R1
Crossing the uncanny valley of conversational voice
Apparently microsofts new phi-4-mini is business-tuned?
Framework just announced their new Desktop: an AI powerhorse?
Anyone found "optimal" settings for llama.cpp partial offload?
Psychoanalysis
Does DeepSeek* Solve the Small Scale Model Performance Puzzle?
AMD CPU, Apple M4 Pro Performance - Ryzen AI MAX Review
Trying out old GPUs with Vulkan
Models not loading into RAM
AMD denies rumors of Radeon RX 9070 XT with 32GB memory
Recommend models for GTX 1660 Super (6GB)
AMD reportedly working on gaming Radeon RX 9070 XT GPU with 32GB memory
The Anthropic Economic Index - an initiative aimed at understanding AI's effects on labor markets and the economy over time.
AI Action Summit in Paris