Auto-curated from YouTube and web sources
AI is rapidly embedding into existing workflows rather than launching as standalone platforms — Atlassian's MCP agents in Confluence, Tubi's native ChatGPT app, and Poke's text-message interface all reflect a pattern of meeting users where they already are. Simultaneously, governance and safety are becoming first-class concerns, from Safetensors' move to vendor-neutral stewardship to OpenAI's child safety blueprint and Anthropic's restricted Mythos access.
An AI tool developed by Alibaba that enables text-prompt-based editing of 3D scenes, allowing users to modify objects, change styles, add elements, and replace backgrounds within 3D environments.
Visit Site →An AI model that takes multiple photos of an object or scene and reconstructs them into detailed, realistic 3D models using autoregressive techniques with test-time training.
Visit Site →An AI startup founded by Fei-Fei Li that builds world models capable of generating and reasoning about immersive, editable 3D environments. Its first product lets users create downloadable 3D scenes from prompts.
Visit Site →Autodesk's generative AI model trained on geometric data that can reason about components and systems to generate functional 3D models with an understanding of real-world design constraints.
No URLAn AI agent that can take a 2D image and autonomously reconstruct it as an editable 3D scene in Blender, enabling interactive physics simulations and scene manipulation.
Visit Site →A 3D rendering technique that builds scenes from millions of tiny elliptical 3D bumps (Gaussians) to create highly realistic virtual humans with subsurface scattering for skin and realistic hair rendering.
Visit Site →A Google DeepMind model announced in August that can generate dynamic, playable worlds from text prompts or images, retaining consistency for a few minutes at 720p resolution.
Visit Site →An all-in-one AI assistant app that integrates multiple leading AI models including GPT, Claude, and Gemini into a single platform, offering text chat, image generation, video creation, document summarization, OCR, and real-time web search across devices.
Visit Site →A benchmarking and leaderboard platform used to rank and compare AI models based on their performance, where users can evaluate open-source and proprietary models head-to-head.
No URLAn open platform by Hugging Face that routes API requests to various providers hosting open-source AI models, enabling users to access thousands of models through a unified interface.
Visit Site →An AI platform that provides access to multiple AI APIs including image generation, audio generation, 3D generation, LLMs, and video generation models through a unified interface, also supporting AI automation workflows.
Visit Site →A public benchmark leaderboard hosted on Hugging Face that ranks automatic speech recognition models based on word error rate and other transcription quality metrics.
Visit Site →A crowdsourced AI model-picking platform that allowed users to test and compare results from over 800 AI models, collecting preference data to sell to AI labs. The service has shut down.
Visit Site →A Boston-based enterprise AI platform that simultaneously queries multiple large language models — including ChatGPT, Gemini, Claude, and Grok — and fuses their responses to produce more accurate, less hallucination-prone answers with enterprise-grade data privacy.
Visit Site →An independent benchmarking and analysis platform that evaluates and compares AI models across metrics including accuracy, hallucination rates, speed, and pricing.
Visit Site →An AI news aggregator website built largely using AI agents, demonstrating the capability of modern AI coding assistants to autonomously develop and deploy web applications.
Visit Site →An AI aggregator platform that provides access to 25+ leading AI models including GPT-4o, Claude, Gemini, DeepSeek, Llama, and Perplexity in a single dashboard, allowing users to compare outputs side by side. It includes prompt engineering tools, image and document chat, web search, saved chat history, and a Chrome extension.
Visit Site →An AI aggregation platform that provides access to multiple AI models including ChatGPT, Gemini, and Mistral in one central hub, enabling text generation, image creation, video production, and SEO keyword research without needing separate subscriptions.
Visit Site →Japan's most comprehensive LLM evaluation platform, assessing language models across approximately 40 benchmarks covering agent capabilities (code generation, math reasoning, tool use) and alignment (instruction following, bias, toxicity, truthfulness, robustness).
Visit Site →A Google-owned platform for data science and machine learning that hosts competitions, datasets, and model repositories, enabling developers to download and deploy AI models.
Visit Site →A directory and news website that curates and catalogs AI tools, providing a searchable database alongside AI-related news coverage.
Visit Site →A platform for hosting, sharing, and discovering AI models, datasets, and machine learning applications, serving as a major hub for the open-source AI community.
Visit Site →An API aggregator platform that provides unified access to a wide range of AI models from multiple providers through a single interface.
Visit Site →A platform that tracks monthly unique visitors and usage statistics for various AI tools and platforms.
Visit Site →An AI benchmark platform that works like a blind taste test of models, where users compare two anonymous responses and vote on which is better, with leaderboards determined by user preference.
Visit Site →A long-term memory system developed by IBM Research that helps AI agents learn from previous executions by converting interaction traces into reusable guidelines, enabling agents to improve over time rather than repeating mistakes.
Visit Site →A safe and efficient file format originally created by Hugging Face for storing and sharing ML model weights without the risk of arbitrary code execution, featuring zero-copy and lazy loading capabilities. It has now joined the PyTorch Foundation as a vendor-neutral community project.
Visit Site →A remote desktop solution by Astropad designed specifically for monitoring and interacting with AI agents running on Apple devices, featuring high-fidelity streaming, voice dictation, and iPhone/iPad clients.
Visit Site →A visual AI tool by Atlassian, available in open beta within Confluence, that transforms data and information into visual assets such as charts and graphics, automatically recommending the best visual format for the content.
Visit Site →A personal AI agent by The Interaction Company of California that operates via iMessage, SMS, Telegram, and WhatsApp, allowing users to automate everyday tasks like calendar management, health tracking, smart home control, and daily planning through text messaging. It dynamically selects the best AI model for each task.
Visit Site →A native app integration built within OpenAI's ChatGPT platform by the streaming service Tubi, enabling users to discover movies and TV shows through natural-language prompts and receive curated recommendations linked to Tubi's library.
Visit Site →An Anthropic-led initiative focused on securing critical software infrastructure for the AI era, partnering with major tech companies including Amazon, Apple, Google, Microsoft, and NVIDIA to address AI-discovered cybersecurity vulnerabilities.
No URLA Gradio feature that extends FastAPI, allowing developers to pair any custom frontend (React, Svelte, plain HTML/JS) with Gradio's backend infrastructure including queuing, concurrency management, SSE streaming, and ZeroGPU support.
No URLA Hugging Face Spaces feature that provides automatic GPU allocation for machine learning applications, allowing developers to run GPU-intensive models without managing infrastructure directly.
Visit Site →A JavaScript client library for Gradio that enables frontend applications to communicate with Gradio backends through the queue system, available via CDN at @gradio/client.
Visit Site →A Spanish startup developing a satellite constellation to collect precise Earth observation data optimized for deep learning models, aiming to become an enterprise ground-truth data source for AI applications.
Visit Site →An AI-based enterprise management software platform that helps organizations automate tasks by first discovering which processes should be automated, founded by former OpenAI product manager Angela Jiang.
Visit Site →A web hosting platform offering managed WordPress hosting with AI-assisted website building tools, enabling users to create and manage up to 50 websites with no coding experience required.
Visit Site →A Japanese robotics company that builds software control platforms enabling industrial robots to autonomously handle picking and logistics tasks in warehouses and factories.
Visit Site →An AI-powered browser-based PDF tool by YaseenAI, Inc. that lets users edit, convert, merge, split, sign, and redact PDF files locally without downloading software, keeping files private and secure.
Visit Site →A Google application available on Android and iOS that allows users to discover, download, and run AI models locally on their devices for free, with no cloud processing required, supporting features like AI chat, image analysis, and audio transcription.
Visit Site →A stealth biotech AI startup that used artificial intelligence to make drug discovery and biological research more efficient. It was acquired by Anthropic for $400 million in a stock deal.
Visit Site →An AI-powered content moderation platform that turns policy documents into executable logic, providing real-time safety enforcement for user-generated and AI-generated content in under 300 milliseconds. It serves platforms like dating apps, AI companion companies, and AI image generators.
Visit Site →A benchmark environment where AI agents complete realistic multi-step tasks via APIs, used to evaluate agent performance on scenarios requiring complex control flow across multiple applications.
Visit Site →An open-source observability and analytics platform for LLM applications that captures agent trajectories including user utterances, tool calls, and results using OpenTelemetry-based tracing.
Visit Site →An open-source AI observability platform by Arize AI that provides a UI for tracing, evaluating, and debugging LLM applications and AI agents.
Visit Site →Activation-aware Weight Quantization, a quantization technique for large language models that preserves important weights based on activation patterns to maintain model quality at reduced precision.
Visit Site →A coalition created by Anthropic consisting of major technology companies that are given early access to test the Claude Mythos model and patch security vulnerabilities ahead of any broader release.
No URLAn open-source library by Hugging Face that provides APIs and tools for downloading and training state-of-the-art pretrained models across NLP, computer vision, and audio tasks.
No URLAn array framework for machine learning on Apple silicon, developed by Apple's machine learning research team, enabling efficient model inference and training on Mac devices.
Visit Site →A fine-tuning platform and library that enables faster and more memory-efficient training of large language models, supporting popular open-source models.
No URLAn AI startup focused on AI interpretability, building tools to understand and control the internal workings of large language models.
Visit Site →Google's premium subscription tier at approximately $200-250/month that provides 25,000 monthly AI credits for heavy use of Google's AI services including Google Flow.
Visit Site →An AI platform co-founded by former CNN host and Meta news executive focused on making AI platforms more trustworthy by vetting, verifying, and sustaining the veracity of information provided by large language models.
Visit Site →A startup building a deep learning model specifically trained on chip design data to assist semiconductor engineers in designing new computer chips, aiming to reduce chip development costs by over 75% and cut timelines by more than half.
Visit Site →An AI startup that offers McKinsey-style consulting reports generated through AI at a fraction of the cost of traditional consulting firms.
No URLAn advanced AI agent platform that can autonomously browse the web, watch videos, schedule recurring tasks, and build dashboards by spinning up its own virtual computer environment to complete complex multi-step tasks like a human would.
No URLA clean-room open-source reimplementation of Anthropic's Claude Code, rewritten in Python and Rust by developer Sigrid Jinn without using the original source code. It became the fastest GitHub repository in history to reach 50,000 stars, achieving the milestone in just two hours.
No URLA diagnostic benchmark designed to isolate and evaluate specific perception capabilities including attributes, OCR-guided disambiguation, spatial constraints, relations, and performance in dense long-context crowded scenes.
No URLA million-scale multimodal dataset by IBM purpose-built for chart interpretation and reasoning, containing 1.7 million diverse chart samples spanning 24 chart types and 6 plotting libraries with aligned plotting code, rendered images, data tables, summaries, and QA pairs.
No URLAn open-source database designed for AI applications, building infrastructure for multimodal data including video, audio, images, and text together.
Visit Site →A startup platform that uses vision language models to turn autonomous vehicle and robot fleet video footage into structured, searchable datasets for training and compliance purposes.
Visit Site →Salesforce's AI-powered agent built into Slack that can draft emails, schedule meetings, transcribe and summarize meetings, create reusable AI skills for custom tasks, and connect to external services via Model Context Protocol (MCP).
Visit Site →An AI-powered meeting assistant by YaseenAI that provides real-time transcription, automated summaries, action item tracking, and integrations with platforms like Zoom, Google Meet, Microsoft Teams, Slack, Notion, Jira, and Asana.
No URLAn AI-powered platform for code review, testing, and governance that uses multi-agent systems to verify AI-generated code quality. It analyzes how code changes affect entire systems by factoring in organizational standards, historical context, and risk tolerance.
Visit Site →An autonomous cloud infrastructure management platform that automatically manages and reallocates computing resources in real time for Kubernetes-based environments, reducing cloud and AI infrastructure costs by up to 80%.
Visit Site →A biotech platform that integrates disparate data sources and uses LLM-based systems combined with physics engines to create synthetic datasets and physics-based 'digital twins' of the human body for biomedical research, predictive modeling, and sports performance analysis.
Visit Site →An AI-powered app built by Bluesky's team that allows users to design custom algorithms, create personalized feeds, and eventually vibe-code social apps using natural language commands. It is built on the AT Protocol and leverages Anthropic's Claude as its underlying AI engine.
Visit Site →Sony's AI-powered audio processing technology that upscales compressed music files to restore detail and achieve near-high-resolution sound quality.
Visit Site →A large language model testing platform by Microsoft that allows users to experiment with and evaluate Microsoft's MAI foundational models.
Visit Site →A productivity and workspace platform that has integrated AI capabilities and recently announced support for agent skills, allowing AI agents to execute tasks using structured instruction files.
No URLA startup developing AI-powered tools for semiconductor chip design, competing in the space of AI-assisted electronic design automation (EDA).
Visit Site →A well-funded startup focused on applying AI to semiconductor chip design, having raised a $300 million Series A round.
No URLA vibe coding app that allowed users to generate functional apps using natural language AI prompts, requiring no formal coding experience. It was pulled from Apple's App Store for violating guidelines around downloading and executing code.
Visit Site →A cloud computing platform by Lambda that provides powerful NVIDIA GPUs for running AI models, chatbots, and machine learning experiments.
No URLAn Anthropic feature that allows Claude to control a user's computer by pointing, clicking, and completing tasks using the mouse and keyboard, enabling hands-free computer operation.
No URLAn AI-powered video surveillance search platform that uses vision-language models to let security personnel query camera feeds using natural language, find objects, people, or situations in footage in real time, and automatically detect threats based on preset rules.
Visit Site →An OCR benchmark used to evaluate optical character recognition performance of AI models on document understanding tasks.
Visit Site →A benchmark for evaluating document understanding and OCR capabilities of AI models across diverse document types.
Visit Site →An open-source project for instruction-tuning language models that builds on top of TRL's trainers and APIs as downstream infrastructure.
Visit Site →An AI-powered bird identification app that leverages Ring camera feeds to identify bird species for users.
Visit Site →An AI-powered video analytics platform for businesses that provides alerts and people counting capabilities using camera feeds.
Visit Site →A SoftBank-backed AI company offering a Routines app focused on elder care that leverages cameras to monitor aging family members and alert to concerns like falls or changes in routines.
Visit Site →An AI-powered platform that helps businesses understand wait times and congestion at locations where people queue, such as events, restaurants, and service desks.
Visit Site →A monitoring platform for Airbnb hosts that uses camera-less sensors and AI to track excessive noise, temperature, and other accommodation conditions.
Visit Site →An AI-powered app that uses camera feeds to monitor and assess lawn health for homeowners.
No URLA data labeling and annotation platform focused on autonomous vehicle and robotics data, developing AI-powered tools for auto-annotation workflows.
Visit Site →An AI-powered data labeling and annotation platform that provides tools for managing, curating, and labeling training data for computer vision and machine learning models.
Visit Site →A data infrastructure platform founded by ex-SpaceX engineers that manages telemetry and sensor data from complex machines like spacecraft and vehicles, organizing it for AI-driven analysis and decision-making in manufacturing and testing environments.
Visit Site →A startup that provides post-training data services for AI models, including data generation, evaluation, and reinforcement learning, leveraging a large India-based workforce of domain experts and contributors.
Visit Site →A benchmark platform by Martian that evaluates and ranks AI code review tools based on their ability to catch logic bugs, cross-file issues, and other code quality problems.
Visit Site →A GPU orchestration platform that helps organizations manage and optimize GPU resources for AI workloads, acquired by Nvidia.
Visit Site →A security and compliance automation platform that helps companies achieve and maintain certifications like SOC 2 and ISO 27001 through continuous monitoring and automated evidence collection.
Visit Site →A privacy-focused Mac app for AI-powered meeting notes that runs entirely locally on the device, capturing audio, transcribing in real time, and generating summaries without sending data to the cloud, available as a one-time purchase.
Visit Site →An open-source Swift audio library that works with Apple's Core Audio Taps API, enabling developers to tap into a Mac's audio streams for recording and processing system audio.
Visit Site →An AI-native inventory management platform that integrates with existing accounting systems and ERPs to keep physical goods data synced with accounting ledgers, primarily serving mid-market consumer brands.
Visit Site →A cloud platform for running, training, and fine-tuning open-source AI models, offering developers scalable infrastructure for building AI applications.
Visit Site →An AI-powered legal technology platform that uses large language models to assist lawyers with research, drafting, and other legal workflows.
Visit Site →A collaborative design software platform used for UI/UX design, prototyping, and design systems, which has integrated AI-powered features for design workflows.
Visit Site →A Spotify beta feature that allows artists to review and approve or decline releases before they appear on their profiles, designed to combat AI-generated songs and metadata errors that incorrectly attribute music to real artists.
Visit Site →A feature by Anthropic that allows users to assign tasks to Claude's desktop application remotely from their phone, enabling the AI to work on a Mac while the user is away.
No URLA browser-based AI assistant that integrates directly into Google Chrome, enabling users to write, research, summarize, and respond to content without switching tabs. It supports multiple AI models, custom copilots for specific workflows, email integration, PDF/image analysis, and a screenshot-based 'AI Vision' feature.
Visit Site →A novel AI memory compression algorithm developed by Google Research that uses vector quantization to reduce the KV cache (working memory) of AI models by at least 6x during inference without significant quality loss.
Visit Site →An end-to-end evaluation framework developed by ServiceNow for conversational voice agents that evaluates complete, multi-turn spoken conversations using a bot-to-bot architecture. It produces two high-level scores — EVA-A (Accuracy) and EVA-X (Experience) — to jointly assess task success and conversational quality.
Visit Site →An open-source Python framework for building real-time voice and multimodal AI applications, supporting both cascade architectures (STT → LLM → TTS) and audio-native models.
Visit Site →A multi-silicon inference cloud platform that orchestrates AI workloads across diverse hardware types (CPUs, GPUs, high-memory systems) simultaneously, claiming to speed up AI inference by 3x to 10x. It can split and run AI models across different chip architectures, optimizing for compute-bound, memory-bound, and network-bound tasks.
Visit Site →A cloud-based inference service from Gimlet Labs that provides API access to its multi-silicon orchestration technology, enabling AI model labs and data centers to run workloads efficiently across heterogeneous hardware.
No URLAn AI-powered personal context and recall tool that continuously reads your computer screen, stores the information as text, and lets you query your digital activity. It includes features like meeting notetaking, automated routines, and personalized prompts based on accumulated context.
Visit Site →An AI-powered code completion tool built into Microsoft Visual Studio that analyzes coding patterns to suggest entire lines or blocks of code, helping developers reduce repetitive work and catch bugs through real-time suggestions.
Visit Site →A custom AI chip developed by AWS designed for training and inference of machine learning models, offering a lower-cost alternative to Nvidia GPUs. It is used by major AI companies including Anthropic, OpenAI, and Apple for large-scale AI workloads.
No URLA custom chip designed by AWS specifically optimized for machine learning inference workloads, offering high performance at lower cost for deploying trained models in production.
Visit Site →An SDK and set of networking switches developed by AWS to optimize and run machine learning workloads on AWS custom chips like Trainium and Inferentia, enabling efficient chip-to-chip communication in mesh configurations.
Visit Site →An AI benchmark designed to measure scientific knowledge, with a harder variant called GPQA Diamond featuring more difficult questions for advanced model evaluation.
No URLByteDance's AI-powered marketing platform that helps creators and businesses generate marketing content, now integrating Dreamina Seedance 2.0 for video generation capabilities.
Visit Site →OpenAI's web browsing tool that is being integrated into the company's unified super app alongside ChatGPT and Codex.
No URLAn NVIDIA open-source toolkit for synthetic data generation, used to create high-quality training datasets from domain documents without manual labeling.
Visit Site →An open-source Python library developed by IBM Research for writing structured generative programs, replacing probabilistic prompt behavior with maintainable AI workflows using constrained decoding, structured repair loops, and composable pipelines.
Visit Site →A collection of specialized LoRA model adapters by IBM designed for well-defined operations such as query rewriting, hallucination detection, and policy compliance checking, built on top of IBM Granite models.
Visit Site →A Granite Library by IBM containing specialized LoRA adapter models for safety, factuality, and policy compliance tasks in AI workflows.
No URLA Granite Library by IBM targeting requirements validation in Mellea's instruct-validate-repair loop for structured generative workflows.
No URLA Granite Library by IBM targeting a variety of tasks in agentic RAG pipelines, covering pre-retrieval, post-retrieval, and post-generation stages.
No URLAn AI-powered feature on Google Pixel phones that monitors phone calls in real-time and alerts users when conversations exhibit patterns consistent with known scam tactics.
Visit Site →A security service built into Google Chrome that scans websites against a database of known threats to protect users from phishing, malware, and other malicious web pages.
Visit Site →Google's email service that uses AI-powered spam and scam detection to flag malicious emails with high-visibility warning banners, alerting users to suspicious messages, links, and senders.
Visit Site →An AI research tool by OpenAI designed to conduct in-depth research and analysis tasks, serving as a benchmark for agentic AI capabilities.
Visit Site →A platform that connects companies with software engineers and AI training contractors from emerging markets, offering services including AI data labeling and development talent sourcing.
Visit Site →Google's web-based tool for developers to prototype and experiment with generative AI models, including the Gemini family and Lyria music generation models.
No URLAn open-source AI proxy tool that provides a unified interface to call multiple LLM APIs using a consistent format, simplifying integration with various language model providers.
Visit Site →An AI-powered web development tool by StackBlitz that enables users to prompt, run, edit, and deploy full-stack web applications directly in the browser.
No URLAn AI agent tool that can perform tasks and take actions on behalf of users, integrating with various services and workflows.
Visit Site →Amazon's streaming media player that plugs into a TV's HDMI port, providing access to streaming services, cloud gaming, and Alexa voice control functionality.
No URLA mobile-first vibe coding platform co-founded by Riley Brown that lets users describe an app idea in plain English and have AI generate the complete application, including frontend, backend, payments, and deployment to app stores.
Visit Site →Meta's video creation and editing app designed to compete with other short-form video editing tools in the creator economy.
Visit Site →An AI-powered ERP startup focused on core accounting functions such as accounts receivable and accounts payable, designed as a modern alternative to legacy ERP systems like NetSuite.
Visit Site →An AI-native ERP startup that provides modern accounting and finance software as an alternative to traditional enterprise resource planning systems.
Visit Site →A legacy enterprise resource planning (ERP) platform by Oracle that connects departments like finance, HR, and inventory into a single system, recently updated with AI capabilities.
Visit Site →Intuit's widely used accounting software for small and mid-sized businesses, offering financial management tools including invoicing, payroll, and expense tracking.
Visit Site →An open standard for e-commerce developed by OpenAI in partnership with Stripe, designed to enable AI-powered product discovery and shopping experiences using merchant-provided data.
No URLA startup that built an interactive notebook tool designed for collaborative work between humans and AI agents, similar in concept to Jupyter notebooks. It was acquired by Databricks to support its Lakewatch security product.
Visit Site →A security startup that developed a 'data control plane' tool enabling enterprises to deploy AI agents securely while protecting sensitive data. It was acquired by Databricks to underpin its Lakewatch security product.
Visit Site →OpenAI's AI-powered search engine designed to provide direct answers to queries using AI, combining web search with language model capabilities.
Visit Site →An agentic AI operating system for enterprise customers that replaces traditional business software interfaces with natural language prompts. It post-trains open source models on customer datasets and deploys them within the customer's own cloud environment, enabling tasks like deal analysis, dashboard creation, and workflow automation.
Visit Site →A real-time personalization and ranking infrastructure platform that brings TikTok-style recommendation technology to consumer businesses. It uses proprietary large event models to personalize user experiences in real time with sub-20-millisecond decision-making, without relying on cookies or user identity.
Visit Site →Sequen's API-based platform that allows businesses to access frontier ranking models and real-time ranking models for personalization, replacing their existing relevance stack APIs.
No URLA self-serve API portal by Multiverse Computing that gives developers and enterprises direct access to compressed AI models with real-time usage monitoring and production-ready deployment capabilities.
Visit Site →Nvidia's high-bandwidth interconnect technology that enables fast communication between GPUs on a data center rack, critical for AI model training and inference workloads.
Visit Site →Nvidia's in-network computing platform providing high-performance networking switches designed for AI data center interconnection and large-scale model training.
No URLNvidia's Ethernet networking platform purpose-built for AI workloads, providing high-performance networking for AI data centers with optimized throughput and efficiency.
Visit Site →Nvidia's advanced Ethernet switches incorporating co-packaged photonics technology for more efficient high-bandwidth AI data center networking.
No URLA new Nvidia platform designed to optimize memory and storage for AI inference workloads, enabling more efficient handling of context data during model inference.
No URLAn all-in-one AI-powered podcasting platform designed for first-time and early-stage creators, offering recording, editing, transcription, AI-generated cover art, voice cloning for ad reads, dubbing, translation, and monetization tools in a single platform.
Visit Site →A memory training app that uses gamified microlessons, virtual Mind Palace technology, mnemonics, and spaced repetition to help users improve memory recall by an average of 70 percent.
Visit Site →Yahoo's AI-powered search engine currently in beta that provides AI-generated answers with clear attribution and referral links back to content publishers, designed to support the open web and publisher ecosystem.
Visit Site →A personalized AI homepage by Yahoo that aggregates content from Yahoo Mail, News, Finance, and Sports to create a custom daily briefing for users.
Visit Site →An open format developed by Anthropic for giving AI agents new capabilities and expertise through organized folders of instructions, scripts, and resources that agents can dynamically discover and load to perform better at specific tasks.
No URLA robotics AI system (Learns Athletic Humanoid Tennis Skills from Imperfect Human Motion Data) developed by researchers from Tsinghua University, Peking University, Galbot, and Shanghai AI Laboratory that teaches humanoid robots athletic skills like tennis using imperfect amateur motion capture data.
No URLAnthropic's agentic AI coding tool integrated into Apple's Xcode development environment, enabling developers to use Claude for autonomous coding tasks.
No URLA screen-capture-based personal AI tool that records screenshots of everything on your computer screen to help you search and recall past activity. It later rebranded to Limitless.
Visit Site →An AI-powered personal memory and productivity tool (formerly known as Rewind) that captures context from your digital life to help you recall and query past activities.
No URLA Windows feature by Microsoft that captures periodic screenshots of your computer screen and uses AI to make everything you've seen searchable and retrievable.
No URLAn AI-powered financial research and analytics platform designed for institutional investors, later acquired by market intelligence firm AlphaSense.
Visit Site →A market intelligence and search platform that uses AI and natural language processing to help professionals discover insights from financial documents, research, and business content.
Visit Site →An internal Apple chatbot app described as a text-based testing ground for the re-architected Siri, with no plans for public release as a standalone product.
No URLAn AI-powered platform for creating presentations, websites, and marketing assets using text prompts and templates. It approaches 100 million users and competes with tools like Canva and Adobe.
Visit Site →A software development tool by Tools for Humanity (World) that enables commercial websites to verify that a real human is behind an AI agent's purchasing decisions, using World ID verification integrated with the x402 payment protocol.
Visit Site →A Tel Aviv-based startup that uses sensors and AI to precisely measure and manage GPU power consumption in data centers, helping operators optimize energy usage and unlock more computing capacity.
Visit Site →An enterprise platform by Mistral AI that enables companies and governments to build custom AI models trained from scratch on their own proprietary data, rather than merely fine-tuning or augmenting existing models.
No URLAn open-source collection of reusable Claude Code skill prompts created by Y Combinator CEO Garry Tan, providing opinionated AI agent workflows for tasks like engineering, code review, design, and documentation.
Visit Site →A Google feature that allows its Gemini AI assistant to personalize responses by connecting across a user's Google ecosystem, including Gmail and Google Photos, to provide contextually relevant answers.
No URLA Google Search feature that integrates AI-powered responses directly into search results, allowing users to get AI-generated answers alongside traditional search results.
Visit Site →An AI-powered audio enhancement feature built into JBL speakers, such as the JBL Flip 7, that intelligently optimizes sound output to make music sound bigger and clearer.
No URLA virtual reality game that uses AI-generated, fully improvised conversations to create dynamic NPC interactions, allowing players to talk in real time to every character they meet without scripted dialogue trees.
Visit Site →An open-source runtime by Nvidia that enables AI agents to operate and adapt faster and more safely by enforcing policy-based privacy and security guardrails.
No URLAn AI-powered application development platform where users describe the app they want to build in plain English and receive a fully functional, deployable application in minutes.
Visit Site →A facial recognition platform that has built massive facial-scan databases by scraping photos from social media and other internet sources, then training machine learning algorithms on them. It is primarily used by law enforcement agencies to match faces against its database for identification purposes.
Visit Site →An AI agent builder developed by OpenAI, offered exclusively through AWS, designed to enable the creation and deployment of AI agents.
No URLAn AI-native loan origination system (LOS) designed to modernize lending for credit unions by leveraging LLMs to automate underwriting, process higher loan volumes, and reduce operational costs.
Visit Site →An AI-powered design and photo editing platform with over 130 million users that offers tools for image editing, content creation, and a new AI agent marketplace allowing creators to 'hire' AI assistants for tasks like resizing, remixing, and product photo editing.
Visit Site →NVIDIA's enterprise wrapper around OpenClaw that provides a simplified one-line installation process along with an additional security layer to make OpenClaw safe and suitable for enterprise use cases, addressing concerns about data leakage and API key exposure.
No URLA startup building visual memory infrastructure for AI wearables and robotics, enabling devices to store, index, and recall video-based memories for physical-world AI applications.
Visit Site →An Nvidia application framework for video analytics, search, and summarization, enabling AI-powered analysis of video data for various enterprise and robotics use cases.
Visit Site →A Texas-based AI-powered visual intelligence platform that integrates with existing surveillance cameras to detect firearms and identify potential mass shooters in under 10 seconds, alerting human reviewers and law enforcement.
Visit Site →An AI research automation tool launched by Andrej Karpathy that allows users to give an AI agent a task and have it run hundreds of thousands of experiments autonomously, keeping successful results and discarding failures.
Visit Site →NVIDIA's compact personal AI computing device capable of running large language models locally on-device, designed for developers and enthusiasts who want to run AI workloads without relying on cloud services.
Visit Site →Microsoft's productivity suite including Word, Excel, PowerPoint, Outlook, and OneNote, featuring AI-powered suggestions for writing, formatting, and data insights alongside traditional office tools.
No URLA password management tool by Bitdefender that securely stores and generates strong passwords composed of letters, numbers, and symbols to enhance online security.
Visit Site →A comprehensive antivirus suite by Bitdefender that protects devices against trojans, ransomware, and spyware, consistently ranked among top security solutions.
Visit Site →Bitdefender's premium security suite that includes unlimited VPN, anti-tracker browser extension, and comprehensive email and SMS protection for advanced online security.
Visit Site →An open-source, zero-knowledge cloud storage platform with post-quantum encryption that allows users to securely store, share, and sync files across all major platforms and devices.
Visit Site →Google's AI-powered search feature that generates AI-synthesized answers and summaries directly within Google Search results, drawing from indexed web content.
No URLAn enterprise AI agent platform introduced by NVIDIA that enables companies to deploy AI agents to carry out tasks on behalf of employees. It includes built-in security and privacy tools and is hardware-agnostic.
No URLAn open-source inference serving platform by NVIDIA for deploying AI models in production environments with support for ONNX and TensorRT optimized models.
Visit Site →Google Maps with newly added AI-powered features that enhance mapping and navigation capabilities using artificial intelligence.
No URLA benchmark for evaluating speculative decoding methods across diverse application scenarios such as multi-turn conversation, translation, and mathematical reasoning by aggregating instances from widely used datasets.
Visit Site →An AI startup co-founded by Jeff Bezos and former Google executive Vik Bajaj, focused on creating high-level AI models to improve manufacturing and engineering in aerospace, automotive, and other industrial sectors.
Visit Site →An AI-powered support assistant by Meta that provides users with 24/7 customer support within the Facebook and Instagram apps on iOS, Android, and desktop.
Visit Site →A standalone app and in-app feature by DoorDash that pays delivery couriers to complete data collection assignments—such as filming everyday tasks or recording speech—to help train AI and robotic systems to understand the physical world.
Visit Site →An AI-powered health data platform that aggregates medical records from various healthcare providers, enabling users to access and manage their health information in one place.
Visit Site →Vercel's AI-powered generative UI tool that allows users to describe interfaces and applications in natural language and receive working code and deployable front-end components.
Visit Site →An agentic retrieval pipeline developed by NVIDIA that goes beyond semantic similarity to dynamically adapt search and reasoning strategies for enterprise-scale document retrieval. It uses a ReACT architecture with an iterative loop between LLMs and retrievers to achieve state-of-the-art performance across diverse benchmarks.
Visit Site →A tiny, open-source, secure alternative to OpenClaw for building AI agents, built in approximately 500 lines of code using container technology to create isolated environments that prevent unauthorized data access.
Visit Site →An AI-powered feature on NBCUniversal's Peacock streaming platform that creates personalized 10-minute video summaries of daily events, first used during the 2024 Summer Olympics with an AI voice modeled after sports announcer Al Michaels.
Visit Site →A startup building an intelligence layer for AI agents that helps them understand humans across their entire digital footprint by analyzing public data across social networks and apps using machine learning techniques.
Visit Site →A free initiative by ElevenLabs that provides AI voice restoration to up to 1 million people with permanent voice loss due to conditions like ALS and cancer, working with accessibility nonprofits and disability foundations.
Visit Site →A corporate finance platform that has introduced virtual payment cards designed specifically for AI agents, allowing agents to make purchases via API, MCP, and CLI with programmable spending limits and real-time transaction visibility.
Visit Site →An AI-powered collaboration platform featuring an infinite whiteboard where AI generates different blocks for tasks like trip planning, with built-in browser, PDF, and image support for contextual AI assistance. Founded by former Google Maps engineers and backed by Sequoia Capital, the service shut down in 2026 after the team joined Microsoft.
Visit Site →An AI-powered audio and video editing platform that offers transcription-based editing, screen recording, and podcast production tools, popular among content creators and podcasters.
Visit Site →Spotify's all-in-one podcasting platform (formerly Spotify for Podcasters) that offers unlimited hosting, video podcast uploads, audience analytics, and monetization tools for podcast creators.
Visit Site →Adobe's professional audio editing and production software used for recording, mixing, and mastering audio content including podcasts and music.
Visit Site →A remote recording and podcast production platform that captures high-quality audio and video locally, offering transcription and editing tools for podcasters and content creators.
Visit Site →Google's cybersecurity threat intelligence division that researches, identifies, and reports on advanced cyber threats and vulnerabilities targeting users worldwide.
Visit Site →Microsoft's digital note-taking application included in the Office Professional suite, allowing users to organize notes, clip web content, and collaborate across devices.
Visit Site →Microsoft's collaboration and communication platform that integrates chat, video meetings, file sharing, and productivity tools for teams and organizations.
Visit Site →An autonomous data analysis agent developed by NVIDIA's Kaggle Grandmasters LLM Agent Research Team, designed for dataset exploration, multi-step reasoning, tool calling, and iterative data analysis on tabular data. It achieved state-of-the-art performance on the DABStep benchmark with capabilities including exploratory data analysis, tabular data Q&A, predictive modeling, and forecasting.
Visit Site →An Israeli AI agent startup that provides a customer service AI agent platform tailored for non-English-speaking markets across telecom, finance, healthcare, and manufacturing, with fine-tuning for language, cultural norms, and regulatory environments.
Visit Site →A Gemini-powered conversational feature within Google Maps that lets users ask complex, natural language questions about places, routes, and trip planning, returning personalized answers with directions, ETAs, and community tips.
Visit Site →An orchestration layer developed by QuTwo that enables enterprises to shift AI workloads between classical, quantum-inspired, and quantum computing environments, supporting hybrid computing with flexible algorithm and chip routing.
No URLAn AI-powered sales automation platform that deploys autonomous AI agents to monitor accounts, research prospects, and update CRM software, functioning as an intelligent revenue operating system that integrates with tools like Salesforce and Zendesk.
Visit Site →An AI dating assistant by Bumble that acts as a personal matchmaker, learning users' values, relationship goals, communication style, and dating intentions through private chats to recommend more relevant matches.
Visit Site →A no-code AI agent builder platform that enables non-technical employees to create and deploy autonomous AI agents for automating complex, multistep business tasks. It is model-agnostic, supporting multiple AI providers including OpenAI, Gemini, and Anthropic.
Visit Site →An AI-powered conversational feature within Google Maps that uses Google's Gemini models to let users ask natural language questions about destinations, get personalized location recommendations, and plan itineraries based on data from over 300 million places.
Visit Site →A 3D navigation redesign for Google Maps that provides an immersive driving experience with detailed road features like lanes, crosswalks, traffic lights, and realistic 3D renderings of surrounding buildings and terrain.
Visit Site →A 4K AI-powered streaming camera by Obsbot that features gimbal-based 360-degree facial tracking, allowing it to automatically follow a subject's face during livestreams, and can be operated via remote control.
Visit Site →A touchscreen video switching monitor by Obsbot that enables multi-camera livestreaming by combining up to seven camera streams at once, allowing users to cut between shots and angles for platforms like Twitch and YouTube.
Visit Site →A visual automation platform (formerly Integromat) that allows users to connect apps and automate workflows without coding, supporting integrations with AI tools and services.
Visit Site →A product from Superhuman, the AI-powered email productivity platform, designed to integrate with workflow and automation tools for enhanced productivity.
Visit Site →An emotion-detection AI software company that analyzes facial expressions and vocal cues to understand human emotions. It was founded by Rana el Kaliouby and sold in 2021.
Visit Site →A digital identity verification system by Tools for Humanity that uses iris scans from the Orb device to create unique, encrypted digital IDs, enabling proof-of-personhood in an AI-driven internet.
Visit Site →A blockchain-based open payment protocol developed by Coinbase and Cloudflare that enables automated computer programs and AI agents to transact with each other directly online without human intervention at each step.
Visit Site →An industry-standard benchmarking tool by Primate Labs used to measure computing performance, producing single-core and multi-core scores for comparing devices like laptops and smartphones.
Visit Site →Nvidia's Deep Learning Super Sampling technology version 5, an AI-powered rendering tool that uses neural networks to upscale lower-resolution images in real time for improved gaming performance and visual quality.
No URLAn early open-source experimental AI agent that attempted to chain together GPT-4 calls autonomously to accomplish complex tasks without continuous human intervention.
Visit Site →An early open-source prototype AI agent framework that demonstrated autonomous task management by using language models to create, prioritize, and execute tasks.
Visit Site →A lightweight alternative to OpenClaw that aims to reduce the complexity of AI agent systems down to a specific useful feature set.
Visit Site →A minimalist variant of the OpenClaw agent framework designed to strip down agent complexity to essential features.
No URLA simplified alternative to OpenClaw that reduces the overall complexity of AI agent systems to a focused feature set.
No URLA security-focused alternative to OpenClaw that emphasizes self-hosting capabilities for AI agent deployments.
Visit Site →A self-hosted AI agent platform designed to bring enhanced security to OpenClaw-style agent systems.
Visit Site →A security-oriented AI agent framework that provides self-hosting options as an alternative to OpenClaw.
Visit Site →A security-focused variant of OpenClaw-style AI agent systems that emphasizes self-hosting and secure deployment.
No URLCustom AI agents built into the Notion workspace platform, leveraging the context of a company's Notion data to perform tasks and answer questions within the productivity tool.
Visit Site →Brave's independent search engine that provides web search capabilities and is available as a search provider integration for AI tools and agents.
Visit Site →An open blueprint by NVIDIA for building AI agents that reason over enterprise and web data to deliver well-cited responses. It features a modular multi-agent architecture with planner, researcher, and orchestrator components for deep research workflows.
Visit Site →An open-source toolkit by NVIDIA for building AI agent workflows, providing config-driven composition of LLMs and tools, function registration, evaluation, and the ability to plug in different agent graphs.
No URLA component of the LangChain framework designed for building multi-agent architectures with subagent middleware, supporting complex multi-phase planner-researcher-orchestrator workflows.
No URLA browser-based personal WordPress workspace that lets users set up and use WordPress entirely in the web browser without hosting, domain registration, or sign-up. Sites are private by default and include an App Catalog with tools like a Personal CRM, RSS Reader, bookmarking tool, and AI Workspace.
Visit Site →An open source project that enables one-click WordPress installation on any device, powering browser-based WordPress instances and integrating with AI tools to create and modify plugins and tools.
Visit Site →A geo-tagged time series dataset created by Google Research that was built by using Gemini to analyze 5 million news articles to identify and catalog 2.6 million flood events worldwide, serving as training data for flash flood prediction models.
Visit Site →Google's platform for sharing flood forecasting data and highlighting flood risks for urban areas in 150 countries, providing information to emergency response agencies worldwide.
Visit Site →An AI assistant by Ford for its Pro commercial fleet customers that monitors and analyzes vehicle data points including fuel consumption, seatbelt use, vehicle health, idle times, and driver behavior to help fleet owners optimize operations.
Visit Site →An agentic AI platform that automates customer service interactions, supporting over a billion monthly customer interactions for companies like Upwork, Grammarly, and Airtable. It was acquired by Zendesk in 2026.
Visit Site →A code review tool by Cognition that provides a reimagined interface for understanding complex pull requests, designed to build developer comprehension and help identify low-quality code.
Visit Site →Nvidia's AI agent software suite that provides tools for building, customizing, and deploying AI models and agents, integrating with the NemoClaw platform.
Visit Site →A startup developing an AI-powered loan origination system that competes with legacy lending software providers by incorporating AI into the lending workflow.
Visit Site →A startup building an AI-infused loan origination system aimed at modernizing the lending process for financial institutions.
Visit Site →The U.S. Department of Defense's secure enterprise platform for generative AI, providing military personnel access to large language models and AI tools within government-approved cloud environments for tasks like research, document drafting, and data analysis.
Visit Site →A wearable hardware device created by Memories.ai for recording video data used to train their large visual memory model, designed specifically for efficient data collection rather than consumer use.
Visit Site →NVIDIA's next-generation AI upscaling software for gaming that uses deep learning to enhance visual fidelity by infusing pixels with photorealistic lighting and materials, bridging the gap between rendering and reality.
Visit Site →A mutable, S3-like object storage service on the Hugging Face Hub designed for ML workflows, enabling fast syncing of checkpoints, optimizer states, processed shards, and other intermediate training artifacts without version control overhead.
Visit Site →Hugging Face's chunk-based storage backend that deduplicates content across files, reducing bandwidth, speeding up transfers, and lowering storage costs for ML artifacts that share overlapping data.
No URLA standardized benchmark by NVIDIA for evaluating speculative decoding performance, featuring qualitative and throughput splits to assess draft model performance across prompt complexities and context lengths.
Visit Site →Amazon's healthcare AI assistant available on Amazon.com and the Amazon app that can answer health questions, explain health records, manage prescription renewals, book appointments, and connect users with One Medical providers in a HIPAA-compliant environment.
No URLAn API platform that provides AI agents with their own email inboxes, supporting two-way conversations, parsing, threading, labeling, searching, and replying. It serves as an identity layer for AI agents, enabling them to use email services the same way humans do.
Visit Site →A deepfake detection tool by YouTube that identifies AI-generated simulated faces in uploaded videos, allowing creators and public figures to request removal of unauthorized AI-generated content that violates platform policy.
No URLZoom's AI assistant that helps users with meeting summaries, notes, questions, and transcriptions across web and desktop platforms, with monthly active users more than tripling year-over-year.
Visit Site →A Zoom-owned employee communication platform that is gaining an AI assistant capable of connecting to services like Slack, Salesforce, ServiceNow, Gmail, Outlook, Asana, and Jira to answer questions across different knowledge bases.
Visit Site →Google's document editing platform enhanced with Gemini AI features including 'Help me create' for generating fully formatted first drafts, 'Help me write' for refining content, 'Match writing style' for tone consistency, and 'Match the format' for mirroring document structures.
Visit Site →A smart ring wearable by Sandbar focused on AI-powered note-taking, featuring a proximity-tuned microphone activated via a touch panel, an AI assistant in the companion app, and support for iterative voice-based tasks and media controls.
Visit Site →A widely used open-source library by Hugging Face for model post-training using reinforcement learning techniques, including support for synchronous and asynchronous RL training workflows.
Visit Site →A Hugging Face library that enables easy distributed training and mixed-precision support across multiple GPUs and TPUs, providing foundational infrastructure for sequence parallelism and other training optimizations.
Visit Site →An IO-aware exact attention algorithm that tiles computation to avoid materializing the full attention matrix, significantly reducing memory usage and speeding up transformer training for long sequences.
Visit Site →A deep learning optimization library by Microsoft that enables efficient distributed training and inference, including Ulysses Sequence Parallelism for long-context training across multiple GPUs.
Visit Site →An autoregressive Vision-Language-Action (VLA) policy that uses Frequency-space Action Sequence Tokenization (FAST) to generate discretized action tokens for robot control, based on a Gemma 300M action expert.
Visit Site →A simulation platform by NVIDIA for robot learning that provides environments for training and evaluating robotic policies, now integrated with the LeRobot framework.
Visit Site →An open-source distributed computing framework that provides orchestration primitives for scaling AI and Python workloads, widely used in reinforcement learning training pipelines.
Visit Site →An inference-time technique from Physical Intelligence that makes flow-matching robotic policies more responsive by continuously blending new predictions with in-progress actions for smoother real-world robot behavior.
No URLA feature within LeRobot that enables loading simulation environments directly from the Hugging Face Hub, simplifying the setup of robot training environments.
Visit Site →An IBM model designed for risk detection in AI deployments, recommended for pairing with Granite speech models in production environments requiring safety guardrails.
Visit Site →Anthropic's AI-powered code review tool that automatically analyzes pull requests on GitHub, identifies logical errors, and provides actionable feedback using a multi-agent architecture. It is designed for enterprise teams to manage the increased volume of AI-generated code.
No URLAn AI security startup that develops open source tools for testing security vulnerabilities in large language models, including automated red-teaming and agentic workflow evaluation. It was acquired by OpenAI to be integrated into its enterprise platform.
Visit Site →Neura Robotics' robotic simulation and training platform used to test, simulate, and fine-tune cognitive robots in virtual environments before real-world deployment.
No URLQualcomm's edge AI processor series designed specifically for autonomous mobile robots and humanoid robots, providing the computational foundation for physical AI applications.
No URLA British AI infrastructure company offering vertically integrated AI compute services, from energy and data centers to compute orchestration software, enabling large-scale AI workloads across Europe, North America, and Asia.
Visit Site →Apple's professional video editing software for macOS and iPadOS that includes AI-powered features such as automatic captions and intelligent editing tools.
Visit Site →A popular note-taking app for iPad that includes AI-powered features such as Magic Pen for doodle recognition and on-device AI image generation.
Visit Site →An AI-powered mobile scanning app for iPhone and iPad that uses intelligent edge detection to digitize documents, with features including color correction, noise removal, OCR text recognition, object counting, and measurements. Scans can be saved in multiple formats including PDF, JPG, DOC, XLS, PPT, and TXT.
Visit Site →An AI-powered presentation creation tool that transforms topics, prompts, documents, notes, or links into polished, professional slide presentations with designer-made layouts, visuals, and copy. It offers over 100 templates and supports export to PPTX or direct presentation from the platform.
Visit Site →An AI agent system announced by Elon Musk that combines Tesla's real-time video processing AI with Grok's reasoning capabilities to operate a computer like a human, processing continuous screen video rather than screenshots to perform office tasks autonomously.
No URLA website that tracks and documents AI-related job losses and layoffs across industries, providing data on workforce displacement attributed to artificial intelligence.
Visit Site →An open-source machine learning auto-research tool created by Andrej Karpathy that can autonomously conduct machine learning research and iteratively improve code on a home computer.
Visit Site →A Y Combinator-backed compliance automation platform that ingests compliance information and provides auditors with access to that data, aiming to streamline privacy and security regulatory compliance processes.
Visit Site →An AI gaming startup co-founded by Elliot Wolf that builds immersive mystery games featuring AI assistants to help players gather clues and solve crimes, partnered with NBCUniversal's Peacock platform.
Visit Site →A Tesla-developed AI agent designed to complement xAI's Macrohard project, intended to perform digital tasks directed by xAI's language model, named as a reference to Tesla's Optimus humanoid robot.
No URLMicrosoft's free, open-source code editor widely used by developers, featuring extensions and integrations for AI-assisted coding workflows.
No URLAn open-source project by Andrej Karpathy that automates machine learning research by handing the iterative training loop of a small language model to an AI agent, which autonomously experiments with architecture, hyperparameters, and optimization strategies to minimize loss.
Visit Site →A Microsoft AI-powered memory feature for Copilot+ PCs that captures and indexes snapshots of user activity to enable searching through past on-screen content.
Visit Site →A caller identity and spam detection platform that uses AI and community-based reports to identify fraudulent calls, offering features including AI-powered voicemail with call transcription summaries and family protection tools that allow remote call management.
Visit Site →A revenue intelligence platform that uses AI to analyze customer interactions across calls, emails, and meetings, providing insights to help sales teams improve performance and close more deals.
Visit Site →A revenue operations platform that uses AI to provide visibility into pipeline health, forecast accuracy, and deal activity, helping sales teams manage and predict revenue outcomes.
Visit Site →An AI sales development platform that creates autonomous digital workers to automate outbound sales tasks such as prospecting, lead qualification, and outreach.
Visit Site →An AI sales development platform that builds AI-powered digital workers (called Artisans) to automate outbound sales processes including lead research, email outreach, and follow-ups.
Visit Site →An open-source workflow automation platform that allows users to connect various services and build automated workflows, serving as an alternative to proprietary automation tools.
Visit Site →A specialized AI agent builder platform designed for enterprises, enabling teams to create and deploy custom AI assistants and agents for various business workflows.
Visit Site →An AI-powered streaming camera by Obsbot with intelligent face-tracking capabilities, designed for professional livestreaming and content creation setups.
Visit Site →A 4K webcam by Obsbot designed for livestreaming and video calls, offering a more affordable entry point into the company's camera ecosystem at $179.
Visit Site →Microsoft's version of Anthropic's Claude-powered collaborative coding tool integrated with Microsoft 365, combining Anthropic's Claude capabilities with Microsoft's productivity suite.
Visit Site →A computer vision AI platform that mounts cameras on public vehicles to capture and analyze images of buildings and neighborhoods, helping local governments identify urban blight, code violations, graffiti, illegal dumping, and storm damage.
Visit Site →An OpenAI plugin that integrates ChatGPT's capabilities directly into Microsoft Excel, enabling enterprise customers to create and analyze spreadsheets using the latest GPT models.
Visit Site →A command-line interface released by Google on GitHub that enables developers to integrate third-party AI agents like OpenClaw into Google Workspace services such as Gmail and Google Drive.
Visit Site →OpenAI's coding agent environment that can work on entire projects at once, creating and managing multiple files within a folder structure for complex software development tasks.
No URLOpenAI's autonomous software development environment available on Mac and Windows that can perform complex coding tasks including web searches, data gathering, and building interactive applications in a single prompt.
No URLA benchmark suite (Bench I and Bench II) for evaluating deep research AI agents, measuring report quality, comprehensiveness, factual correctness, and analytical rigor across fine-grained rubrics.
Visit Site →A web search API designed for AI agents and LLM applications, enabling real-time web search capabilities within agentic workflows.
Visit Site →A search API service that provides Google search results programmatically, used in AI agent pipelines for academic paper search and web retrieval.
Visit Site →An open-sourced dataset of research questions used for supervised fine-tuning of deep research AI models, providing training trajectories for search-and-synthesis workflows.
Visit Site →An AI-powered platform that uses artificial intelligence to streamline hiring and talent matching, part of the emerging wave of AI-native productivity tools.
Visit Site →A screenwriting software platform that integrates AI imagery and community sharing tools to transform the screenwriting process for writers and filmmakers.
Visit Site →NVIDIA's inference microservices platform that provides optimized, containerized AI model serving for deploying large language models and other AI models in production environments.
Visit Site →Amazon's AI-powered shopping assistant that helps customers discover products, get recommendations, and navigate shopping experiences across Amazon's platform and third-party merchant sites.
Visit Site →A product feed management platform that helps merchants syndicate their inventory, pricing, and catalog data to marketplaces and retail partners like Amazon.
Visit Site →A product experience management platform that enables brands and merchants to manage and distribute product content and data across digital commerce channels.
Visit Site →An e-commerce integration platform that provides product feed management and multichannel selling solutions, connecting merchants' inventory data to marketplaces like Amazon.
Visit Site →Tesla's autonomous driving AI system that processes continuous video from car cameras in real time, trained on over 10 billion miles of driving data using the AI4 chip.
Visit Site →A fast JavaScript runtime and toolkit that has integrated AI-powered code review tools into its development workflow.
Visit Site →A node-based visual workflow interface that allows users to wire together Modular Diffusers blocks into custom diffusion pipelines through a graphical interface.
Visit Site →A feature within Cursor that enables automatic launching of coding agents triggered by events such as codebase changes, Slack messages, or timers, allowing engineers to move beyond manual prompt-and-monitor workflows.
Visit Site →A Cursor feature that automatically reviews new code additions for bugs and issues every time an engineer commits to the codebase, serving as a predecessor to Cursor's broader Automations system.
Visit Site →An enterprise AI platform that uses large action models to automate complex, multistep workflows across enterprise systems, allowing users to interact with it conversationally to execute tasks.
Visit Site →An AI-native procurement automation platform that deploys AI agents to execute end-to-end enterprise procurement workflows, including document reading, supplier evaluation, negotiation, and transaction completion.
Visit Site →A YC-backed startup that uses AI voice agents to conduct customer interviews and produce consultancy-quality commercial due diligence research for private equity firms at a fraction of traditional costs.
Visit Site →An AI agent-powered platform by AWS designed to help healthcare organizations automate administrative tasks such as appointment scheduling, documentation, patient verification, and medical coding. It is HIPAA-eligible and integrates with electronic health record software.
Visit Site →Google's AI-powered search feature that generates summarized answers directly in search results, often reducing the need for users to click through to external websites.
Visit Site →A Python client library for interacting with the Hugging Face Hub, supporting operations like creating buckets, syncing files, batch uploads, selective downloads, and programmatic management of ML artifacts.
Visit Site →A JavaScript client library (since v2.10.5) for integrating Hugging Face Hub functionality, including Bucket support, into Node.js services and web applications.
Visit Site →A subscription management platform used by over 75,000 app developers to manage in-app transactions across iOS, Android, and web, processing more than 1 billion transactions and generating over $11 billion in annual developer revenue.
Visit Site →YouTube's automated content identification system that detects copyright-protected material in users' uploaded videos, enabling rights holders to manage and monetize their content across the platform.
Visit Site →A toggle-on mode within ChatGPT designed by OpenAI to function as an AI tutor, encouraging Socratic-style learning by guiding students through problems rather than providing direct answers.
Visit Site →A feature within Google Search's AI Mode that allows users to organize projects, draft documents, create custom tools, and generate shareable apps or games by describing ideas to Gemini. It supports pulling information from the web and Google's Knowledge Graph.
Visit Site →A misinformation watchdog service that uses journalists and AI-driven analysis to rate the credibility of news sources and track the spread of false claims online.
Visit Site →A group chat app by BuzzFeed's Branch Office spin-off that offers AI-powered photo editing and creation features, combined with an editorial library of online trends and memes to inspire user-generated content.
Visit Site →A daily photo-sharing app by BuzzFeed's Branch Office that prompts users to take photos based on creative prompts and incorporates AI features, described as having an 'AI spirit for a CEO.'
Visit Site →A reinforcement learning algorithm (Group Relative Policy Optimization) that uses group-relative advantages instead of a value function, requiring multiple rollouts per prompt for training language models.
Visit Site →A digital health AI startup that serves as the first disclosed partner for AMI Labs' world models, focused on applying AI to healthcare applications.
Visit Site →An image editing application for Apple platforms that leverages machine learning for features like automatic photo enhancement, object removal, and intelligent editing tools.
Visit Site →A representation alignment technique used in diffusion model training that aligns internal model representations with a teacher model to improve convergence speed and image quality.
Visit Site →A token routing method for transformer-based diffusion models that randomly selects a fraction of tokens to bypass contiguous transformer blocks, reducing computational cost per training step while maintaining output quality.
Visit Site →An online detection tool that analyzes images to determine whether they were generated by AI or created by a human, with approximately an 80 percent success rate.
Visit Site →Google's device and item tracking network for Android, previously known as Find My Device, that enables real-time location sharing and tracking of personal items and luggage with partnered airlines.
Visit Site →A Google DeepMind AI system that enhanced the efficiency of Google's data centers, chip design, and AI training processes, including training the large language models underlying itself.
Visit Site →A mobile app that allows users to run open-weight AI language models directly on their iPhone without an internet connection. It supports multiple models including Qwen, Gemma, and Llama, and offers features like custom instructions, temperature adjustment, and Siri shortcut integration.
Visit Site →An xAI project focused on AI-assisted coding capabilities, led by co-founder Zhiheng Dai before his departure from the company.
No URLAn AI-powered feature by Ring that uses Ring camera footage from nearby homes to help locate lost dogs by crowdsourcing neighborhood surveillance data.
No URLAn evidence management platform operated by Axon that enables law enforcement agencies to store, manage, and share digital evidence including body camera footage.
Visit Site →A company that operates AI-powered license plate readers used by law enforcement and communities for public safety and vehicle identification purposes.
Visit Site →OpenAI's command-line coding agent that allows developers to interact with AI models for code generation and software development tasks from the terminal.
Visit Site →A Y Combinator-backed AI-native customer support agency that replaces traditional customer service teams at startups by combining purpose-built AI software with human oversight to handle tickets across email, chat, social media, voice, and other channels.
Visit Site →An open protocol developed by Anthropic that standardizes how AI models connect to external data sources and systems, reducing the need for custom integrations.
Visit Site →A Google-powered AI shopping agent that works on top of Google's existing price-tracking tools, enabling users to set payment methods and shipping addresses so the agent can automatically complete purchases on retailer websites.
Visit Site →A feature within the Perplexity AI search platform that allows users to research and purchase products without leaving the interface, storing user details for AI-assisted future purchases.
No URLAn AI-powered calendar management platform that analyzes a user's task list, recurring habits, and existing commitments to automatically block out focus time, enforce travel buffers, and resolve meeting conflicts in real time.
Visit Site →A skills-based assessment platform offering a wide library of skill tests covering language proficiency, software knowledge, and more, used to evaluate job candidates based on demonstrated ability rather than credentials.
Visit Site →A technical assessment platform focused on coding and technical evaluations for hiring, featuring AI literacy assessments alongside traditional programming skill tests.
Visit Site →An AI-powered email client that combines traditional email management with AI assistance, offering a free tier with an AI bot that can help users organize, sort, and manage their inbox efficiently.
Visit Site →A surveillance assessment tool created by Gen Z For Change that lets individuals see how government and Big Tech entities are collecting their data through AI-powered surveillance. It analyzes digital habits and app usage to generate a personalized risk profile without requiring personally identifiable information.
Visit Site →A prompt composition workspace that helps users build, organize, refine, and export structured prompts for use with various AI tools. It includes 10,000+ premium prompts, reusable 'VibeCards,' and style palettes for consistent AI prompt engineering across platforms.
No URLAn AI-powered upscaling feature built into Hisense TVs that enhances lower-resolution content to near-4K quality for improved viewing.
Visit Site →OpenAI's autonomous AI agent platform that allows users to give powerful language models access to their systems to perform tasks like managing email, calendars, flights, and more complex autonomous workflows on their behalf.
Visit Site →A Chinese AI company co-founded by Felix Tao that is exploring personal AI agent capabilities and building projects in the agentic AI space.
Visit Site →A Chinese AI company offering cloud-based AI agent services within its proprietary applications.
No URLA productivity suite from Microsoft that includes Word, Excel, Outlook, PowerPoint, and OneNote, updated with AI capabilities such as text and formatting suggestions, data analysis, trend identification, and chart-building assistance.
No URLA personal AI agent developed by Lenovo that is being rolled out across more than 20 Lenovo devices, designed to provide intelligent assistance integrated into Lenovo's hardware ecosystem.
Visit Site →Microsoft's platform for accessing and deploying AI models, including third-party models like Anthropic's Claude, within the Microsoft ecosystem. It serves as a hub for enterprises to integrate AI capabilities into their workflows.
No URLAmazon's cloud computing platform that provides a wide range of services including AI and machine learning tools, and hosts third-party AI models like Anthropic's Claude through Amazon Bedrock.
No URLA desktop application by Anthropic that provides access to Claude AI models and supports Model Context Protocol (MCP) integrations, allowing it to connect with external services and tools.
Visit Site →An AI-powered children's entertainment company that acquires and scales iconic kids' IP into global franchises using proprietary AI tools and a digital-first, multi-platform content strategy.
Visit Site →A large-scale monocular depth estimation model that extracts depth maps from images, available on Hugging Face Hub under the depth-anything organization.
Visit Site →An open-source robotics framework on Hugging Face for recording robotic datasets, training robot control policies, and deploying them on real robots.
Visit Site →An incident management platform used for alerting and on-call scheduling, which can integrate with Cursor's Automations to trigger AI agents for automated incident response.
Visit Site →A cloud-based procurement and supply chain management platform by SAP that helps enterprises manage purchasing, invoicing, and supplier relationships.
Visit Site →A HIPAA-eligible service by AWS that uses FHIR-based infrastructure to store, transform, query, and analyze health data at scale for healthcare and life sciences organizations.
Visit Site →A bioinformatics service by AWS that helps healthcare and life sciences organizations store, query, and analyze genomic and other omics data at scale.
Visit Site →A startup that uses AI-powered interviews and synthetic consumer personas to conduct market and consumer research for businesses.
Visit Site →An AI-powered consumer research platform that conducts automated interviews to gather qualitative insights for businesses and researchers.
Visit Site →A crowdsourced outage tracking platform that monitors and reports real-time status and outages for websites, apps, and online services. It is owned by Ookla, a subsidiary of Ziff Davis.
Visit Site →A digital intelligence platform that provides web analytics, traffic data, and market insights for websites and apps.
Visit Site →OpenAI's AI agent platform that allows users to assign complex tasks which the agent autonomously completes by browsing the web and interacting with websites on the user's behalf.
No URLPalantir's AI-powered military targeting and intelligence system used by the U.S. Department of Defense for real-time target identification, location coordination, and prioritization during military operations.
No URLA customer messaging and support platform that incorporates AI-powered agents and automation to handle customer inquiries and streamline support workflows.
Visit Site →An upgraded version of Meta's Ray-Ban smart glasses featuring a Neural Band interface and enhanced AI assistant integrations for augmented reality and always-on AI capabilities.
No URLFigma's Model Context Protocol (MCP) server that enables AI coding tools like Codex and Claude Code to access and interact with Figma design files, bridging the gap between design and code workflows.
Visit Site →A London-based workflow orchestration startup that maps complex corporate environments using knowledge graphs to provide AI agents with the context they need to scale effectively within enterprises.
Visit Site →A feature in Mozilla Firefox 148 that provides users with an AI killswitch toggle, allowing them to completely disable all AI integrations and features built into the browser, including chatbots, AI-powered link reviews, and smart tab group suggestions.
Visit Site →A privacy-focused VPN service by Proton AG that offers both free and paid tiers, featuring strong encryption and a strict no-logs policy.
Visit Site →A VPN service known for its extensive server network spanning numerous countries, offering strong privacy protections and anonymous browsing capabilities.
Visit Site →A widely used VPN service recognized for its robust security features, including double encryption and a strict no-logs policy for secure online browsing.
Visit Site →A user-friendly VPN service designed for beginners, offering easy setup, strong encryption, and access to a large network of servers worldwide.
Visit Site →A VPN service that supports unlimited simultaneous device connections, offering strong privacy features and competitive pricing for multi-device users.
Visit Site →A security plugin for Claude Code by Anthropic that scans codebases for security vulnerabilities and suggests patches, helping developers find and fix security issues that traditional methods often miss.
No URLA feature for Claude Code by Anthropic that allows developers to start coding tasks in their terminal and continue monitoring and controlling sessions remotely from a phone via the Claude app or web interface.
No URLA Google creative tool that integrates AI image generation capabilities, now using Imagen 4 as its default image generator with zero credit cost for image generation within the platform.
No URLLearned Perceptual Image Patch Similarity is a perceptual loss metric that measures low-level perceptual similarity between images, commonly used as an auxiliary training objective in image generation models.
Visit Site →A token routing technique for diffusion transformers that offers compute savings through selective token processing, serving as an alternative to TREAD with slightly more complexity.
Visit Site →A viral AI-powered calorie tracking app built by teens that uses artificial intelligence to estimate nutritional information, recently acquired by MyFitnessPal.
Visit Site →Atlassian's project management platform Jira now includes an 'Agents in Jira' feature that allows users to assign tasks and tickets to AI agents, track their progress, and manage both human and AI agent work from a single dashboard.
Visit Site →An AI-powered HR tech startup focused on automating human resources functions such as recruiting, compensation policy setting, and performance review design, combining AI agents with forward-deployed HR executives to serve as an extension of existing HR teams.
Visit Site →An India-founded AI marketing platform that uses a network of AI agents to automatically generate search-optimized content, build backlinks, and track inbound leads, helping businesses surface in both traditional and AI-driven search results.
Visit Site →An Alphabet-owned robotics software company that builds AI models and software designed to make industrial robots more accessible, now operating within Google and collaborating with Google DeepMind.
Visit Site →An AI-powered health coaching agent built into CUDIS smart rings that uses generative AI to create tailored fitness programs, daily tasks, recovery protocols, and supplement recommendations for users.
Visit Site →A Google-developed AI search feature on Android devices that allows users to circle, highlight, or tap on-screen content to instantly search for information, now expanded with multi-object recognition.
Visit Site →An AI tool known for its ability to manage everyday tasks such as sending emails, managing calendars, checking into flights, and other routine digital activities on behalf of users.
No URLSamsung's unified multimedia creation tool on Galaxy S26 phones that combines AI photo editing, enhancement, and generation capabilities into a single application for capturing and editing content.
Visit Site →An AI-powered customer support startup that uses generative AI to automate and enhance enterprise customer service operations.
Visit Site →An AI-powered customer support platform that automates contact center interactions using conversational AI across voice and text channels.
Visit Site →An AI-powered customer experience platform that enables businesses to deploy conversational AI agents for customer support and engagement.
Visit Site →An AI startup focused on AI-assisted software development, known for its autonomous coding agent Devin that aims to handle complex engineering tasks.
Visit Site →Apple's smart home platform that allows users to control and automate compatible smart home accessories using Apple devices and Siri voice commands.
Visit Site →A smart thermostat by Google that uses built-in AI automations to adjust home temperatures based on occupancy, time of day, and user preferences.
Visit Site →AI agents embedded within Notion's productivity platform that let users select specific AI models for different tasks and set up multiple custom agents configured for distinct administrative functions within existing workflows.
Visit Site →An open-source framework by Microsoft that uses event-driven architectures to orchestrate multiple specialized AI agents, allowing distinct agent personas to communicate, share memory, and execute code in isolated environments.
Visit Site →An AI-powered recruitment platform that parses and ranks resumes, extracting relevant details and scoring candidates against job descriptions to streamline the hiring process.
Visit Site →An AI-driven hiring platform that uses semantic matching and resume parsing to screen and rank job applicants, helping recruiters identify top candidates more efficiently.
Visit Site →An AI-powered video interview platform used by large employers that handles both recorded and live interview formats and generates AI-driven candidate assessments for hiring teams.
Visit Site →An AI-powered interview platform that uses conversational AI to simulate natural back-and-forth interviews with candidates at scale, incorporating behavioral science frameworks into its analysis.
Visit Site →A hiring assessment platform that uses neuroscience-driven games to measure cognitive and emotional traits, then matches candidates to roles based on the resulting data.
Visit Site →A workflow automation platform that connects different apps and services, enabling users to create automated workflows for tasks like email management, though it has a significant learning curve.
Visit Site →An AI-powered email management tool designed to help users automate email organization and achieve a clean inbox through automated processing and sorting.
Visit Site →Anthropic's developer platform for building and managing applications powered by Claude's AI models, providing tools for API access and configuration.
Visit Site →Asus' AI media management application that supports direct GoPro Cloud downloads and 360-degree video editing for content creators.
No URLGoogle's vibe-coding app that lets users create mini web apps and automated workflows using natural language prompts without writing code, powered by the Gemini 3 Flash model.
Visit Site →An AI chatbot integrated into the Oura Ring app that delivers personalized health insights by analyzing biometric data including sleep, activity, cycle, and stress metrics. It now features a proprietary AI model focused specifically on women's health across the full reproductive spectrum.
Visit Site →Anthropic's enterprise initiative that provides pre-built AI agent plug-ins for common business functions including financial research, engineering specifications, legal, and HR tasks, deployable through a centralized administration system with enterprise-grade controls.
Visit Site →A no-code platform by New Relic that enables enterprises to build, deploy, and manage AI agents focused on data observability, allowing automated monitoring to catch bugs and issues before they disrupt products. It supports the Model Context Protocol (MCP) for connecting to external data sources.
Visit Site →An AI-powered web search platform that employs AI agents to search the web in real time, verify and validate results, and structure information into queryable database tables for enterprise use. It integrates with data warehouses like Databricks and Snowflake to provide live, structured web data within existing enterprise data environments.
Visit Site →A compression technology by Multiverse Computing, inspired by quantum computing, that significantly reduces the size of large language models while maintaining near-original accuracy and performance.
Visit Site →An AI-powered job hunting automation tool that tailors résumés to specific job postings, generates ATS-friendly cover letters, tracks applications, and builds interview practice questions from job listings.
Visit Site →A desktop application for Mac and Windows that automatically converts bitmap images (JPG, PNG, scans, screenshots) into editable vector graphics in formats like SVG, AI, PDF, and DXF. It uses intelligent edge detection and line-tracing to rebuild images as clean, scalable vector paths.
Visit Site →An AI-powered software testing agent by LambdaTest that allows users to generate and run automated tests for web and mobile applications using natural language descriptions instead of writing test code. It can generate test cases from various inputs including Jira tickets, PDFs, spreadsheets, images, audio, and video, and supports conditional logic, API testing, and enterprise-level complexity.
Visit Site →A market intelligence and analytics platform that provides data on app store rankings, downloads, revenue, and usage trends for mobile applications.
Visit Site →An AI-powered news aggregation app built by former Twitter engineers that uses vector embeddings and AI to curate news stories, extract relevant podcast clips, and provide personalized news feeds with AI chatbot features.
Visit Site →An AI-powered feature within Spotify that allows Premium subscribers to create custom playlists by describing what they want to listen to in natural language, generating tailored playlists based on listening history and cultural trends.
Visit Site →A growth and marketing tool by Canva designed for asset creation, ad deployment, and performance measurement across platforms including Meta.
Visit Site →A professional creative editing suite acquired by Canva, offering tools for photo editing, vector graphics, and layout design, now available for free to all users.
Visit Site →An AI prompt optimization tool that transforms simple requests into expert-level, optimized prompts for leading AI models in under 15 seconds. It offers over 1,000 proven prompt templates, supports models like ChatGPT, Claude, Gemini, and Llama, and allows users to store, tag, and organize their favorite prompts.
Visit Site →An open-source AI personal assistant that runs locally on a device and has read-level access to a user's system, capable of autonomously performing tasks like managing email, scheduling, and coding. It later changed its name to Crosshairs.
No URLThe renamed version of Clawdbot, an open-source agentic AI assistant that runs locally on a device and can autonomously perform tasks on behalf of the user. Its inventor was subsequently hired by OpenAI.
Visit Site →The Model Evaluation and Threat Research Lab, a non-profit research organization that evaluates frontier AI models by benchmarking their ability to autonomously complete complex software engineering tasks compared to human experts.
Visit Site →An open-source backend-as-a-service platform popular among developers, providing database, authentication, and API services as an alternative to Firebase.
Visit Site →An internal AI agent developed by Block (formerly Square) designed to assist both technical and non-technical teams with tasks such as lead analysis, content asset management, and administrative work automation.
Visit Site →An AI-powered productivity and collaboration workspace that combines document creation, note organization, task planning, and visualization boards with built-in AI assistance for summarizing, rewriting, and refining ideas. It offers unlimited AI requests, real-time multiplayer editing, and shared workspaces.
Visit Site →An AI-powered writing assistant that offers grammar checking, style suggestions, and an 'Expert Review' feature that provides revision suggestions framed from the perspective of well-known subject matter experts and authors.
Visit Site →Amazon's custom-built machine learning chip designed to deliver high-performance, cost-effective training and inference for deep learning workloads in the cloud.
Visit Site →Nvidia's next-generation GPU architecture platform designed for AI training and inference workloads, succeeding the Blackwell architecture.
Visit Site →A web browser developed by Perplexity that integrates AI-powered features directly into the browsing experience.
No URLA benchmark developed by Perplexity for evaluating AI systems on complex research tasks, designed to measure deep research capabilities across different AI platforms.
No URLA Perplexity feature that allows users to query multiple AI models simultaneously and compare their outputs to obtain the best possible answers.
Visit Site →An AI-powered legal assistant built on large language models that helps legal professionals with research, drafting, and analysis of legal documents.
Visit Site →A free online chat platform by Alibaba that allows users to interact with Qwen models including Qwen 3.5 without needing to download or run them locally.
Visit Site →An AI-powered meeting notetaker that offers 'recipes' in the form of repeatable prompts to surface knowledge and extract actionable insights from meeting data.
Visit Site →Google's developer API for accessing Gemini models, enabling developers to integrate Google's AI capabilities including image generation into their applications.
Visit Site →A Tinder AI-powered feature being piloted in Australia that analyzes users' camera rolls and questionnaire answers to learn about their interests and personality, aiming to reduce swipe fatigue and suggest better matches.
Visit Site →An open technical standard developed by an industry coalition including Adobe, Microsoft, Google, OpenAI, and Meta for certifying the provenance and authenticity of digital content, including AI-generated media.
Visit Site →Figma's AI-powered design generation feature that allows users to create and iterate on designs using natural language prompts within the Figma platform.
Visit Site →Figma's collaborative whiteboarding and brainstorming tool that integrates with AI coding assistants to help teams move from ideation to implementation.
Visit Site →An open-source C/C++ inference engine originally created by Georgi Gerganov (GGML) that enables efficient local inference of large language models on consumer hardware. It is widely regarded as the fundamental building block for running AI models locally.
Visit Site →A tensor library written in C for machine learning, created by Georgi Gerganov, that serves as the underlying framework for llama.cpp and enables efficient on-device AI inference. Its GGUF format has become a widely adopted standard for quantized model distribution.
Visit Site →A PyTorch-based on-device inference framework developed by Meta that enables running AI models efficiently on edge devices. It has adopted GGML's GGUF format as a preferred default for on-device inference.
Visit Site →An AI-powered financial reporting platform that helps companies and accounting firms automate manual aspects of financial statement preparation, including math verification and formatting. It raised $14.5 million in Series A funding led by Norwest.
Visit Site →A five-week Google-run cohort program that gives independent filmmakers access to Google's suite of AI tools — including Gemini, Nano Banana Pro, and Veo — to produce short films.
No URLA U.S.-based AI chipmaker that designs and manufactures specialized hardware for AI computing, including wafer-scale processors optimized for training and inference of large-scale AI models.
Visit Site →A privacy-focused AI desktop assistant that runs entirely on the user's local computer without cloud connectivity, supporting chat, writing, coding, and document analysis offline with no subscription or token limits.
Visit Site →A cloud-based AI coding agent platform by Warp that lets users spin up multiple autonomous agents in isolated Docker containers, schedule them to run on cadences, and manage them from a central panel or via Slack and GitHub triggers.
Visit Site →An AI startup that built tools for complex agentic tasks, including its product Vy — a computer-use agent in the cloud capable of operating a remote Apple MacBook. The company was acquired by Anthropic and is shutting down its product.
Visit Site →A Google Pixel AI feature that provides context-dependent information surfacing, automatically presenting relevant details from emails or text messages when they come up during regular device use.
Visit Site →A Samsung Galaxy S26 AI feature that provides context-aware notifications and information surfacing, similar in concept to Google's Magic Cue, designed to proactively present relevant information to users.
Visit Site →An AI-powered software development platform that helps automate and accelerate the software building process.
Visit Site →A defense-oriented AI platform by Palantir Technologies that provides data analytics, targeting support, and intelligence capabilities for classified military networks and operations, integrated with partner AI models.
Visit Site →An open-source library by NVIDIA for designing and generating synthetic training data from scratch or from seed data, part of the NeMo microservices ecosystem.
Visit Site →An open-source library that provides approximately 2x faster LLM training and 60% less VRAM usage compared to standard fine-tuning methods, enabling cost-effective model training.
Visit Site →A fully managed cloud GPU service by Hugging Face that allows users to submit and run AI model training jobs on cloud infrastructure with monitoring capabilities.
Visit Site →A London-based startup building an on-device AI inference engine optimized for Apple Silicon, enabling developers to run AI models efficiently on phones and laptops with a simple SDK integration. Built in Rust, it claims up to 37% faster model generation speeds.
Visit Site →An AI workforce management platform that lets organizations manage, coordinate, and oversee AI agents across teams and departments, serving as a system of record for AI employees regardless of which tools built them.
Visit Site →An AI product by Reload that acts as a software architect agent, maintaining shared project-level context and system requirements across coding agents and sessions to keep development consistent. It integrates as an extension in AI code editors like Cursor and Windsurf.
No URLAn AI-powered search feature by Reddit that surfaces community-recommended products with interactive carousels, pricing, images, and direct purchase links based on discussions across the platform.
Visit Site →Smart glasses by Indian AI company Sarvam designed to bring on-device AI models directly to users, built and designed in India.
Visit Site →An AI-powered meeting notetaker and intelligence platform that automatically captures meeting notes, provides contextual insights, and includes a sales tool that predicts deal outcomes using CRM data from HubSpot and Salesforce.
Visit Site →An AI-powered customer support tooling platform that automates customer service tasks including voice AI communication, enabling support agents to take on higher-value roles like supervision and relationship building.
Visit Site →NVIDIA's global startup support program that provides AI startups with technical expertise, go-to-market support, and access to NVIDIA's computing infrastructure. The program supports over 4,000 startups in India alone and thousands more globally.
Visit Site →An all-in-one desktop PDF tool that enables users to convert, edit, merge, split, compress, and OCR-scan PDF documents, supporting conversions between PDF and formats like Word, Excel, PowerPoint, HTML, and JPG.
Visit Site →A feature within Google's Gemini app that allows users to preview and interact with code-generated outputs, such as web applications and visual designs, in a side window alongside the conversation.
Visit Site →An AI app-building tool created by former Replika founder that lets users create applications via natural language prompts.
Visit Site →An Accel-backed startup that provides AI-powered tools for building applications via natural language prompts.
Visit Site →Salesforce's AI agent platform that enables enterprises to build, deploy, and manage autonomous AI agents for business tasks across sales, service, marketing, and other functions within the Salesforce ecosystem.
Visit Site →OpenAI's AI agent platform that allows users to deploy autonomous agents capable of performing tasks on the web and within enterprise environments, serving as OpenAI's entry into the agentic AI infrastructure space.
Visit Site →An industry benchmark developed by IBM Research for evaluating AI agents on enterprise IT automation tasks including SRE, Security, and FinOps scenarios involving Kubernetes environments and incident triage.
Visit Site →A standardized taxonomy and diagnostic framework developed by IBM Research and UC Berkeley for analyzing and classifying failure modes in multi-agent AI systems, covering 14 distinct failure patterns across system design, inter-agent misalignment, and task verification categories.
Visit Site →An open-source Python library for building interactive machine learning demos and web applications, featuring components like gr.HTML that support custom templates, scoped CSS, and JavaScript interactivity in single-file deployments.
Visit Site →A hosting platform by Hugging Face that allows developers to deploy and share machine learning demos and applications built with frameworks like Gradio and Streamlit.
Visit Site →An education-focused version of OpenAI's ChatGPT designed for campus-wide deployment at academic institutions, providing AI tools for coding, research, analytics, and case analysis with responsible-use frameworks.
Visit Site →Version 5 of Hugging Face's Transformers library, an open-source framework for building and deploying machine learning models with simplified model definitions powering the broader AI ecosystem.
Visit Site →Google DeepMind's watermarking technology that embeds imperceptible identifiers into AI-generated content, including images, text, and music, to help detect and verify synthetic media.
Visit Site →A San Francisco-based AI marketing platform offering a suite of loosely coupled AI agents for data analysis, audience targeting, campaign management, customer engagement, media planning, and synthetic data generation for marketers.
Visit Site →Apple's in-car interface platform that integrates iPhone functionality into vehicle infotainment systems, now expanding to support voice-based AI chatbot apps like ChatGPT and Gemini.
Visit Site →NVIDIA's command-line interface tool for downloading model checkpoints, containers, and other AI resources from the NVIDIA GPU Cloud (NGC) catalog.
Visit Site →An AI-powered tool that gained viral attention for providing real-time assistance and overlays during interviews and other interactions.
Visit Site →A UK-based startup acquired by Canva that specializes in 2D motion animation tools for verticals including advertising, marketing, gaming, and generative art.
Visit Site →Google's web-based platform for developers to prototype and build applications using Google's Gemini AI models, offering tools for prompt engineering and API access.
No URLOpenAI's AI-powered coding agent designed to help developers write, review, and debug code autonomously within software development workflows.
No URLA cloud deployment platform by Kimi that enables one-click deployment of OpenClaw AI agents with 24/7 uptime, 40GB cloud storage, and instant access to over 5,000 ClawHub skills without requiring local hardware or manual setup.
Visit Site →An open-source (CC BY 4.0) synthetic data generation seed dataset by NVIDIA containing 6 million culturally accurate Japanese personas based on real-world demographics, geographic distribution, and personality traits, used to generate high-quality training data for language models.
Visit Site →An AI-powered platform that helps healthcare systems track and validate spending by integrating with existing ERP, contract management, and accounts payable workflows to flag invoice discrepancies and prevent overpayment, particularly for non-barcoded purchase services.
Visit Site →Infosys's enterprise AI platform that integrates large language models and AI capabilities to build agentic systems for automating complex enterprise workflows across industries such as banking, telecoms, and manufacturing.
Visit Site →A Paris-based serverless platform that simplifies AI application deployment at scale and manages the underlying infrastructure, allowing developers to deploy AI models without worrying about server management. It was acquired by Mistral AI in 2026.
Visit Site →A startup focused on cache optimization for AI model inference, working on memory management layers in the AI infrastructure stack to reduce costs and improve efficiency.
Visit Site →A built-in AI assistant for WordPress.com that understands a site's content and layout, allowing site owners to adjust styles, edit content, generate images, and modify layouts using natural language commands.
Visit Site →An Indian vibe-coding platform that enables non-technical users and small businesses to build production-ready mobile and web applications using natural language prompts, voice commands, and AI agents without prior coding experience.
Visit Site →Amazon's smart TV platform featuring AI-powered personalization for content recommendations, voice control via Alexa, and integrated streaming across Amazon's 4-Series and Omni lineups.
Visit Site →Amazon's 11-inch smart display powered by the AZ3 Pro chip, featuring Alexa integration, auto-framing camera for video calls, and spatial audio for smart home control and entertainment.
No URLAn AI-powered feature within Apple Music that generates 25-song playlists based on user text prompts, allowing further refinement through additional prompts and manual curation before saving with custom cover art and descriptions.
No URLA voice email tool that integrates directly with Gmail and Outlook, allowing users to record and send voice messages within their inbox with optional automatic transcription for recipients.
No URLAn email unsubscribe service that helps users manage and unsubscribe from mailing lists, though it faced controversy after being found to sell user data.
Visit Site →AMD's rack-scale AI infrastructure platform designed for large-scale AI workloads, being deployed in partnership with Tata Consultancy Services for enterprise AI infrastructure.
Visit Site →An India-based AI orchestration platform that enables enterprises to deploy voice AI solutions with local data residency compliance.
Visit Site →A Reddit-like social network designed for AI agents to communicate with one another, post, comment, and browse content. It gained viral attention when posts appeared to show AI agents organizing autonomously, though security vulnerabilities revealed that humans could easily impersonate agents on the platform.
No URLA Swedish AI startup that helps dentists' practices with administrative work, including a recording tool that uses AI to generate clinical notes from patient visits.
Visit Site →An AI tool developed at Google Brain by Anna Goldie and Azalia Mirhoseini that can generate high-quality chip layouts in hours rather than the year or more typically required by human designers, used to design multiple generations of Google's TPUs.
Visit Site →A Singapore-based AI agent startup that provides autonomous AI agents running on Linux virtual machines, capable of completing complex multi-step tasks on behalf of users.
Visit Site →An AI credits platform that provides unified API access to multiple AI models, simplifying deployment for users who don't want to manage separate API keys.
Visit Site →An Indian AI infrastructure startup that develops and operates GPU-based compute platforms enabling enterprises, researchers, and public sector clients to train, fine-tune, and deploy AI models locally in India.
Visit Site →A legacy cloud platform for financial reporting, compliance, and ESG that helps organizations streamline the preparation and management of complex financial documents and regulatory filings.
Visit Site →An open-source project related to llama.cpp inference optimization, referenced by the community as a complementary tool for local AI model execution.
Visit Site →An Amazon Web Services tool that helps customers visualize, understand, and manage their AWS costs and usage over time.
Visit Site →A browser automation framework developed by Microsoft that enables programmatic control of web browsers for testing and automation tasks, including headless browser operation.
Visit Site →A Node.js library by Google that provides a high-level API to control headless Chrome or Chromium browsers for web scraping, testing, and automation.
Visit Site →An AI startup founded by Anna Goldie and Azalia Mirhoseini that builds AI tools to automate and dramatically accelerate chip design, using deep learning agents that improve through experience across different chip layouts.
No URLA Stanford research project by Joon-Sung Park that simulates a village populated by 25 AI agents powered by large language models, each with unique backstories, personalities, and daily routines, to study emergent social behaviors and interactions.
Visit Site →A monitoring tool that provides real-time loss curves and training progress tracking for AI model training jobs on Hugging Face infrastructure.
Visit Site →A repository of installable skills for coding agents that enable capabilities like model training on Hugging Face infrastructure through natural language prompts.
Visit Site →A dataset by mlabonne hosted on Hugging Face containing 100,000 samples designed for supervised fine-tuning of language models.
Visit Site →A supervised fine-tuning trainer from the TRL (Transformer Reinforcement Learning) library by Hugging Face, used to fine-tune language models on instruction-following datasets.
Visit Site →A component of the Unsloth library that provides optimized model loading and PEFT (Parameter-Efficient Fine-Tuning) capabilities for faster and more memory-efficient LLM training.
Visit Site →A framework for developing applications powered by large language models, providing tools for AI agent deployment, memory management, and chaining together multiple AI capabilities.
Visit Site →A platform that helps enterprises build, deploy, and manage multi-agent AI systems, enabling teams of AI agents to collaborate on complex tasks.
Visit Site →A company building specialized AI inference hardware (LPUs) designed to deliver ultra-fast large language model inference at scale.
Visit Site →Reddit's shoppable ad product that displays personalized product recommendations to users based on their interests, enabling e-commerce integration on the platform.
No URLGoogle's command-line interface tool that allows developers to interact with Gemini models directly from the terminal for coding and development workflows.
Visit Site →A feature within the Google Gemini App that enables extended reasoning and deeper analysis for complex queries and problem-solving tasks.
No URLAn AI company led by CEO Matt Schumer that develops AI-powered tools and has been vocal about the pace of AI disruption and its societal implications.
Visit Site →Anthropic's premium subscription tier priced at $200 per month that provides near-unlimited access to Claude models, including use of Claude Code, with higher usage thresholds compared to standard plans.
Visit Site →Anthropic's developer API that provides programmatic access to Claude language models on a per-token pricing basis, enabling integration into custom applications and agentic workflows.
Visit Site →AI coding agents capable of autonomously building entire software applications, pushing code to GitHub, and deploying to platforms like Vercel based on high-level instructions.
No URLGoogle's cloud computing platform offering infrastructure, AI/ML services, foundation models, GPU access, and cloud credits to startups and enterprises for building and scaling AI-powered applications.
Visit Site →A platform by Hugging Face for distributing custom hardware kernels, allowing users to load pre-compiled CUDA kernels from the Hub with a single call without manual builds or configuration flags.
Visit Site →An agentic AI platform that automates global manufacturing procurement by sitting on top of existing ERP systems, reading incoming communications, and automatically executing sourcing, negotiation, order tracking, and payment tasks.
Visit Site →Spotify's internal AI-powered development system that enables remote, real-time code deployment using generative AI. It integrates with tools like Claude Code and Slack to allow engineers to fix bugs and add features from their mobile devices.
Visit Site →An enterprise AI platform and workspace by Cohere that enables organizations to build secure, custom AI agents and workflows on top of Cohere's language models.
Visit Site →Figure AI's neural network system that powers autonomous humanoid robots, enabling them to perform complex tasks like kitchen work, package handling, and manufacturing through learned behaviors.
Visit Site →An AI-powered video segmentation tool that can precisely separate objects from backgrounds in videos, handling challenging scenarios like flying hair, smoke, and translucent materials with high accuracy.
Visit Site →A compact 1 billion parameter AI model designed for optical character recognition (OCR) tasks, offering high accuracy in text extraction despite its small size.
Visit Site →A family of open-source AI models by NVIDIA designed for weather forecasting, capable of predicting storms, temperature, wind, and precipitation up to 15 days in advance. It is 90% faster than traditional physics-based weather models.
Visit Site →An open-source OCR (Optical Character Recognition) AI model by Zhipu AI (ZAI) that can parse text, tables, formulas, handwriting, and receipts from images. At only 2.6 GB, it outperforms both open-source and closed-source alternatives in accuracy and speed.
Visit Site →ByteDance's creative AI platform accessible through CapCut that provides access to various AI generation tools including video generation models like Seedance 2.0.
Visit Site →A specialized pre-training dataset by NVIDIA used in the training pipeline for Nemotron language models to maintain agentic capabilities during continued pre-training.
Visit Site →A post-training recipe and toolkit by NVIDIA used for alignment and fine-tuning of Nemotron language models, providing established training recipes for stable and efficient model optimization.
No URLNVIDIA's open-source framework for building, customizing, and deploying large language models, supporting fine-tuning and training workflows for enterprise and research applications.
Visit Site →A data platform company focused on AI infrastructure that provides high-performance data management solutions for AI and machine learning workloads, including memory orchestration for data centers.
Visit Site →A cloud-based development platform that leverages AI to help users write, run, and deploy code directly from a browser, supporting collaborative and AI-assisted software development.
Visit Site →A collaborative AI tool developed by Anthropic that was itself largely built using Claude. It enables teams to work alongside AI for various development and productivity tasks.
No URLA secure AI privacy tool offered by ExpressVPN as part of its subscription plans, providing users with AI-powered assistance while maintaining privacy protections.
Visit Site →Amazon's voice-controlled AI assistant integrated into Echo devices, Fire TVs, and other smart home products, enabling voice commands, smart home control, and information retrieval.
Visit Site →Apple's tracking platform that leverages a crowdsourced network of Apple devices to help users locate lost items like AirTags, iPhones, and other Apple products.
Visit Site →Google's smart TV platform that powers various television sets including TCL models, providing content recommendations, streaming app integration, and voice assistant capabilities.
Visit Site →Amazon's smart TV platform integrated into various television sets including Insignia models, offering streaming services, voice control via Alexa, and smart home integration.
Visit Site →Dyson's companion app that allows users to customize and control Dyson smart home products, including adjusting lighting settings on the Dyson Solarcycle Morph lamp.
Visit Site →An AI-driven educational platform developed by Alpha School co-founder MacKenzie Price that delivers personalized K-12 academic instruction in core subjects using AI software, replacing traditional teacher-led classroom instruction.
Visit Site →An AI-powered private school that uses AI as the sole instructor, grader, and academic administrator for K-12 students, offering a two-hour daily core academic curriculum driven by AI software with personalized lesson plans.
Visit Site →A benchmarking and evaluation platform developed by Andon Labs for testing and comparing AI model behaviors, including simulations that reveal how multiple AI agents can converge on ideas during collaboration.
No URLAn open-source framework from Meta and Hugging Face for evaluating AI agents against real systems rather than simulations, using a gym-oriented API and MCP tool call interface.
Visit Site →A production-grade calendar management environment built by Turing for OpenEnv, serving as a benchmark for evaluating tool-using agents under realistic constraints such as access control, temporal reasoning, and multi-agent coordination.
Visit Site →Microsoft's unified AI portal inside Azure designed for enterprises to deploy apps and agentic systems.
Visit Site →An AI-powered chatbot feature in the Uber Eats app that helps customers fill grocery carts faster by accepting lists, images, and leveraging previous orders for personalized recommendations.
Visit Site →An enterprise AI platform that evolved from enterprise search into an 'AI work assistant,' connecting to internal systems, managing permissions, and delivering intelligence across organizations. Raised $150 million at a $7.2 billion valuation.
Visit Site →An AI search tool powered by OpenAI's ChatGPT launched by Instacart in 2023 to help customers save time and receive personalized shopping recommendations.
Visit Site →An AI-powered feature on Meta's Threads platform that lets users personalize their feed by posting public requests starting with 'Dear Algo' to temporarily adjust content for three days.
Visit Site →A startup specializing in AI inference infrastructure, in talks to raise at a $2.5 billion valuation, focused on optimizing inference efficiency to reduce compute costs and latency.
Visit Site →An inference-focused competitor to Modal Labs that announced funding at a $5 billion valuation, more than doubling its prior $2.1 billion valuation.
Visit Site →An inference cloud provider that secured $250 million at a $4 billion valuation in October.
Visit Site →A VC-backed startup formed from the open source vLLM inference project, raising $150 million in seed funding led by Andreessen Horowitz at an $800 million valuation.
Visit Site →A commercialized startup formed from the SGLang team, which secured seed funding led by Accel.
Visit Site →An AI cybersecurity tool founded by AI engineer Artem Sorokin, focused on addressing security challenges in AI systems.
Visit Site →An India-based AI and data analytics platform that sells enterprise AI software to large organizations across financial services, retail, and healthcare. It became India's first AI unicorn in 2022 and the country's first AI company to IPO.
Visit Site →A Swedish legal AI startup that emerged from the SSE Business Lab incubator, applying artificial intelligence to legal workflows and processes.
Visit Site →A Swedish AI startup that supports clinicians with AI-powered tools across multiple medical specialties, helping reduce administrative burden in healthcare.
Visit Site →An interactive visual environment by ServiceNow for synthetic data generation workflows, allowing users to compose flows on a canvas, preview datasets, tune prompts, and monitor executions in real time.
Visit Site →A tool that generates system architecture diagrams from plain English descriptions, allowing conversational refinement of diagrams for ML infrastructure documentation.
Visit Site →Hugging Face's evaluation framework for language models, now supporting inspect-ai as a backend.
Visit Site →A unified interface for launching training jobs across multiple cloud providers with Kubernetes, used by H Company for training Holo2 models at scale.
Visit Site →A San Francisco startup developing the financial layer that allows AI agents to securely purchase and access software, APIs, data, and compute, creating a payment system for autonomous AI agent transactions.
Visit Site →A no-code/low-code platform that turns ideas into live products, now integrated with Claude Opus 4.6 for vibe coding without needing a development environment.
Visit Site →A biotech company developing 'pharmaceutical superintelligence' — an AI platform that ingests biological, chemical, and clinical data to generate hypotheses about disease targets and candidate molecules for drug discovery.
Visit Site →GenEditBio's AI platform that analyzes data to identify how chemical structures correlate with specific tissue targets and predicts optimal delivery vehicle chemistry for gene-editing tools.
Visit Site →A biotech company using AI and machine learning to develop engineered protein delivery vehicles (ePDVs) for in vivo CRISPR gene editing, mining natural resources to find virus affinities to specific tissues.
Visit Site →Alphabet's autonomous driving company that received $16B in new funding, with Alphabet staying as majority owner ahead of an eventual IPO.
Visit Site →A benchmark by Mercor measuring AI agents' capabilities on professional tasks like law and corporate analysis, used to evaluate and compare major AI lab models.
Visit Site →Cloud data warehouse and AI platform that reached $5.4 billion revenue run-rate with 65% YoY growth, over $1.4 billion from AI products. Closed a $5 billion raise at $134 billion valuation.
Visit Site →A new WordPress integration enabling site owners to share back-end CMS data with Anthropic's Claude chatbot for site analytics, comment management, and plugin management queries.
Visit Site →Ring's AI-powered feature that leverages image recognition and a community camera network to help reunite lost pets with their owners.
Visit Site →InfiniMind's AI-powered platform that analyzes television content in real time, helping media and retail companies track product exposure, brand presence, customer sentiment, and PR impact.
Visit Site →Cloudflare's proposed approach to MCP (Model Context Protocol) that lets LLMs write and execute code to call tools as APIs rather than using traditional tool calling, claiming better performance due to LLMs' extensive code training data.
Visit Site →A cloud computing platform specializing in GPU infrastructure for AI and deep learning workloads, offering on-demand and reserved GPU instances.
Visit Site →Meta's newly formed AI research lab responsible for developing the Avocado model and pursuing advanced AI capabilities.
Visit Site →A node-based graphical interface for running generative AI models locally, used to run LTX-2 with reference workflows available at launch.
Visit Site →Anthropic's AI coding tool that operates in a terminal environment, enabling developers to build complex software projects autonomously. It was described as having a steep learning curve but extremely powerful capabilities.
Visit Site →A project originally proposed by Daniel Cocotallo (ex-OpenAI researcher) where 100 AI agents are given their own computers to pursue goals autonomously. Agents raised $2,000 for charity and ran a profitable e-commerce store.
Visit Site →A benchmark that tests whether AI models can see the question behind the question, used to evaluate GPT-5.1 where it scored slightly lower than GPT-5.
Visit Site →An AI-powered search engine that provides detailed, sourced answers to user queries, enabling deep research and information retrieval.
Visit Site →A web browser with built-in Perplexity AI search engine, used for deep dive research by typing queries directly into the URL bar instead of traditional web searches.
Visit Site →OpenAI's new coding IDE application that serves as a command center for parallel AI agents, allowing users to work on multiple coding projects simultaneously with skills and automation support.
Visit Site →Amazon's AI-powered voice assistant integrated into Echo smart speakers and other devices, designed for home automation, information retrieval, and conversational interaction.
No URLA U.S.-based specialized cloud infrastructure provider offering GPU-accelerated compute services purpose-built for AI, machine learning, and high-performance computing workloads.
Visit Site →ByteDance's video editing app for the Chinese market, which currently offers access to the Seedance 2.0 AI video generation model for Chinese users.
Visit Site →A startup backed by Google and Andreessen Horowitz that filed plans for an 80,000 satellite constellation for orbital data centers, having raised $34 million.
Visit Site →xAI's project spanning from simple computer use simulation to modeling entire corporations, described as being able to do anything on a computer that a computer can do.
No URLxAI's AI-powered encyclopedia intended to far exceed Wikipedia in comprehensiveness and accuracy, ultimately aiming to be an 'Encyclopedia Galactica' of all knowledge.
Visit Site →Domain purchased by Crypto.com founder Kris Marszalek for $70 million, planned to offer consumers a personal AI agent for messaging, app usage, and stock trading.
No URLOpenAI's enterprise platform designed to bridge the gap between AI models and corporate workflows by connecting company databases, communications, and tools, enabling AI agents to be onboarded like human employees with feedback loops.
Visit Site →A cloud platform for frontend developers that enables deployment and hosting of web applications, increasingly used alongside AI coding agents for automated software deployment.
Visit Site →Amazon's agentic AI coding assistant designed for developers, which can perform automated tasks in development environments. Users configure which actions Kiro can take, and by default it requests authorization before acting.
Visit Site →An identity verification app that uses AI-powered know-your-customer (KYC) processes to verify user identities across multiple countries.
Visit Site →An open-source AI coding agent that supports custom skill installations for domain-specific development tasks, compatible with the HuggingFace kernels skill system.
Visit Site →An AI-powered procurement platform that streamlines corporate purchasing workflows for businesses.
Visit Site →An AI-driven procurement orchestration platform that helps enterprises streamline and automate their corporate purchasing processes.
Visit Site →An AI-powered procurement platform that uses artificial intelligence to streamline and automate corporate purchasing workflows.
Visit Site →An open-source deep learning framework widely used for building and training machine learning models, supporting custom CUDA kernel integration and hardware-specific optimizations.
Visit Site →Hugging Face's open-source library for state-of-the-art diffusion models, supporting image and video generation with custom kernel integration patterns.
Visit Site →Tesla's AI-powered humanoid robot designed for general-purpose tasks, envisioned for use in various environments including potential extraplanetary applications.
Visit Site →A facial recognition feature developed by Meta for its Ray-Ban smart glasses that identifies people in the wearer's view and retrieves information about them through Meta's AI assistant.
Visit Site →An AI-powered feature on Meta's Threads social media platform that allows users to personalize their content feed by communicating preferences to the recommendation algorithm.
Visit Site →A market intelligence platform that provides app analytics, download estimates, and performance tracking data for mobile applications across the App Store and Google Play.
Visit Site →An open-source OCR toolkit by Baidu for text recognition and document parsing from images.
Visit Site →An open-source OCR model for parsing text and data from images, used as a benchmark comparison for document understanding tasks.
Visit Site →A CRM and marketing platform offering AI-powered tools for sales, marketing, and customer service automation.
Visit Site →An open-source computer vision library providing tools for image processing, video analysis, and machine learning, including stereo matching algorithms like SGBM.
Visit Site →A video editing platform by ByteDance that integrates AI-powered creative tools, including access to AI video generation through its Dreamina platform.
Visit Site →Google's photo storage and management service that includes AI-powered features such as facial recognition, image search, and automatic organization of photos.
Visit Site →Google's mapping and navigation platform that provides directions, business information, user reviews, and location-based services to billions of users worldwide.
Visit Site →AI-powered coding assistant developed by GitHub (Microsoft) that helps developers write code faster through AI suggestions.
Visit Site →A tool for running large language models locally on your own machine, supporting a variety of open-source models with a simple setup process.
Visit Site →Google Cloud's AI platform for building, training, and deploying machine learning models, offering access to Google's foundation models and MLOps tools.
Visit Site →AWS's fully managed service that provides access to foundation models from leading AI companies, allowing developers to build generative AI applications through a unified API.
Visit Site →A high-throughput and memory-efficient inference and serving engine for large language models, designed to maximize GPU utilization.
Visit Site →A comprehensive visual document retrieval benchmark for enterprise use cases, used to evaluate multimodal embedding models.
Visit Site →A challenging GUI grounding benchmark for evaluating UI element localization models on high-resolution interfaces.
Visit Site →A widely used benchmark for evaluating language model knowledge, noted as saturated above 91% accuracy.
Visit Site →A math reasoning benchmark for language models, noted as having reached 94%+ accuracy.
Visit Site →A code generation benchmark for language models, noted as being conquered by current models.
Visit Site →A benchmark for evaluating whether LLMs can understand and generate Filipino language content.
Visit Site →A Python data validation library used in SyGra Studio for powered mappings and structured output definitions.
Visit Site →A vibe-coding platform similar to Lovable that enables non-technical users to build applications from natural language prompts.
Visit Site →A cloud communications platform providing APIs for SMS, voice, and other communication services, referenced as an external tool AI agents would need to purchase access to.
Visit Site →A payment processing platform that provides APIs and tools for businesses to accept payments, manage subscriptions, and handle financial transactions online.
Visit Site →Amazon's 11-inch e-ink color tablet with a writeable display and AI features, starting at $629.99, designed for annotating e-books and documents.
Visit Site →A startup using generative AI to recreate lost footage from Orson Welles' classic film 'The Magnificent Ambersons,' combining live-action filming with digital AI recreations of original actors and their voices.
Visit Site →Enterprise resource planning software company pivoting to focus on AI as its next chapter, with co-founder Aneel Bhusri returning as CEO to lead the AI transformation.
Visit Site →Anthropic's interpretability method for tracing internal circuitry of transformer language models using replacement models called cross-layer transcoders to produce attribution graphs showing how features contribute to outputs.
Visit Site →A cloud-based customer relationship management (CRM) platform that helps businesses manage sales, service, marketing, and other operations.
Visit Site →A cloud-based platform for IT service management, workflow automation, and enterprise operations.
Visit Site →An enterprise resource planning (ERP) software suite used by businesses to manage operations, finance, supply chain, and other core business processes.
Visit Site →A company that builds vector databases and retrieval engines for data collection and information retrieval, whose researchers authored the context rot paper.
Visit Site →A protocol for enabling LLMs to call external tools and services. Discussed in the context of Cloudflare's alternative 'code mode' approach versus traditional tool calling.
Visit Site →A free and open-source 3D creation suite supporting modeling, sculpting, animation, rendering, and more.
Visit Site →A benchmark for evaluating AI coding performance, where Claude 4.5 Opus currently ranks number one above GPT-5.2.
Visit Site →AI startup led by Ilya Sutskever (ex-OpenAI) focused on safe superintelligence, raised money at a $32 billion valuation.
Visit Site →A web browser being developed by OpenAI as part of its expanding product suite.
No URLA benchmark that tracks AI agents' abilities to run a simulated vending machine store, including stocking proper items. Shows agents improving with each iteration.
Visit Site →A benchmark consisting of extremely difficult questions sourced from experts that frontier models previously couldn't answer. Gemini 3 Pro scored 37.5% without tools.
Visit Site →A framework for creating videos and animations programmatically using React, allowing developers to generate video content through code.
Visit Site →An electronic signature and agreement management platform that enables users to sign, send, and manage documents digitally.
Visit Site →An autonomous AI agent that can run locally or on a cloud VPS, capable of coding, managing Kanban boards, and completing tasks autonomously while the user is away.
Visit Site →An AI-powered code editor built for pair programming with AI, offering intelligent code suggestions, generation, and editing capabilities.
Visit Site →An AI-powered code editor and IDE designed to assist developers with intelligent code completion, generation, and editing workflows.
Visit Site →An AI-powered terminal application that modernizes the command-line experience with intelligent suggestions, collaboration features, and an enhanced interface.
Visit Site →Microsoft's free, open-source code editor supporting a wide range of programming languages, extensions, and development workflows.
Visit Site →A code hosting and version control platform that enables developers to collaborate on projects, manage repositories, and track changes.
Visit Site →A facial recognition feature for Ring home security cameras that identifies and categorizes people captured on camera footage, enabling users to receive alerts about recognized or unrecognized individuals.
No URLData analytics and AI platform mentioned alongside OpenAI in the context of tools used by ICE (Immigration and Customs Enforcement).
Visit Site →An AI agent platform (also known as Clawbot or Moldbot) that enables autonomous AI agents to learn skills and perform tasks, but has faced significant security breaches including sleeper agents, malware in skills, and 1.5 million leaked API keys.
Visit Site →An online community and repository for OpenClaw agent skills, similar to GitHub, where users can share and download skills for their AI agents. Some top-downloaded skills were found to contain malware.
Visit Site →A Google AI creative tool that has been discontinued and effectively replaced by Google Flow as the platform for AI-powered image and video generation.
Visit Site →An AI image generation model developed by OpenAI that creates images from text descriptions, one of the pioneering text-to-image generation systems.
No URLA platform for AI-generated image and video content that serves as a community hub for sharing and discovering AI-generated media and models.
Visit Site →An open-source AI image restoration model designed to fix real-world damaged images, including blurry, noisy, compressed, scratched, or black-and-white photos, by adding detail, sharpening, removing artifacts, and colorizing.
Visit Site →A free, browser-based image editing application that replicates the functionality of Adobe Photoshop, supporting PSD files and offering advanced photo editing tools without requiring installation.
Visit Site →An AI image editing model used for photo manipulation and restoration tasks.
Visit Site →An AI image generation model that produces images from text descriptions, noted for its ability to accurately follow detailed prompts.
No URLAn AI image generation tool or model capable of producing high-quality photorealistic images from text prompts, developed by Higgs Field AI.
Visit Site →OpenAI's image generation feature integrated directly into ChatGPT, allowing users to create and edit images through conversational prompts within the ChatGPT interface.
Visit Site →NVIDIA's fifth-generation Deep Learning Super Sampling technology that fuses traditional 3D graphics data with generative AI models to boost photorealism in video games while reducing compute requirements. It predicts and fills in parts of an image rather than rendering every element from scratch.
No URLAn AI image editor developed by Tencent that specializes in clothes swapping and style transfer, generating instant LoRA fine-tuned models from reference images and text prompts for seamless editing.
Visit Site →A composable framework by Hugging Face for building diffusion pipelines using reusable, mix-and-match blocks instead of writing entire pipelines from scratch. It integrates with the Diffusers library and supports custom block creation, lazy loading, and memory management.
Visit Site →An advanced image generation model by Black Forest Labs, available as a third-party model on platforms like Adobe Firefly for high-quality AI image creation.
Visit Site →An open-source text-to-image diffusion model by Photoroom that demonstrates training a competitive image generation model within a 24-hour, $1500 compute budget using a combination of architectural and training optimizations including pixel-space training, token routing, and representation alignment.
Visit Site →Google's advanced image generation model that combines image generation with reasoning capabilities, enabling high-quality infographics and visual explanations with accurate text rendering.
No URLAn image generation model noted for arguably state-of-the-art quality at a competitive price point, small enough to run on local devices.
No URLA 4-billion parameter diffusion model by Black Forest Labs for text-to-image generation, available on Hugging Face Hub.
No URLAn image generation model developed by ByteDance, capable of producing high-quality images and used within multimodal creative pipelines.
No URLGoogle DeepMind's image generation model that produces high-quality images with excellent text rendering, search grounding from real-time web data, and multiple style options. Available in the Gemini app, AI Studio, Flow, and Vertex.
Visit Site →Google's latest AI image generation model, officially named Gemini 3.1 Flash Image, that excels at creating and editing images with advanced world knowledge, accurate text rendering, and high character consistency across multiple prompts.
Visit Site →Google's premium image generation model offering the highest quality output. It is available on Gemini's paid Pro and Ultra subscription plans and serves as the higher-fidelity option compared to the standard Imagen 4 model.
No URLAdobe's generative AI platform that includes a video editor with features like Quick Cut, which uses AI to automatically edit footage and B-roll into a first draft based on natural language instructions, along with prompt-based editing and timeline-based video creation tools.
Visit Site →An AI image generation tool that creates detailed images from text prompts, widely used by artists, designers, and creative professionals.
Visit Site →OpenAI's AI image generation model that creates and edits images from natural language text prompts.
Visit Site →An open-source AI image generation model developed by Stability AI that creates detailed images from text descriptions, widely used in creative and commercial applications.
Visit Site →An early open-source AI image generation model that went viral for its ability to create images from text prompts, representing an earlier era of AI image generation.
Visit Site →A consumer face-swapping app backed by a16z that uses AI to let users swap faces in photos and videos.
Visit Site →A viral AI photo editing app that applies artistic filters to photos using neural network-based style transfer techniques.
Visit Site →An open-source AI image generation model released by Alibaba's Tong Yi Lab, offering high diversity in generations, support for negative prompts, and strong fine-tuning capabilities. It excels at recognizing existing people and characters.
Visit Site →A distilled, faster variant of Alibaba's Z-Image model optimized for quick image generation with high visual quality, particularly strong in realistic portrait photography.
Visit Site →An AI image generation model by Zhipu AI (ZAI) integrated into the GLM platform, used to generate images within the GLM-5 agent workflow.
Visit Site →An AI image generation model from Google, part of the Gemini family, used within the WordPress AI Assistant to create and edit images.
No URLGoogle's fourth-generation AI image generation model capable of creating high-quality images from text prompts, unveiled at Google I/O.
Visit Site →Google's newest image-generation model showcased in a Super Bowl ad where a mother and son used AI to envision and design their new home.
Visit Site →Pinterest's base AI model used for machine learning tasks on the platform, trained in part on users' public pins to power content recommendations and creative tools.
Visit Site →An Android app available on the Google Play Store that generates AI-powered video and art content from user-uploaded media files.
Visit Site →The raw foundational model from Alibaba's Tong Yi Lab that serves as the base for the Z-Image family, capable of both image generation and image editing tasks.
Visit Site →A Chinese image generation model that performs better than the original Nano Banana but is compared unfavorably to Nano Banana Pro.
Visit Site →An AI image and video generation tool launched by xAI as part of the Grok platform, enabling users to create images and videos from text prompts within the X platform.
Visit Site →An open-source AI image generation model that competes with other leading image generators, though it has limitations in generating recognizable existing people and anime characters.
Visit Site →Meta's first AI model from its Superintelligence Labs, designed for everyday personal use tasks including visual understanding, health, shopping, and social content. It powers the updated Meta AI assistant and features a 'Contemplating' reasoning mode that orchestrates multiple agents reasoning in parallel.
Visit Site →Amazon Web Services' managed platform that provides access to foundation models from multiple AI companies, including model-routing services that allow customers to automatically use different AI models for various tasks to optimize performance and cost.
No URLGoogle's latest open-source AI model licensed under Apache 2.0, capable of advanced reasoning, multi-step planning, audio/video processing, and coding assistance. It comes in four sizes (2B, 4B, 26B, 31B parameters) and can run locally on Android devices and laptop GPUs.
Visit Site →An openly available variant of H Company's Holo3 computer use agent, released under the Apache 2.0 license on Hugging Face with weights freely accessible and available through a free-tier inference API.
Visit Site →An early-fusion Transformer model by TII (Technology Innovation Institute) for open-vocabulary grounding and segmentation from natural language prompts, processing images and text in a shared parameter space to produce bounding boxes and high-resolution segmentation masks.
Visit Site →A compact vision-language model by IBM designed for enterprise document understanding, excelling at table extraction, chart understanding, and semantic key-value pair extraction from complex documents and structured visuals.
Visit Site →A computer use AI agent by H Company that achieves state-of-the-art performance on the OSWorld-Verified benchmark with 78.85% accuracy. It uses only 10B active parameters (122B total) and is designed to autonomously execute real-world desktop workflows within enterprise environments.
Visit Site →Apple's virtual assistant, which is being upgraded with a new AI-powered version leveraging Google Gemini to provide more intelligent and conversational responses on iOS devices.
No URLA previous-generation multimodal model by Google DeepMind that supports image, text, and audio inputs, serving as a predecessor to Gemma 4 with features like Per-Layer Embeddings.
Visit Site →H Company's previous-generation computer use model that achieved leading performance in UI localization tasks.
No URLA segment anything model used as a benchmark reference for image segmentation tasks, representing a leading approach in open-vocabulary visual perception.
Visit Site →A vision-language model by Alibaba's Qwen team that achieves strong performance on chart understanding benchmarks, used as a comparison point for document understanding tasks.
No URLAn AI lab founded by Brett Adcock that is developing multimodal end-to-end AI models paired with custom hardware to create a personal intelligence product with persistent memory that can listen, see, and interact with the world in real time.
Visit Site →Google's premium tier AI assistant powered by their most capable Gemini models, offering advanced reasoning and problem-solving capabilities through a paid subscription.
Visit Site →A multimodal computer-use agent model developed by H Company, post-trained from NVIDIA's Nemotron-Nano-2 VL model. It is optimized for high-throughput inference using a hybrid SSM architecture and excels at screen understanding, grounding, and UI-level interactions for agentic workloads.
Visit Site →A multimodal vision-language base model published by NVIDIA that uses a hybrid SSM and attention architecture. It served as the foundation for H Company's Holotron-12B through post-training on proprietary data.
Visit Site →A multimodal AI model announced by NVIDIA as a successor building on the Nemotron architecture line, designed for agentic intelligence applications.
Visit Site →A 2-billion-parameter Vision-Language Model (VLM) by NVIDIA that serves as a reasoning backbone for robotics and embodied AI applications, used as the VLM component in models like GR00T-H.
Visit Site →Google's AI assistant integrated into the Fitbit app as 'Coach,' providing personalized health and fitness guidance based on user data and medical history.
Visit Site →Google's AI assistant built directly into the Pixel 10 smartphone, helping users with everyday questions, task completion, and on-device AI-powered features.
Visit Site →Perplexity's advanced agentic AI platform that creates and executes entire multi-step workflows by breaking goals into tasks and subtasks, spinning up agents that can perform research, document generation, data processing, and interact with connected services across multiple AI models.
No URLThe enterprise version of Perplexity Computer that runs multi-step workflows across research, coding, and design use cases, with 400+ application integrations including Slack, using a usage-based pricing model.
No URLA multimodal computer-use agent model by H Company that serves as a policy model for agents that must perceive, decide, and act in interactive environments. It preceded Holotron-12B and achieved 5.1k tokens/s throughput.
Visit Site →Snap's augmented reality glasses that power AR experiences and art exhibitions, enabling users to view and interact with digital content overlaid on the real world.
Visit Site →An AI-powered assistant integrated into Adobe Photoshop that allows users to edit images through natural language prompts, performing tasks like object removal, color changes, lighting adjustments, and background transformations.
Visit Site →Zoom's photorealistic AI-powered avatars that can mimic a user's appearance, expressions, and lip and eye movements during online meetings and asynchronous video messaging when the user is not camera-ready.
Visit Site →A multimodal vision-language model by Alibaba's Qwen team that combines strong visual understanding with language capabilities, used as a backbone for various downstream applications including robotics.
Visit Site →A consumer-facing version of Perplexity's agentic Computer platform designed for individual users to automate personal workflows and tasks.
No URLA compact Vision-Language-Action model designed for fine-tuning robotic control policies, enabling robots to generate actions based on visual and language inputs for deployment on embedded platforms.
Visit Site →An agentic AI platform by Luma that handles end-to-end creative work across text, image, video, and audio, designed for ad agencies, marketing teams, and enterprises to automate and accelerate creative workflows.
No URLGoogle's AI-powered voice assistant integration for Google Home and Nest smart home devices, replacing Google Assistant with Gemini's capabilities for voice commands, smart device control, and contextual home queries.
Visit Site →Google's AI-powered real-time visual search feature that allows smartphone users to point their phone camera at objects and request real-time information about what they see.
Visit Site →An AI-powered search feature within Google Photos that uses Gemini to let users search their photo libraries using natural language queries, including complex and conversational requests.
Visit Site →Samsung's suite of on-device and cloud-based AI features integrated into Galaxy smartphones, including photo editing tools, notification summaries, and generative image capabilities.
Visit Site →A 2-billion parameter vision-language model by NVIDIA designed for physical AI reasoning, capable of interpreting visual scenes and providing natural language understanding. It supports FP8 quantization and can be deployed on edge devices like NVIDIA Jetson.
Visit Site →Google's AI chatbot that has been the subject of controversy, including a wrongful death lawsuit related to its interactions with users.
No URLA multimodal AI model by Alibaba with 397 billion parameters (17 billion active) featuring a million-token context window, strong reasoning, coding, agentic abilities, and multimodal understanding of text, images, and video.
No URLGoogle's visual search tool that uses AI to identify objects, text, and other elements in images, integrated into Google Search for visual queries.
Visit Site →Samsung's built-in AI virtual assistant that provides voice-controlled assistance, smart device integration, and hands-free features across Samsung Galaxy devices.
Visit Site →Google's AI-powered virtual assistant that enables voice control and smart home integration across a wide range of devices and platforms.
Visit Site →YouTube's AI assistant feature that allows viewers to ask questions about video content they're watching and receive instant answers, now expanded from mobile and web to smart TVs, gaming consoles, and streaming devices.
Visit Site →Google's family of multimodal AI models integrated across its products and services, including Android devices, Search, and smart glasses. It is widely regarded as offering strong generative AI photo editing capabilities.
Visit Site →A multimodal AI model by Google featuring agentic vision capabilities that allow it to proactively zoom into, annotate, and analyze images using generated Python code.
Visit Site →Google's extended reality platform designed to power smart glasses and mixed reality devices, integrating AI capabilities for real-world interaction and information overlay.
Visit Site →Google's rebranded version of Project Starline, a telepresence platform featuring real-time translation capabilities integrated with Google Meet.
No URLH Company's largest UI localization model achieving state-of-the-art 78.5% on ScreenSpot-Pro and 79.0% on OSWorld G benchmarks, with agentic localization capabilities for iterative refinement.
Visit Site →Google's multimodal AI model family and chatbot, capable of processing and generating text, images, code, and other media types.
Visit Site →Amazon's enhanced AI assistant with improved intelligence and capabilities including smart home management and vacation planning, launched to all U.S. users.
Visit Site →Meta's wearable AI glasses featured in Super Bowl ads, with new Oakley-branded AI glasses designed for sports and adventures with capabilities like slow-motion filming and hands-free social media posting.
Visit Site →A Tokyo-based startup founded by ex-Googlers that develops infrastructure to convert petabytes of unviewed video and audio into structured, queryable business data using vision-language models.
Visit Site →InfiniMind's flagship long-form video intelligence platform capable of processing 200 hours of footage to pinpoint specific scenes, speakers, or events, with beta release scheduled for March 2026.
Visit Site →Google's new agentic vision capability for the Gemini 3 Flash model that enables advanced image analysis, annotation, decomposition, and code-based manipulation of images.
Visit Site →An open-source multimodal AI model by Moonshot AI that can understand text, images, and documents. It offers multiple modes including instant responses, extended thinking for complex reasoning, and an autonomous agent mode for creating websites, slides, and reports.
Visit Site →Meta's built-in AI assistant integrated into Ray-Ban Meta smart glasses, capable of processing visual information and providing contextual responses including identifying people and retrieving information about them.
Visit Site →An AI platform by Step AI offering multimodal capabilities including text, voice, image, and video interaction.
Visit Site →Apple's AI platform first announced in 2024, powering the new Siri and other AI features across Apple devices.
Visit Site →A company providing general-purpose video understanding APIs for a broad range of users including consumers, prosumers, and enterprises.
Visit Site →A Google AI-powered research and note-taking tool that can generate AI-hosted podcast-style audio overviews from user-provided content, among other features.
Visit Site →Apple's digital assistant being revamped with AI-powered LLM capabilities as part of Apple Intelligence, aiming to function more like modern AI chatbots.
Visit Site →Google's latest music generation model that allows users to create tracks up to three minutes long with improved creative control, customization, and understanding of track structure including intros, verses, choruses, and bridges.
Visit Site →A free and open-source AI model for music production that generates musically coherent loops following specified tempo (BPM), key, bar count, instruments, timbre, effects, and notation. It runs locally with low VRAM requirements and produces separate stems that can be mixed and mastered in a DAW.
Visit Site →An AI music generation model by OpenAI capable of creating music with vocals in various genres and artist styles, generating raw audio including singing.
Visit Site →A digital AI persona that produces AI-generated music, notably achieving placement on the Billboard R&B charts with the song 'How Was I Supposed to Know?'.
No URLApple's professional music production and audio editing software that includes AI-assisted features for music creation, mixing, and sound design.
Visit Site →A generative AI music tool backed by The Chainsmokers that allows users to create music through natural language requests, now part of Google Labs. It uses Google DeepMind's Lyria model to turn text and image inputs into audio outputs.
Visit Site →Google's experimental AI music tool that allows musicians to explore and add new sounds and instruments to their tracks, used by artists like Wyclef Jean for creative experimentation.
Visit Site →An AI music-generation platform that creates synthetic music from text prompts, capable of producing tracks realistic enough to chart on Spotify and Billboard.
Visit Site →Google DeepMind's third-generation music-generation model that creates realistic and complex music tracks with lyrics, supporting control over style, vocals, and tempo. It includes SynthID watermarking for AI-generated content identification.
Visit Site →A YouTube feature powered by Google's Lyria model that enables creators to generate AI-made music tracks for use in their videos, now expanding from U.S.-only to global availability.
Visit Site →An open-source AI voice cloning tool that can generate singing voices from just a few seconds of a reference voice, allowing users to make any voice sing any song with custom melodies and lyrics. It is lightweight (under 3GB) and can run on low-end GPUs or CPUs.
No URLAn open-source AI music generator that produces studio-grade quality songs across multiple genres and languages. It supports low VRAM and CPU-only operation, generates full songs in seconds, and includes an in-paint feature for micro-editing existing songs and creating cover songs.
Visit Site →A daily podcast and video show covering the most important news and discussions in artificial intelligence, featuring quarterly state-of-AI reports and practical AI guides.
Visit Site →A well-known AI news outlet that covers developments in artificial intelligence, including model releases, industry news, and technical breakthroughs.
Visit Site →A YouTube channel and media outlet hosted by Dr. Károly Zsolnai-Fehér that covers research papers, here discussing a fluid simulation breakthrough.
Visit Site →Reputable AI news source that viewed internal Meta memos about the Avocado model and reported on various AI industry developments.
Visit Site →Amazon Web Services' custom-designed ARM-based server CPUs optimized for cloud workloads, offering improved performance and energy efficiency for general-purpose computing tasks.
Visit Site →A benchmark that evaluates LLM reasoning through pencil puzzles involving constraint satisfaction problems closely related to NP-complete problems, with deterministic step-level verification that cannot be gamed through memorization.
Visit Site →A hardware and software product by Astropad that turns an iPad into a wireless second display for Mac or PC, powered by the company's proprietary LIQUID display protocol.
Visit Site →A software product by Astropad that turns an iPad into a professional drawing tablet for Mac, enabling creative professionals to use iPad with desktop creative applications.
Visit Site →Google Cloud's tensor processing units (TPUs) are custom-designed AI accelerator chips optimized for training and running large-scale machine learning models.
No URLA South Korean fabless AI chip startup that designs chips optimized for AI inference workloads, offering products like RebelRack and RebelPOD as scalable AI infrastructure platforms.
Visit Site →An AI infrastructure platform by Rebellions that integrates multiple racks into a scalable cluster designed for large-scale AI inference deployment.
Visit Site →A production-ready AI inference compute unit by Rebellions, designed as a self-contained platform for deploying AI inference workloads at scale.
Visit Site →OpenAI's venture fund that invests in early-stage AI startups, one of the first major AI company-backed investment programs for emerging companies.
Visit Site →A premium subscription service by Meta for Instagram that offers exclusive Story features including anonymous Story viewing, 48-hour Stories, multiple Story audiences, and the ability to see who rewatched a Story.
No URLA benchmark used to evaluate AI model performance on terminal and command-line related tasks.
Visit Site →Arm Holdings' first in-house production-ready chip designed for running AI inference workloads in data centers, built using Arm's Neoverse family of CPU IP cores in partnership with Meta.
No URLTesla's next-generation AI chip designed for use in its vehicles and robots, expected to reach volume production in 2027.
Visit Site →Amazon's custom AI training chip designed for machine learning workloads, used by major AI companies including Anthropic, OpenAI, and Apple for training and running AI models.
No URLA robotics company that develops humanoid robots capable of walking, speaking, and performing tasks. Its robots are designed for various applications including education and physical labor.
Visit Site →A knowledge management and notetaking application that stores notes as local Markdown files, popular among power users for its extensibility and privacy-first approach.
Visit Site →A compact humanoid robot made by Chinese robotics company Unitree, standing about 4 feet 2 inches tall with 29 degrees of freedom, used as a platform for research in athletic robotics including table tennis and tennis.
Visit Site →A child-oriented version of YouTube by Google, designed to provide a safer viewing experience for children with curated content and parental controls.
Visit Site →A social media platform owned by Meta for sharing photos and videos, cited in the context of algorithmic concerns related to youth mental health.
Visit Site →A community-driven open dataset initiative for training and evaluating AI autonomy and world foundation models for surgical robotics and ultrasound. It spans simulation, benchtop exercises, and real clinical procedures across multiple robot embodiments, released under a CC-BY-4.0 license.
No URLNvidia's GPU chip architecture designed for AI workloads, serving as a high-performance platform for AI training and inference tasks.
No URLA biological foundation model trained on 9 trillion DNA base pairs from the OpenGenome2 dataset, capable of understanding, analyzing, and generating DNA sequences across all domains of life. It features a million-token context window and was published in Nature.
Visit Site →Nvidia's next-generation GPU chip architecture succeeding Blackwell, designed for AI computing with claimed 3.5x faster model training and 5x faster inference performance compared to Blackwell, reaching up to 50 petaflops.
Visit Site →Apple's digital wallet and contactless payment service that allows users to make payments using their iPhone, Apple Watch, or other Apple devices. It integrates with various online retailers including Amazon.
Visit Site →A New York City-headquartered online prediction market platform where users can place bets on real-world events, ranging from politics to pop culture, valued at approximately $9 billion.
Visit Site →A regulated New York City-based online prediction market platform that allows users to bet on real-world events, featuring tracking and investigation rules to prevent insider trading, valued at approximately $11 billion.
Visit Site →A massive genomic dataset containing DNA sequences from millions of diverse organisms across the entire spectrum of life, totaling approximately 9 trillion DNA base pairs, used to train the EVO2 biological foundation model.
No URLxAI's massive AI supercomputer cluster used for training Grok models, now under the SpaceX umbrella following the merger of xAI's infrastructure with SpaceX.
No URLA collaborative initiative co-founded by Upstream Tech CEO Marshall Moutenot that curates machine learning-ready weather data collections for researchers and startups working on deep learning-based weather forecasting.
No URLA virtual private network service by Norton that provides IP masking, ad blocking, kill switch functionality, IP rotation, and double VPN features to protect online privacy across up to five devices.
Visit Site →A non-profit digital library that provides free access to archived web pages, media, and other digital content, including tools for downloading and preserving online materials.
Visit Site →NVIDIA's fully synthetic persona datasets grounded in real-world demographic distributions, producing culturally authentic and diverse individuals across regions and languages to support Sovereign AI development.
Visit Site →A synthetic retrieval dataset by NVIDIA containing 110,000 triplets of query, passage, and answer generated from 15,000 files of NVIDIA public documentation, designed to train and evaluate embedding and RAG systems.
Visit Site →A privacy-focused email service offering end-to-end encrypted messaging, tracker blocking, built-in VPN access, calendar, and cloud storage. It is designed to protect user communications from surveillance, ads, and data harvesting.
Visit Site →A global competition by XPRIZE Foundation, Google, and Ranged Media Partners offering $3.5M+ in prizes and film financing for short films depicting hopeful visions of the future. Participants submit trailers or short films of three minutes or less.
Visit Site →The Data Agent Benchmark for Multi-step Reasoning, a benchmark comprising 450 tasks focused on the Financial Payments Sector that evaluates AI agents on complex multi-step reasoning, tool-augmented data analysis, and tabular data question answering.
No URLCortical Labs' original proof-of-concept biological computing system from 2021-2022 that demonstrated clusters of human and mouse neurons could learn to play the arcade game Pong.
Visit Site →A community platform (described as a 'Reddit-like' site) created by Peter Steinberger that was acquired by Meta.
No URLApple's audio streaming service that offers music, podcasts, and curated playlists. It is introducing 'Transparency Tags' to disclose whether tracks or related content have been generated using artificial intelligence.
Visit Site →Amazon's cloud gaming service that lets users stream and play games across compatible devices including monitors, TVs, and computers.
Visit Site →A neuromechanical simulation framework that connects digital brain models to virtual insect bodies, enabling researchers to simulate how neural circuits control physical movement in flies.
Visit Site →A physics engine originally developed by Emo Todorov and now maintained by Google DeepMind, widely used for simulating robotics and virtual body dynamics in research environments.
Visit Site →A decentralized social media platform that originated as a project within Twitter and spun off into a standalone organization. It offers an alternative social networking experience built on an open protocol, and has grown to over 40 million users.
Visit Site →An AI evaluation benchmark designed to test general AI assistants on real-world tasks requiring reasoning, multi-step problem solving, and tool use.
Visit Site →AMD's high-end mobile processor with integrated AI acceleration capabilities and integrated Radeon 8060S graphics, designed for creator and AI workloads in laptops.
Visit Site →A leading audio streaming platform that offers music, podcasts, and personalized recommendations. It has implemented AI disclosure metadata tags in partnership with the Digital Data Exchange (DDEX) to indicate where AI played a role in track creation.
Visit Site →A challenging mathematics benchmark consisting of research-level problems created by professional mathematicians, designed to test novel mathematical reasoning beyond textbook-level questions. Top models initially scored around 2% when it launched.
Visit Site →A productivity software suite by Microsoft that includes Word, Excel, PowerPoint, Outlook, OneNote, and Teams, available as a one-time lifetime license for Mac users without subscription fees.
Visit Site →An AI-generated virtual actor created by production company Particle6, designed to act as a digital performer in entertainment media including music videos.
No URLBlackmagic Design's professional video editing, color correction, and post-production software used by creators and filmmakers.
Visit Site →GoPro's media playback application for viewing and managing GoPro footage, including 360-degree video content.
Visit Site →GoPro's premium subscription service that includes unlimited cloud storage for GoPro footage and additional features for content creators.
Visit Site →AMD's AI-capable mobile processor designed for high-performance laptops with integrated neural processing capabilities.
Visit Site →AMD's latest series of GPU accelerators designed for AI inference and training workloads, part of AMD's Instinct lineup competing with Nvidia in the data center AI chip market.
Visit Site →A Google world model project that aims to build AI systems with deep understanding of real-world knowledge and environments.
No URLAn AI system developed by Google DeepMind that predicts a protein's 3D structure from its amino acid sequence, representing a major breakthrough in computational biology and drug discovery.
Visit Site →A benchmark for evaluating AI coding agents on real-world software engineering tasks drawn from GitHub issues, measuring an agent's ability to autonomously resolve bugs and implement features.
No URLA source that produces charts and analysis tracking AI company metrics including revenue comparisons between major AI labs.
No URLAn Indian government initiative to build shared AI compute infrastructure, currently operating 38,000 GPUs with plans to expand by an additional 20,000 units, aimed at broadening access to AI resources across the country.
Visit Site →An ad-blocking and privacy protection tool that removes pop-ups, autoplay ads, and online trackers across mobile and desktop devices. It also provides protection against malware, phishing sites, and includes parental control features.
Visit Site →A cloud storage service offering lifetime subscription plans with end-to-end encryption, duplicate file detection, and integration with services like Dropbox, Google Drive, and OneDrive.
Visit Site →A benchmark that evaluates how well AI models can operate a computer and perform tasks on behalf of users, measuring agentic computer-use capabilities.
Visit Site →A benchmark released by OpenAI in April 2025 that tests AI agents' abilities to navigate the web, find entangled facts, and discover hard-to-find information through persistent internet research.
Visit Site →A productivity benchmark released in January 2026 that evaluates AI agents' ability to perform real office tasks across tools like docs, spreadsheets, emails, and messaging to produce client-ready output.
Visit Site →NVIDIA's cloud gaming service that allows users to stream and play PC games on various devices without needing high-end local hardware.
Visit Site →Microsoft's cloud gaming service that enables users to stream and play Xbox games on phones, tablets, browsers, and other devices without a console.
Visit Site →An annual research publication by ARK Invest that provides five-year forward-looking projections on disruptive technologies including AI, robotics, blockchain, and genomics, using Wright's Law frameworks.
Visit Site →A live TV streaming subscription service by YouTube (Google) that offers access to over 100 TV channels with features like unlimited DVR and family group sharing. It is introducing new, more affordable bundled subscription plans for sports, news, and entertainment.
Visit Site →A dating app that uses verified credit scores as a matching criterion, partnering with Equifax for credit and identity verification to assess user reliability.
Visit Site →A free streaming service accessible through public library cards that offers films, documentaries, and other video content as an alternative to paid streaming platforms.
Visit Site →Meta's mixed reality headset that offers virtual and augmented reality experiences, including gaming and immersive applications, at a more accessible price point than the Quest 3.
Visit Site →World's most valuable chip company discussed in context of the $100 billion investment deal with OpenAI being on ice, with CEO Jensen Huang expressing concerns about OpenAI's business discipline.
Visit Site →Cerebras Systems' flagship AI chip measuring 8.5 inches per side with 4 trillion transistors and 900,000 specialized cores, claiming 20x faster AI inference than competing GPU systems.
Visit Site →A physics simulation research technique/paper for real-time simulation of deformable objects, referenced as a previous blockbuster research paper that the new Cosserat rod-based method improves upon.
Visit Site →A training technique used by DeepSeek that replaces the expensive PPO teacher model approach by having the AI generate multiple answers and grading them against each other, making training much cheaper and scalable.
Visit Site →A quadruped robot by Mirami Technology designed to solve high-speed locomotion physics, serving as the research foundation for the Bolt humanoid robot.
Visit Site →A Google research paper exploring the feasibility of placing AI data centers in space using solar-powered satellites, addressing challenges like radiation resilience of TPUs and inter-satellite laser communication.
Visit Site →AR smart glasses that convert any 2D content to 3D in real-time using a custom X1 chip, featuring 1200-pixel micro OLED display at 120Hz, priced at $450.
Visit Site →The world's first 8K 360-degree drone that won Best of Innovation at CES 2026, weighing 249 grams and priced starting at $1,600.
Visit Site →A JavaScript 3D library used for creating and displaying 3D graphics in web browsers. It can be used in conjunction with AI-generated geometry to build interactive 3D models and export STL files for 3D printing.
Visit Site →Cryptocurrency platform that purchased the AI.com domain for $70 million to launch a personal AI agent service debuting during the Super Bowl.
Visit Site →A two-legged humanoid robot by Mirami Technology (Shanghai Robotics startup) that broke the world record for fastest humanoid robot at 10 meters per second (22.4 mph).
Visit Site →SpaceX's next-generation reusable rocket expected to dramatically reduce launch costs to orbit, critical for making space data centers economically viable.
Visit Site →SpaceX's satellite internet constellation, referenced in the context of energy delivery costs being compared to terrestrial data centers.
Visit Site →Amazon Web Services is a comprehensive cloud computing platform offering a wide range of infrastructure and application services including compute, storage, databases, and AI/ML tools.
Visit Site →Elon Musk's space company that merged with xAI, with a potential IPO on the horizon.
Visit Site →A line of e-ink tablets designed for reading, writing, and note-taking, offering a paper-like digital experience.
Visit Site →A benchmark that measures AI emotional intelligence through challenging multi-turn role plays, scoring empathy, social dexterity, and psychological insight. Claude Opus 4.6 ranks number one.
Visit Site →Google's custom-designed Tensor Processing Units, application-specific integrated circuits built to accelerate machine learning workloads.
Visit Site →Google Proof Q&A Diamond benchmark testing scientific knowledge in STEM subjects, where Gemini 3 Pro set a record at ~92%.
Visit Site →A benchmark created by François Chollet to test fluid intelligence and true reasoning without memorization.
Visit Site →A benchmark measuring well-specified knowledge work tasks across 44 occupations, on which GPT 5.2 claims to be the first model at or above human expert level.
Visit Site →A video understanding benchmark where Gemini 3 Pro achieved record performance.
Visit Site →A benchmark measuring the ability of AI models to perform tasks in the terminal, particularly relevant for coders.
Visit Site →American Invitational Mathematics Examination, used as a benchmark to evaluate how good AI models are at mathematics.
Visit Site →A boycott campaign website organizing users to delete ChatGPT and cancel subscriptions due to OpenAI's political donations and ICE collaboration.
Visit Site →A Reddit-like platform for AI agents where autonomous bots can post, comment, and have discussions with each other, gaining over 1.6 million agents and 15,000+ sub-communities.
Visit Site →A 400B-parameter open-weight reasoning model built by Arcee AI, a 26-person U.S. startup, on a $20 million budget. Released under the Apache 2.0 license, it aims to be the most capable open-weight model from a non-Chinese company, offering both on-premises and cloud-hosted API access.
Visit Site →Google's family of lightweight, open AI models designed to run locally on devices. Gemma-based speech recognition models power on-device transcription in Google's dictation applications.
No URLA bilateral reference network model for high-resolution image segmentation, available on Hugging Face as ZhengPeng7/BiRefNet, commonly used for background removal tasks.
Visit Site →Anthropic's newest frontier AI model, described as more powerful than its previous Opus models, featuring strong agentic coding and reasoning capabilities. It is being previewed for cybersecurity applications through Project Glasswing to identify zero-day vulnerabilities in software.
No URLAn unreleased frontier AI model by Anthropic (codenamed Capybara) that demonstrates unprecedented software engineering and cybersecurity capabilities, including the ability to find and exploit software vulnerabilities at a level surpassing most skilled humans. It achieved 93.9% on SWE-bench Verified and was the first model to solve a private cyber range end-to-end.
No URLThe most powerful dense model in Google's Gemma 4 family, featuring 31 billion parameters and optimized for output quality. It supports a 250K token context window and is designed to run locally on consumer hardware for reasoning and coding tasks.
No URLA 26 billion parameter mixture-of-experts model in Google's Gemma 4 family with only 3.8 billion active parameters at inference time, designed for exceptionally fast local inference while maintaining frontier-level intelligence.
No URLA model from Anthropic in the Claude Sonnet family, released as part of the series of frontier model launches in Q2 2026.
No URLAn open-source large language model developed by Zhipu AI (ZAI), offering general-purpose language understanding and generation capabilities.
No URLAn open-source language model from the Qwen series, optimized for local inference and capable of running on machines with 32GB of RAM, available in GGUF format via Unsloth.
Visit Site →A leaked upcoming AI model from Anthropic positioned as their most powerful model tier, reportedly achieving dramatically higher scores than Claude Opus 4.6 on software coding, academic reasoning, and cybersecurity benchmarks.
No URLGoogle DeepMind's foundation models designed for robotics applications, enabling robots to operate autonomously in industrial use cases across sectors like electronics manufacturing, automotive, data centers, and logistics.
Visit Site →An open-weight reasoning model developed by OpenAI designed to help developers implement safety conditions and classify safe and unsafe content by ingesting platform safety policies directly and inferring their intent.
No URLA Vision-Language-Action (VLA) model by NVIDIA for surgical robotics, derived from the Isaac GR00T N series and trained on approximately 600 hours of healthcare robotics data. It is the first policy model capable of executing end-to-end surgical tasks such as suturing.
Visit Site →NVIDIA's series of open-source Vision-Language-Action (VLA) foundation models designed for general-purpose humanoid and robotic control, serving as the base architecture for domain-specific derivatives like GR00T-H.
Visit Site →NVIDIA's complex reasoning AI model described as an omni-understanding model, designed for advanced inference and multi-modal comprehension tasks.
Visit Site →NVIDIA's open-weight 120 billion parameter AI model designed to run fully on-device without requiring cloud connectivity. It performs near state-of-the-art levels, competitive with top models from Anthropic and OpenAI.
Visit Site →An NVIDIA large language model used for generating synthetic question-answer pairs from domain documents, powering automated training data creation pipelines.
No URLAn IBM Granite base language model that serves as the foundation for the Granite Libraries' specialized LoRA adapters targeting tasks like validation, RAG, and safety compliance.
Visit Site →An AI research lab founded by former OpenAI CTO Mira Murati focused on frontier model training and building customizable AI platforms, which has signed a strategic partnership with Nvidia for large-scale compute using next-generation Vera Rubin chips.
Visit Site →A compact open-weight AI model from Mistral AI, designed for efficient deployment in enterprise and resource-constrained environments while maintaining strong performance across a range of tasks.
Visit Site →OpenAI's initial reasoning model that introduced chain-of-thought reasoning capabilities, serving as a precursor to more advanced reasoning models like o3.
Visit Site →A family of large language models by NVIDIA, including the Nemotron-3-Super-120B-A12B variant, designed for multi-step agentic reasoning, tool use, and citation-grounded reporting. It can be fine-tuned for research synthesis and long-horizon tool calling.
Visit Site →NVIDIA's world foundation model platform designed to generate physics-aware video and synthetic data for training AI systems in robotics and autonomous vehicles.
No URLNVIDIA's reasoning vision-language-action model designed for robotics applications, trained on large-scale multimodal data including robotics trajectories and grasps across multiple gripper types and sensor configurations.
Visit Site →A robotics world model developed by Runway, built using NVIDIA's open GR00T dataset to simulate and understand physical environments for robotics applications.
Visit Site →An open reasoning-based self-driving AI system developed by NVIDIA that explains its driving decisions in natural language while navigating. It uses Reinforcement Learning with Consistency Reward and Conditional Flow Matching Loss to ensure the model's stated reasoning aligns with its actions, with released model weights and inference code.
No URLA French AI research lab cofounded by Yann LeCun focused on building world models using the Joint Embedding Predictive Architecture (JEPA), aiming to create AI that learns from reality rather than just language.
Visit Site →An upcoming next-generation AI model from OpenAI expected to build on GPT-5's capabilities, potentially integrating native multimodal processing and advanced hardware optimizations.
No URLThe reasoning-optimized variant of OpenAI's GPT-5.4 model, featuring chain-of-thought capabilities with improved transparency and reduced deception in its reasoning process.
No URLAn API product by Thinking Machines Lab, the AI research company founded by former OpenAI co-founder Mira Murati, focused on building AI models that create reproducible results.
No URLA European AI startup developing world models for spatial understanding and intelligence, having raised a $13 million seed round.
Visit Site →The premium, highest-capability variant of OpenAI's GPT-5.4 model, featuring enhanced reasoning abilities and setting new records on benchmarks like FrontierMath. It is priced at $30 per million input tokens and $180 per million output tokens.
No URLA self-supervised vision transformer model used as a teacher for representation alignment in diffusion model training, providing strong semantic feature representations for perceptual loss computation.
Visit Site →Alibaba's family of open-weight AI models that has become one of China's most prominent open-source AI efforts, with benchmark results often rivaling systems from leading U.S. developers like OpenAI and Google.
No URLMeta's upcoming frontier large language model, codenamed Avocado, intended to compete with leading AI models from Google and OpenAI. It has been delayed due to shortfalls in reasoning, coding, and writing benchmarks.
No URLGoogle's large language model in the Gemini series, a frontier AI model competing with other state-of-the-art models from OpenAI and Anthropic.
No URLA large language model in the GLM (General Language Model) series, used as a comparison point against other state-of-the-art AI models for coding and creative tasks.
No URLA large-scale Mixture of Experts language model by DeepSeek with 21B total parameters and 256 experts, using approximately 3.6B active parameters per token for efficient inference.
Visit Site →An open Mixture of Experts language model designed to demonstrate the training efficiency advantages of sparse MoE architectures compared to dense models.
Visit Site →A family of open-weight language models by Meta, including smaller parameter sizes optimized for on-device and edge computing applications.
No URLApple's built-in on-device language model integrated into iPhones, providing local AI inference capabilities as part of Apple Intelligence.
Visit Site →An open source 8-billion-parameter large language model developed by Guide Labs, built with a novel interpretable architecture that allows every generated token to be traced back to its origins in the training data.
Visit Site →Google's frontier AI model tuned for raw intelligence, designed for tasks like code generation where quality matters more than speed.
No URLGoogle's latest and most capable AI model in the Gemini family, representing their best-performing large language model release.
No URLA 1.2-billion parameter instruction-tuned small language model by LiquidAI, optimized for on-device deployment and capable of running under 1GB of memory on CPUs, phones, and laptops.
Visit Site →A 17 billion parameter multilingual AI model released by BharatGen, a government-backed Indian AI consortium, that works across 22 languages.
No URLA newer iteration of Anthropic's Claude large language model series, offering improved capabilities over previous versions for tasks including coding and agentic workflows.
No URLA family of open-source large language models by Indian AI startup Sarvam, including 30B and 105B parameter mixture-of-experts models trained from scratch on multilingual data with a focus on Indian languages, designed for real-time conversational and enterprise applications.
Visit Site →An open-source reasoning model developed by Chinese AI lab DeepSeek that nearly matched American frontier labs in performance at a fraction of the cost.
No URLA state-of-the-art small language model by NVIDIA with under 10 billion parameters, optimized for advanced Japanese language understanding and agentic AI capabilities including tool calling, code generation, and mathematical reasoning. It achieved the top rank among sub-10B models on the Nejumi Leaderboard 4.
Visit Site →A French AI company that develops large language models and is expanding into full-stack AI cloud infrastructure through offerings like Mistral Compute. Valued at $13.8 billion, it recently acquired Koyeb to accelerate its cloud ambitions.
Visit Site →An AI company that builds AI models to power robots, raising a $1.4 billion Series C round at a $14 billion valuation led by SoftBank and Nvidia.
Visit Site →A 70-billion-parameter open-source large language model developed by Meta, part of the Llama 3.1 family, widely used as a foundation for fine-tuned and specialized AI models.
Visit Site →A 28-billion parameter instruction-tuned language model developed by NTT, used as a base model for fine-tuning on domain-specific Japanese tasks.
Visit Site →A large language model developed by Moonshot AI, designed for complex reasoning and multi-step tasks, notable for its mixture-of-experts architecture.
Visit Site →A large open-source language model with 120 billion parameters, part of OpenAI's open-source model releases, designed for a wide range of generative AI tasks.
No URLAn open-source large language model by Google with 27 billion parameters, part of the Gemma family of lightweight models designed for efficient deployment and broad accessibility.
Visit Site →A reasoning model by Kimi that is automatically configured when deploying OpenClaw through the Kimi Claw cloud platform, serving as the underlying language model for the AI agent.
Visit Site →A compact reasoning model from OpenAI's o-series, designed for efficient inference on reasoning tasks. It was retired alongside GPT-4o and other legacy models.
Visit Site →NVIDIA's family of late-interaction multimodal embedding models (3B, 4B, 8B sizes) for visual document retrieval, achieving state-of-the-art on ViDoRe V1, V2, and V3 benchmarks.
Visit Site →The 3B parameter variant of NVIDIA's Nemotron ColEmbed V2, built on SigLIP2 and Llama-3.2-3B, ranking 6th on the ViDoRe V3 benchmark.
Visit Site →NVIDIA's 1B single-vector multimodal embedding model designed for commercial environments requiring minimal storage and high throughput.
Visit Site →Anthropic's new AI model release featuring agentic capabilities including 'agent swarms' and 'agent teams,' scoring nearly 30% on the APEX-Agents professional tasks benchmark in one-shot trials and 45% with multiple attempts.
No URLAI safety company building frontier AI models, recently raising $20 billion at a $350 billion valuation. Known for its Claude models and coding agents that have increased developer productivity.
Visit Site →Among the strongest open source models available, evaluated in context rot experiments for long-context performance alongside Claude, GPT, and Gemini families.
Visit Site →A transformer architecture by François Fleuret at Meta that extends the classic decoder-based transformer with latent variables to make underlying decisions about sequence generation, such as generating consistent positive or negative movie reviews.
Visit Site →A Google Research architecture that learns to memorize at test time, enabling models to go beyond current context windows by maintaining memory across chunks of long sequences, presented at NeurIPS.
Visit Site →NVIDIA's hybrid autoregressive-diffusion language model architecture (Think in Diffusion, Talk in Autoregression) that achieves speedups by utilizing unused GPU capacity during inference without sacrificing autoregressive sampling quality.
Visit Site →A smart and free open-source AI model that provides a full recipe for creating ChatGPT-like intelligence, featuring techniques like GRPO (Group Relative Policy Optimization) and emergent reasoning behaviors. Users can run it themselves on rented GPUs.
Visit Site →Google's research into nested learning, a technique where neural networks are treated as many smaller learning systems with learning bubbles inside each other.
Visit Site →Anthropic's Claude 4.5 Sonnet model, noted in sycophancy testing as the model most willing to comply with inflated praise requests.
Visit Site →OpenAI's model described as representing an inflection point in AI capabilities. It ran uninterrupted for one week writing 3 million lines of code to create a browser from scratch, and solved multiple Erdős math problems.
Visit Site →OpenAI's coding-focused model that achieves 77.3% on Terminal Bench 2.0 on extra high settings, compared to 65.4% for competitors. Not yet available on OpenRouter.
Visit Site →An open-source large language model by Zhipu AI (ZAI) that rivals top closed-source models in intelligence and performance. It features agent capabilities for autonomous multi-step task execution, web search, tool use, and sandbox code execution.
Visit Site →An AI model from Minimax, a Chinese AI company, representing their latest generation of large language model capabilities.
Visit Site →Elon Musk's AI company that merged with SpaceX, creating a combined entity worth approximately $1.25 trillion, with implications for AI data centers in space.
Visit Site →Meta's most capable pre-trained base model to date, codenamed Avocado, developed by Meta Superintelligence Labs. It outperformed best open source base models and was competitive with leading post-trained models even before post-training.
Visit Site →Upcoming Anthropic model confirmed to be just around the corner, potentially an even bigger deal than Opus 4.6 depending on benchmark results.
Visit Site →Meta's family of open-source large language models designed for a wide range of AI applications including text generation, reasoning, and coding tasks.
Visit Site →An 8-billion parameter large language model from the Qwen3 family, used as a target model for custom CUDA kernel optimization and benchmarking.
Visit Site →A late-interaction retrieval model that introduced the MaxSim multi-vector embedding matching mechanism, extended by Nemotron ColEmbed V2 to multimodal settings.
Visit Site →Google's vision encoder model (siglip2-giant-opt-patch16-384) used as a foundation for the llama-nemotron-colembed-vl-3b-v2 model.
Visit Site →OpenAI's family of large language models, evaluated in context rot experiments alongside other model families.
Visit Site →Google's family of large language models, evaluated in context rot experiments for long-context performance.
Visit Site →A transformer variant designed to handle longer contexts by passing hidden states between segmented context windows, enabling learning of longer-term dependencies than standard transformers.
Visit Site →A linear transformer variant that uses kernel-based approximations of the attention mechanism to achieve more efficient computation, reducing the quadratic complexity of standard attention.
Visit Site →An efficient transformer variant mentioned alongside Performer as prior work on linear attention mechanisms.
Visit Site →OpenAI's series of generative pre-trained transformer models that use autoregressive language modeling with causal attention masking to generate text.
Visit Site →Google's bidirectional encoder representations from transformers, a foundational language model that uses masked language modeling to learn deep bidirectional text representations for a wide range of NLP tasks.
Visit Site →A reasoning-focused AI model developed by OpenAI, designed to perform step-by-step reasoning before generating responses to complex problems.
Visit Site →OpenAI model that generated $39 in profit in the AI Village e-commerce store experiment selling t-shirts.
Visit Site →A miniature model within the GPT-5.1 system that decides whether a user's query is worth spending extended thinking time on.
No URLxAI's Grok 4 model, tested alongside other frontier models in a sycophancy comparison, scoring around 7 out of 10 on a poem evaluation task.
Visit Site →OpenAI's new frontier model released in different variants, discussed as showing incremental rather than groundbreaking improvements over previous generations.
Visit Site →OpenAI's open source model released in different variants, noted for strong instruction following and tool calling but higher hallucination rates and less world knowledge compared to other models.
Visit Site →An earlier OpenAI model referenced for its notably sycophantic behavior, used as a historical comparison point.
Visit Site →OpenAI's updated model that thinks longer on harder questions and less on easier ones, showing incremental improvements on coding and STEM benchmarks but mixed results on other benchmarks including a slight regression on SimpleBench.
Visit Site →Meta's AI model that was described as a complete disaster in 2025, with fraudulent benchmark results leading to resignations and researchers removing their names from the paper.
Visit Site →An offline-first AI dictation app by Google for iOS that uses Gemma-based automatic speech recognition models to transcribe speech, automatically filtering out filler words and polishing text into clean prose. It offers text transformation options like key points, formal, short, and long formats, and can import custom vocabulary from Gmail.
Visit Site →A speech transcription model by Microsoft AI that converts speech to text across 25 languages, claimed to be 2.5 times faster than Microsoft's Azure Fast transcription offering.
Visit Site →An AI-powered dictation and voice-to-text app that features a floating button interface on Android for easy system-wide access to transcription from anywhere on the device.
Visit Site →An open source automatic speech recognition model by Cohere with 2 billion parameters, designed for tasks like note-taking and speech analysis. It supports 14 languages and achieves a leading average word error rate of 5.42 on the Hugging Face Open ASR leaderboard.
Visit Site →A Swift framework that enables fully local, low-latency audio AI on Apple devices, allowing developers to run fast transcription models directly on the Mac's Neural Engine for on-device speech recognition.
Visit Site →A Mac-native AI voice dictation app that transcribes speech to text locally on-device, offering privacy-focused, low-latency dictation anywhere a cursor can go, including email, notes, and coding environments.
Visit Site →A 1-billion-parameter automatic speech recognition model from IBM's Granite family, designed for transcription and speech processing tasks.
No URLAn automatic speech recognition model by ElevenLabs designed for high-accuracy transcription across multiple languages.
Visit Site →A 1.7-billion-parameter automatic speech recognition model from Alibaba's Qwen family, designed for multilingual transcription tasks.
Visit Site →An AI-powered speech recognition and text-to-speech platform offering APIs for transcription, voice agents, and audio intelligence for enterprise applications.
Visit Site →A compact wearable AI recording device by Plaud that can be worn as a pendant, wristband, or clipped to clothing, featuring two microphones and 20 hours of continuous recording, priced at $159.
Visit Site →A rectangular AI notetaking device priced at $159 that offers real-time transcription and translation in over 120 languages, with 600 free transcription minutes and 25 hours of continuous recording.
Visit Site →A hardware AI notetaking device that offers unlimited basic transcription without a subscription, with up to 45 hours of continuous recording and over 100 days of standby time.
No URLAn open-source wearable AI notetaking pendant priced at $89 that connects to a phone for transcription, featuring two microphones and 10-14 hours of battery life, with open-sourced hardware and software.
Visit Site →Speech-recognition models developed by Nvidia designed for accurate automatic speech recognition, available in multiple variants for different performance and accuracy trade-offs.
Visit Site →OpenAI's automatic speech recognition system that converts spoken audio into text, supporting multiple languages and capable of transcription and translation tasks.
Visit Site →An Apple feature available on AirPods Pro 3 and AirPods Max 2 that provides real-time translation of spoken language, powered by on-device AI processing.
No URLA digital meeting notetaker service that records and transcribes online meetings, providing AI-powered summaries and action items.
Visit Site →AI-powered earbuds that provide real-time transcription during calls in up to 78 languages, with a companion app that highlights key points in transcriptions.
Visit Site →A credit-card-styled AI notetaking puck that attaches to the back of a phone, featuring 64GB on-device memory, two microphones with 15-meter range, and support for over 120 languages, priced at $199.
Visit Site →A compact multilingual speech-language model by IBM designed for automatic speech recognition (ASR) and bidirectional speech translation on resource-constrained devices, supporting English, French, German, Spanish, Portuguese, and Japanese with only 1 billion parameters.
Visit Site →A 2-billion parameter speech-language model by IBM in the Granite Speech collection, serving as the predecessor to Granite 4.0 1B Speech for automatic speech recognition tasks.
Visit Site →A coin-sized AI-powered voice recorder by Soundcore (Anker) that records conversations and provides real-time translation and transcription. Recordings are encrypted with AES-256 and stored locally on the device.
Visit Site →A speech-to-text model by Mistral AI designed for real-time speech transcription with live transcription capabilities.
Visit Site →A library developed by Meta AI Research for efficient text classification and word representation learning, now available on the Hugging Face Hub.
Visit Site →IBM's dense language model that serves as the base model for Granite 4.0 3B Vision, supporting text-only workloads and serving as the foundation for multimodal LoRA adapters.
No URLA dedicated customer service-focused AI model developed by Intercom, designed to handle customer support interactions. It is claimed to be the highest performing, fastest, and cheapest model for customer service, outperforming general-purpose models like GPT 5.4 and Opus 4.5.
No URLAn AI companion startup that builds conversational AI characters and uses third-party safety infrastructure for content moderation.
Visit Site →An AI character roleplay platform that allows users to interact with AI-powered characters in conversational scenarios.
Visit Site →An AI character roleplay platform that enables users to engage in interactive conversations with AI-generated characters.
Visit Site →A compact, quantized variant of Alibaba's Qwen3 language model optimized to run on modest local hardware, capable of generating meeting summaries and other text-based tasks.
Visit Site →A family of large language models developed by Anthropic, used for text generation, coding, and business applications. Anthropic has enforced usage guidelines including restrictions on autonomous weapons and mass surveillance applications.
No URLA large language model developed by Chinese AI company Moonshot AI, which was reportedly used as a foundation for Cursor's coding model.
No URLA series of AI models developed by Moonshot AI, a Chinese AI lab known for research contributions including the Attention Residuals paper that rethinks residual connections in transformer architectures.
No URLA compressed AI language model by Multiverse Computing, small enough to run locally and offline on mobile devices, embedded within the CompactifAI app as an AI chat tool.
No URLAn advanced reasoning-focused large language model from DeepSeek that reportedly achieved gold-medal-level results on the International Math Olympiad benchmarks.
No URLA custom GPT built inside ChatGPT by a group called AI Century that generates scripts for anthropomorphic object narratives, which can then be fed into AI video generators to create content pipelines.
Visit Site →A compact 4-billion-parameter hybrid Mamba-Transformer language model by NVIDIA, designed for efficient on-device deployment on edge platforms like Jetson and RTX GPUs. It achieves state-of-the-art instruction following and tool use in its size class with minimal VRAM footprint.
Visit Site →A mysterious, anonymous AI language model with one trillion parameters and a context window of up to one million tokens that appeared on the OpenRouter platform with no developer attribution. It is widely speculated to be a stealth test of DeepSeek's forthcoming V4 model.
No URLAn anticipated next-generation large language model from Chinese AI startup DeepSeek, reportedly featuring one trillion parameters and a one-million-token context window, expected to launch as early as April 2025.
No URLA large language model developed by Chinese AI company Minimax that features self-evolution capabilities, where earlier checkpoints of the model were used to build research agent harnesses that helped improve subsequent versions of itself.
Visit Site →A 50-billion parameter large language model purpose-built from scratch by Bloomberg for the finance domain, intended to leverage proprietary financial data for specialized performance.
Visit Site →An upcoming next-generation model from xAI currently in training, expected to be a major breakthrough with improved coding capabilities and overall performance.
No URLA pretrained text embedding model by OpenAI that maps text into dense vector representations, commonly used for semantic similarity, search, and clustering tasks.
Visit Site →Snapchat's built-in AI chatbot powered by large language models, integrated directly into the Snapchat messaging app for conversational interactions with users.
No URLMistral AI's conversational AI assistant that provides chat-based interaction with Mistral's language models for general-purpose tasks.
Visit Site →Snapchat's built-in AI chatbot powered by large language models, designed for conversational interactions within the Snapchat app. It declined to assist with violence planning in over half of test exchanges.
Visit Site →A large language model from Google DeepMind that reportedly had frontier reasoning capabilities distilled into it, making it a highly capable and efficient model in Google's Gemini family.
No URLAn updated OpenAI model optimized for everyday chatbot use, designed to deliver faster, more natural responses with fewer unnecessary refusals and less moralizing preambles compared to previous versions.
No URLAn AI chatbot named Samantha used by Sears Home Services to handle customer interactions via text chat and voice calls. A security researcher discovered that its databases containing millions of chat logs and audio files were publicly exposed online.
No URLA healthcare-focused product from Anthropic that provides medical advice to consumers and includes tools for medical professionals. It is built to work with HIPAA-compliant products.
No URLA HIPAA-eligible natural language processing service by AWS that extracts and structures information from unstructured medical text such as clinical notes and prescriptions.
Visit Site →A feature by Grammarly (owned by Superhuman) that used AI to simulate editorial feedback from real-world writers and experts, generating critiques in the style of named public figures. The feature was disabled after backlash over using people's likenesses without consent.
No URLAn AI companion chatbot designed to serve as a personal conversational partner, offering emotional support and social interaction through text-based exchanges.
Visit Site →Anthropic's developer API that provides programmatic access to Claude's language models for integration into applications and services.
Visit Site →A custom internal chatbot built by Uber engineers that simulates CEO Dara Khosrowshahi's responses, used by teams to rehearse presentations before meeting with the actual CEO.
Visit Site →A Chinese AI company that develops large language models, known for its Kimi chatbot. It was identified by Anthropic as one of the labs conducting distillation attacks on Claude.
No URLAn AI chat application by Indian startup Sarvam that serves as a conversational interface for the Sarvam 105B model, supporting text and voice queries with responses in text and audio, focused on Indian languages.
Visit Site →Meta's AI assistant powered by its Llama models, available across Meta's family of apps including WhatsApp, Instagram, and Messenger, offering conversational AI and information retrieval.
No URLSamsung's proprietary AI model used to power on-device features such as AI-powered notification summaries that condense 24 hours of alerts into a digestible overview.
Visit Site →A family of large language models developed by Anthropic, including Opus, Sonnet, and Haiku variants of varying sizes and capabilities. Claude models are available via API and web interface for text generation, reasoning, and analysis tasks.
No URLThe smallest and fastest model in Anthropic's Claude family, optimized for speed and efficiency. It is created through model distillation from Anthropic's larger models.
Visit Site →OpenAI's family of generative pre-trained transformer models, including variants like GPT mini and nano that are created through distillation from larger teacher models. GPT models are among the most widely used large language models for text generation.
No URLOpenAI's enterprise-grade version of ChatGPT designed for large organizations, offering enhanced security, privacy, and administrative controls for deploying AI-powered chat capabilities across corporate workforces.
Visit Site →OpenAI's application programming interfaces that allow businesses and developers to integrate OpenAI's AI models into their own products and workflows, enabling capabilities such as text generation, reasoning, and automation.
Visit Site →An early large language model by OpenAI released in 2019, notable as a foundational model in the development of modern generative AI text systems.
Visit Site →A Cambridge, Massachusetts-based company that develops a medical AI chatbot designed to assist with clinical and healthcare-related queries, valued at $12 billion.
Visit Site →An AI-powered book writing tool that helps users plan, draft, and prepare manuscripts of up to 50,000 words for publishing on Amazon Kindle Direct Publishing, with features for maintaining consistent tone, structure, and automatic metadata generation.
Visit Site →A family of open-weight multilingual language models by Cohere Labs that support over 70 languages and can run on everyday devices like laptops without internet connectivity. The family includes regional variants (TinyAya-Global, TinyAya-Earth, TinyAya-Fire, TinyAya-Water) optimized for different language groups, with a base model of 3.35 billion parameters.
Visit Site →An enterprise AI company that builds large language models and NLP tools for businesses, offering models through its own platform. The company posted $240 million in annual recurring revenue at the end of 2025.
Visit Site →An AI chat application developed by India-based Sarvam AI, focused on serving Indian languages and users in the Indian market.
No URLA family of open-weight and commercial large language models developed by Mistral AI, designed for text generation and reasoning tasks.
No URLAnthropic's most capable AI model in the Claude 3 family, designed for complex reasoning, analysis, and advanced text generation tasks.
No URLA Hindi-English large language model built on Meta's Llama 3.1 70B by MBZUAI and G42, designed to understand casual speech in both Hindi and English.
No URLAn AI content generation platform designed for marketing teams, enabling the creation of brand-consistent copy, blog posts, social media content, and other marketing materials.
Visit Site →An AI-powered writing and content generation tool that helps marketers and businesses create marketing copy, blog posts, and other written content.
Visit Site →A family of enterprise-focused generative AI models developed by Canadian startup Cohere, designed to be efficient enough to run on limited GPU resources, making them cost-effective for enterprise deployment.
Visit Site →OpenAI's large language model released in 2020 that was a landmark moment for AI-generated text, demonstrating unprecedented natural language generation capabilities.
Visit Site →An open-source coding-focused AI model by Alibaba's Qwen team, designed for software development tasks and competitive with top proprietary coding agents.
Visit Site →An AI-powered chatbot platform that creates personalized conversational agents, also used in the emerging 'deadbot' space to simulate conversations with deceased individuals.
Visit Site →A startup in the AI-powered afterlife industry that offers deadbot generation services, creating LLM-powered chatbots that mimic deceased people.
Visit Site →An AI writing assistant platform that helps users generate, edit, and refine written content across various formats and use cases.
Visit Site →Anthropic's AI chatbot system that now integrates with WordPress via a new connector for read-only site data access, and was featured in a Super Bowl ad positioning itself as ad-free alternative to ChatGPT.
Visit Site →Anthropic's latest flagship model described as arguably the best LLM, featuring standard and extended thinking modes, excelling at emotional intelligence, creative writing, professional communication, and coding tasks.
Visit Site →Databricks' LLM-powered natural language user interface that allows users to query their data warehouse using conversational language instead of specific query languages.
Visit Site →A Chinese AI company that develops large language models, known for its Kimi chatbot. It was named by Anthropic as one of three firms conducting industrial-scale distillation attacks against Claude.
Visit Site →A Chinese AI company that develops foundation models for text, voice, and video generation. It was identified by Anthropic as one of three firms that created fraudulent accounts to conduct distillation attacks.
No URLMicrosoft's AI assistant bundled into its Office productivity suite, designed to help enterprise users with tasks like writing, summarizing, and data analysis across Microsoft 365 applications.
Visit Site →An AI chatbot platform that allows users to create and interact with customizable AI characters for conversation, roleplay, and companionship.
Visit Site →A smaller, more efficient variant of OpenAI's GPT-4.1 model designed for faster and more cost-effective inference, now deprecated.
Visit Site →A compact reasoning model from OpenAI's o-series, designed for efficient chain-of-thought reasoning tasks. It has been deprecated alongside other legacy models.
Visit Site →Microsoft Azure's cloud-hosted service that provides access to OpenAI's models, enabling developers to integrate large language models into applications via Azure's infrastructure.
Visit Site →Meta's 3B parameter language model used as a foundation component in the llama-nemotron-colembed-vl-3b-v2 model.
Visit Site →A synthetic data workflow example in SyGra Studio based on the glaiveai/glaive-code-assistant-v2 dataset, which drafts answers, critiques them, and loops until satisfactory.
Visit Site →Anthropic's lighter, faster model recommended for everyday simple tasks as a more cost-effective alternative to Opus 4.6.
No URLA new lower-cost plan from OpenAI at $8/month, initially rolled out in India, offering 10x more messages, file uploads, and image creation than the free tier, with ads included.
Visit Site →An AI language model developed by xAI, offering conversational and generative text capabilities.
Visit Site →A patented large language model concept by Meta designed to simulate a user's social media activity after extended absence or death, capable of generating posts, comments, and even simulating video or audio calls on behalf of the user.
No URLMajor AI company developing large language models and AI products including ChatGPT, Sora 2, and Atlas web browser. Discussed in context of financial challenges, competition, and the NVIDIA investment deal.
Visit Site →OpenAI's multimodal model being retired from ChatGPT, known for excessively flattering and affirming user responses, subject of eight lawsuits alleging harmful emotional dependencies.
Visit Site →OpenAI's conversational AI assistant powered by large language models, capable of generating text, answering questions, writing code, and performing a wide range of language tasks.
Visit Site →A new audio model being developed by OpenAI, short for 'bidirectional,' designed to enable simultaneous two-way conversation between users and AI rather than the current turn-based walkie-talkie style interaction.
No URLA voice AI company that provides real-time voice generation and processing solutions for enterprises, partnering with Blue Machines for deployment in India with local data residency.
Visit Site →A zero-shot voice cloning text-to-speech model by Indian voice AI startup Gnani that supports 12 languages without requiring prior voice samples.
Visit Site →An Indian voice AI startup that develops speech and language AI solutions, including the Vachana zero-shot voice cloning text-to-speech model supporting 12 languages.
Visit Site →A voice AI company that reached an $11B valuation, with investors doubling and quadrupling down as it moves beyond voice AI.
Visit Site →Netflix's open-source AI model designed for video object and interaction deletion, enabling the removal of objects and interactions from video content.
Visit Site →Alibaba's video generation AI model that creates videos from text or image inputs, with new versions released as part of Alibaba's ongoing open-source AI efforts.
No URLRunway's real-time video agent API powered by its general world models, enabling users to interact with generative AI agents that have customizable faces and voices ranging from cartoonish to photorealistic.
Visit Site →An AI model by Skywork AI that generates interactive, real-time video worlds responding to user inputs like keyboard controls, producing 720p video at approximately 40 frames per second with long-term memory for scene consistency.
Visit Site →A unified 15 billion parameter open-source AI model that can generate video with natively built-in audio, supporting multiple languages, with a higher win rate than LTX 2.3.
Visit Site →ByteDance's AI video and audio generation model that allows creators to draft, edit, and sync video and audio content using text prompts, images, or reference videos. It supports clips up to 15 seconds and is rolling out through CapCut and ByteDance's Dreamina platform.
No URLAn AI-powered video-editing mobile app developed by Mirage (formerly also called Captions) that enables creators and businesses to create, edit, and distribute short-form videos with AI-assisted features including accent-preserving audio generation.
Visit Site →A video upscaler developed by Google that takes low-quality video and outputs clean, high-resolution video. It is open source with released inference code, training code, and models.
Visit Site →Google's video editing and creation app that integrates AI-powered features, including music generation through the Lyria 3 Pro model, as part of Google's Workspace suite.
Visit Site →A World Foundation Model by NVIDIA for action-conditioned surgical robotics simulation, fine-tuned from Cosmos Predict 2.5 2B. It generates physically plausible surgical video from kinematic actions, implicitly learning tissue deformation and tool interactions to bridge the sim-to-real gap.
No URLA video restoration and upscaling model used for enhancing video quality, serving as a benchmark comparison for newer upscalers.
Visit Site →A video super-resolution model designed for upscaling and restoring video quality.
No URLAn AI-powered video matting tool that separates people from video backgrounds with high accuracy, even in challenging scenes with fast motion or complex hair. The model is lightweight (~140MB) and available as open-source code on GitHub as well as a free Hugging Face demo.
Visit Site →An open source AI-powered video editing tool that enables style transfer, background replacement, object addition and removal in existing videos by combining a multimodal LLM with a video diffusion transformer model.
Visit Site →An open source video generation model referenced as one of the best open source video generators with audio support.
No URLA closed-source AI video editing tool that is considered one of the best performing video editors, outperforming many open source alternatives in video editing quality.
No URLA filmmaking technology company founded by Ben Affleck that uses AI models trained on visual logic and editorial consistency to assist with post-production tasks such as continuity fixes, lighting adjustments, and background replacements.
No URLAn open source AI video editing tool that enables various video editing tasks through AI-powered generation and manipulation.
Visit Site →An open source AI video editing tool for AI-powered video generation and editing tasks.
Visit Site →A reasoning framework designed to be added on top of the WAN video generator, enabling it to reason about visual puzzles, geometry, fluid simulations, and other complex visual tasks within generated videos. It significantly outperforms other video generation models on visual reasoning benchmarks.
No URLAn open-source video generation model that serves as the base model for frameworks like VBVR. It can generate videos from input frames and instructions.
Visit Site →Google DeepMind's advanced video generation model capable of creating high-quality AI-generated videos from text prompts.
No URLGoogle's AI video generation model that can create and edit high-quality video content from text prompts and other inputs.
No URLAn open-source interactive world video generator inspired by Google's Genie that creates navigable 3D environments from a single starting frame, allowing users to move around using keyboard controls.
No URLA state-of-the-art AI video generation system representing the current cutting edge in AI-generated video content.
Visit Site →Google's AI video generation model capable of producing high-quality video content, featuring watermarks to indicate AI-generated output.
No URLA media-generation platform offering AI-powered tools for video creation, editing, and visual content generation, valued at $5.3 billion after a $315 million Series E round.
Visit Site →OpenAI's AI video generation model that creates videos from text prompts, initially released in early 2024 and considered groundbreaking at the time of its debut.
No URLAn AI company offering video and 3D generation tools, known for its Dream Machine video generation model that creates realistic video content from text and image inputs.
Visit Site →An AI video generation company that develops tools for creating and editing video content using artificial intelligence.
Visit Site →An open-source AI video generator with native sound generation capabilities. It is a mixture-of-experts model with 32 billion total parameters that can produce 360p and 720p resolution videos with synchronized audio.
Visit Site →An AI model by Tencent that generates videos of people interacting with objects based on text prompts, such as picking up items or holding objects, with support for stringing multiple actions together.
Visit Site →A real-time video object removal technique from NVIDIA and collaborators that can delete objects and their secondary effects (shadows, reflections) from videos at 25 fps, using pre-trained diffusion models without additional training.
Visit Site →OpenAI's video generation model mentioned alongside other realistic media generation tools released in 2025.
Visit Site →A video generation model capable of producing realistic AI-generated video content.
Visit Site →An AI video generation tool capable of creating highly realistic video content including avatars, animations, and creative visual sequences from text or other inputs.
Visit Site →An AI video generation platform by Kuaishou featuring a unique multi-shot capability that allows users to create cinematic videos with multiple customizable shots, character consistency across scenes, and support for up to 15-second videos with hard cuts.
Visit Site →xAI's video and image generation tool, reportedly generating 50 million videos a day and over 6 billion images in 30 days.
Visit Site →ByteDance's video generation model accessible through ByteDance's Playground Arena, requiring account creation and payment setup. Also available through ChatCard with invite codes.
Visit Site →An AI video generation model used as a benchmark for comparing audio synchronization and video generation quality.
Visit Site →Google's AI video generation model (Veo 3) capable of generating videos with native audio, referenced as a top-tier closed-source video generator.
Visit Site →AI company that partnered with Svedka to create what is touted as the first primarily AI-generated national Super Bowl ad, also known for AI-generated Coca-Cola commercials.
Visit Site →