Auto-curated from YouTube and web sources
AI platforms are aggressively bundling ecosystem partnerships — Lovable/Google/Wiz, Meta/Shopify/Zendesk — to lock in enterprise customers through integrated agent marketplaces and billing. Simultaneously, regulators are forcing transparency concessions: the UK's publisher opt-out for AI search and Amazon's Ring facial recognition lawsuit signal growing friction between AI deployment speed and consent frameworks.
A 3D reconstruction model that can turn a set of images into simulation-ready 3D scenes, enabling accurate articulation and animation of reconstructed objects.
Apple's 3D model generator that reconstructs view-dependent 3D representations from input images, capturing how objects visually behave from different angles including reflections and lighting changes.
An open-source image-to-3D model generator that creates highly accurate 3D meshes from single images using pixel-aligned reconstruction, producing detailed geometry and realistic textures superior to competitors.
A 3D model generation tool that converts images into 3D models, developed as a competitor in the image-to-3D generation space.
A 3D asset generation model that creates 3D models from images, serving as one of the competitors in the image-to-3D generation space.
An AI system that reconstructs complete 3D objects from one or a few RGB-D images, capable of generating full geometry, textures, and positions of objects even when they are partially occluded. It is trained on a massive synthetic dataset of nearly 200,000 high-quality 3D assets and over 3 million synthetic RGB-D images.
An AI system that generates 3D models complete with skeletal rigs, making the output ready for animation workflows.
A Google DeepMind model announced in August that can generate dynamic, playable worlds from text prompts or images, retaining consistency for a few minutes at 720p resolution.
A benchmarking platform (formerly known as LMSYS Chatbot Arena) that ranks and compares large language models based on human preference evaluations, maintaining a leaderboard chart of model performance.
An open benchmark by IBM Research and Hugging Face for comparing full AI agent systems across multiple tasks, reporting both quality and cost metrics to evaluate generality and deployment worthiness.
The Massive Text Embedding Benchmark is a comprehensive evaluation framework for assessing text embedding models across multiple tasks including retrieval, classification, and clustering in both English and multilingual settings.
A monthly index published by fintech company Ramp that tracks AI adoption among businesses by analyzing expense data from over 50,000 companies using Ramp's platform.
A relaunched news aggregator platform that ingests content from X (formerly Twitter) in real time, performing sentiment analysis, clustering, and signal detection to surface and rank the most important AI news stories.
A platform that hosts AI hackathons and community events, connecting developers with AI tools and compute resources to build innovative projects.
A composite benchmarking platform that aggregates model performance across 10 hard benchmarks to produce a single intelligence score reflecting overall general intelligence for text-based tasks, reasoning, knowledge, science, coding, and agents.
A Hugging Face-hosted leaderboard that ranks open-source large language models on standardized benchmarks, enabling community comparison of model performance.
A quality-first Arabic LLM leaderboard developed by the Technology Innovation Institute (TII) that validates benchmarks through a rigorous multi-stage quality pipeline before evaluating models, ensuring scores reflect genuine Arabic language capability.
An AI model evaluation platform based on user testing and voting, where users compare AI model outputs to generate crowdsourced rankings and leaderboards.
An AI evaluation leaderboard by Epoch AI that combines several benchmark tests into a single composite score to rank and compare frontier AI models.
A web application that tracks and compares AI model benchmark scores on a leaderboard, providing tooltips and detailed performance data across various models.
An all-in-one AI assistant app that integrates multiple leading AI models including GPT, Claude, and Gemini into a single platform, offering text chat, image generation, video creation, document summarization, OCR, and real-time web search across devices.
A Google Labs app for iOS and Android that uses data from a user's connected Google services (Gmail, Calendar, Photos, YouTube, Search History) to generate a curated daily collection of AI-illustrated lifestyle stories, recommendations, and inspiration.
Coralogix's AI agent designed to help engineers investigate incidents and query operational data within the Coralogix observability platform.
A desktop application by Nous Research that provides a local AI agent interface supporting multiple LLM providers, API key management, memory/context persistence, voice chat, and integration with messaging platforms like Discord. It can connect to remote Hermes backends and supports various models including DeepSeek.
A toolkit by NVIDIA for optimizing and quantizing AI models for efficient inference, supporting configurations like W4A16 NVFP4 for faster deployment on NVIDIA hardware.
An open-source standard by Microsoft that provides developers, compliance, and security teams a consistent way to define and enforce policies governing AI agent behavior, including what agents may do, what they must not do, and when human approval is required.
An AI compliance service that sits between AI models and end users to flag and rewrite messages that may present compliance problems, using deterministic rules for standards like SOC 2 and GDPR combined with LLM-powered rewrites.
An open source framework by Microsoft (Adaptive Spec-driven Scoring for Evaluation and Regression Testing) that turns natural-language descriptions of intended AI behavior into structured, scored test cases for application-specific AI evaluation and regression testing.
An always-on agentic AI assistant by Microsoft built on the OpenClaw framework, designed to work within the Microsoft 365 ecosystem with a persistent identity that actively adapts to user behavior over time. It offers prepackaged skills for calendar management and meeting agendas, and allows users to develop custom skills through ongoing feedback.
Google's official phone dialer app for Android that now includes AI-powered fake call detection to protect users against deepfake voice impersonation scams by verifying caller identity through a device-to-device confirmation system over RCS.
Microsoft's Android-based software platform described as a chip-to-cloud system designed for securely building, deploying, and running multiple AI agents in an open, multi-agent environment.
An IBM AI-powered tool designed to accelerate mainframe application development and modernization, featuring an App Insights agent that leverages deep static analysis for application understanding of legacy code written in COBOL and PL/I.
An IBM proprietary program analysis and data processing library used for agent-based generation of unit, integration, API, and change-based tests, achieving higher developer ratings and superior code coverage benchmarks compared to open-source tools and zero-shot LLMs.
An AI-powered weather forecasting model developed by WindBorne Systems that uses deep learning and proprietary balloon sensor data to produce hourly forecasts with resolution down to 3 km, claiming greater accuracy than traditional forecasting systems including those from the ECMWF.
A $100 screenless wearable fitness tracker by Fitbit (Google) that features AI health coaching, an 8-day battery life, and focuses on pure health tracking without notifications or screen distractions.
An AI-powered personal context portfolio builder available at contextportfolio.ai that helps users create and manage their personal context for AI interactions.
An AI-powered web browser by Perplexity that acts as a chatbot-based search engine, capable of summarizing emails, browsing web pages, and performing tasks like sending calendar invites.
An AI agentic browser by Opera with contextual awareness that can perform tasks like researching, shopping, and writing code snippets, even while the user is offline.
A Y Combinator-backed AI-first, browser-native automation platform designed to autonomously complete tasks, fill out forms, and manage data across services like Gmail, Notion, Slack, Figma, and banking platforms by operating directly within the browser.
A privacy-focused web browser from the DuckDuckGo search engine company that blocks trackers and ads, doesn't track user data, features an enhanced scam blocker, and has recently introduced generative AI features including a chatbot.
A platform offering a free Claude-based AI skill and organizational transformation framework that allows businesses to run their operations using an exponential organization model.
A built-in profiling module in PyTorch that provides statistical summaries and temporal execution traces of CPU and GPU activities, helping developers identify performance bottlenecks in model training and inference.
A smart bird feeder with a built-in 4K AI camera that uses a proprietary bird-identification algorithm to identify over 10,000 bird species, sending notifications and recording visits via a companion app.
DJI's companion app for its drones that provides video editing tools including trimming, keyframing, filters, and an AI-powered in-app tracking feature that automatically follows selected subjects in 360-degree footage.
A coding benchmark developed by DataCurve that evaluates AI models on realistic, novel software engineering tasks built from scratch, designed to avoid memorization issues and better reflect real-world long-horizon coding capabilities.
OpenAI's autonomous coding agent that can execute software engineering tasks independently, designed for professional development workflows.
A platform that lets AI agents build and work with disposable copies of databases, enabling safe experimentation without risking production data. It provides isolated database environments for each AI agent to test changes independently.
A custom-built benchmark and game simulation designed to test how well different large language models can learn iteratively by writing scripts to control ships navigating gravitational physics challenges over 30 rounds of feedback.
A cloud security platform acquired by Google for $32 billion that identifies and remediates security vulnerabilities in real time, including in AI-generated code.
A personal AI assistant launched by Microsoft, inspired by the OpenClaw project, designed to help users with everyday tasks and information retrieval.
A Y Combinator-backed AI startup focused on machine learning tools and services.
An AI-powered feature for YouTube Premium users that generates personalized radio stations, playlists, and podcast recommendations based on genres, mood, or listening history.
An inference neocloud startup that rents out AI processing power optimized for the inference phase, deploying specialized SambaNova chips to deliver high-speed token generation for AI workloads.
An Intel-backed AI chipmaker that designs specialized processors optimized for AI inference workloads, claiming superior performance over GPUs and competing inference chips from Groq and Cerebras.
A fully managed search and vector database service by AWS designed for agentic AI workloads, capable of instantly scaling compute up during agent traffic bursts and scaling down to zero when idle, decoupling compute from storage to optimize cost.
An agent builder within Asana's work management platform that allows users to create AI-powered automations and deploy AI teammates to handle business workflows and processes.
Cloudflare's infrastructure platform providing persistent environments and instant scalability for AI agents, designed to handle machine-generated traffic patterns and agentic workloads.
A conversational AI platform founded by Oculus co-founders that offers distinct AI agents (Maya, Miles, Simone, and Charlie) with unique voices, personalities, and memory, designed to deliver natural-feeling spoken conversations with real-time search integration.
An AI research project by Andrej Karpathy that uses agent swarms to train LLMs on simple tasks, aiming toward recursive self-improvement of language models. Building blocks are available via a public GitHub repository.
A software development kit by Anthropic for building and deploying AI agents powered by Claude models, enabling tool use and multi-step task execution.
An open-source framework by Microsoft for building multi-agent AI systems where multiple AI agents can converse and collaborate to solve complex tasks.
An open-source SDK by Microsoft that integrates large language models into applications, enabling AI orchestration with plugins, planners, and memory capabilities.
A set of .NET libraries by Microsoft that provide unified abstractions for integrating AI services into .NET applications, offering a consistent API across different AI providers.
An AI-powered platform for building web applications from natural language prompts, enabling rapid prototyping and deployment.
A trust and safety company (now rebranded) that provides content moderation and online safety assessment tools, used by platforms like Meta to stress test and evaluate teen safety features.
A benchmark and evaluation framework by MLCommons designed to measure how AI models behave under different conditions, focusing on safety and responsible AI assessment.
A holistic evaluation framework developed by Stanford's Center for Research on Foundation Models (CRFM) that provides comprehensive benchmarks to assess language model capabilities across multiple dimensions.
Google's e-book platform that now includes AI-powered features such as 'Catch me up' for story recaps and the ability to highlight passages to ask questions about the text.
The core AI intelligence layer behind Microsoft 365 Copilot, providing the underlying AI capabilities that power Microsoft's productivity agent experiences.
Microsoft's scientific research platform powered by AI, now generally available, designed to accelerate scientific discovery and research workflows.
A sandboxing system by Microsoft that provides enterprise-grade sandbox environments for AI agents, with OS-enforced containment to securely run agent code, file access, and network interactions.
A benchmark developed by Artificial Analysis and IBM that evaluates AI models on agentic enterprise IT tasks, starting with Site Reliability Engineering (SRE) tasks involving Kubernetes incident response, diagnosis, and root-cause identification.
A lightweight, accurate voice activity detection model that runs on CPU, widely used as the de-facto default VAD in open-source voice agent pipelines.
Snowflake's AI building tool that enables enterprise users to interact with their data using natural language queries, generate summary reports, and build AI-powered features on top of Snowflake's data platform.
An autonomous AI software engineer developed by Cognition that can independently plan, write, debug, and deploy code, serving enterprise customers like Mercedes-Benz, NASA, and Goldman Sachs.
A closed-loop, in-ear sleep system by SOND that captures 12 physiological signals in real time and uses a cloud-based AI sleep coach to select or generate personalized sleep audio programs to help users improve their sleep.
An open-source columnar database management system designed for real-time analytics on large datasets, offering managed cloud services particularly suited for processing the massive data requirements of AI agents.
A beta feature from Robinhood that enables users to create dedicated accounts for AI agents, allowing those agents to read portfolios, analyze data, suggest investments, and execute stock trades using a pre-loaded wallet balance.
Samsung's AI-powered TV processor that uses 128 neural networks to automatically upscale non-4K content to higher resolution and reduce motion blur on Samsung Neo QLED displays.
DuckDuckGo's dedicated AI-free search page that turns off all AI features including AI-assisted answers and AI-generated images by default.
A Silicon Valley-based startup that collects egocentric video and multi-sensor data from gig economy workers using custom hardware devices to create training datasets for robotics and physical AI models.
A fine-tuned version of Gemini developed by Google DeepMind with specialized tools for scientific research, including hypothesis generation, data analysis, and literature summarization, functioning as an AI research assistant.
A startup that uses AI automation to handle all software operations for solopreneurs, reportedly run by a single person and valued at $250 million.
An AI-powered cryptocurrency analysis platform that provides real-time market insights, trading indicators, structured strategies, and expert analysis to help traders make smarter investment decisions. It includes a Discord community with daily market analysis and altcoin coverage.
An AI-powered wearable pendant under development by Meta, building on technology from its acquisition of Limitless, designed to be worn as a necklace or clipped to clothing for conversational AI assistance.
A wearable AI device that clips onto clothing and uses an on-board microphone to record conversations throughout the day, generating AI-powered summaries of interactions and meetings.
A reliability engineering agent startup that analyzes AI-generated code quality, reporting that companies spend 44% of their tokens on fixing bugs introduced by AI-generated code.
An AI-powered legal assistant by Thomson Reuters that helps lawyers with legal research, document review, and other routine legal tasks.
IBM's enterprise AI and data platform used to process and analyze large volumes of data, power AI-driven fan engagement, content personalization, and storytelling applications for businesses and sports organizations.
Visa's suite of AI-powered payment tools designed to enable agentic commerce, allowing AI agents and developers to process and manage payments securely.
A tracking service that monitors daily GPU rental pricing across 28 marketplaces and cloud providers, providing market data on GPU costs for AI compute.
A pair of specialized small language models by Dharma AI designed for structured OCR tasks, particularly Brazilian Portuguese document extraction. It includes a 3-billion-parameter model that outperformed major frontier APIs on domain-specific OCR benchmarks at roughly fifty times lower cost.
Google's personal AI agent that runs 24/7 on Google's virtual machines and can take actions across Gmail, Docs, Sheets, and other Google Workspace apps on behalf of the user. It supports MCP (Model Context Protocol) for connecting third-party apps and is available to Gemini Ultra plan subscribers in the US.
An open-source reference harness for running agentic AI evaluations, providing shell access to sandboxed file systems for models to investigate and solve tasks with a configurable turn cap.
Hugging Face's managed service for deploying machine learning models as production-ready API endpoints, supporting various model architectures and scaling configurations.
A $7.99/month subscription plan from Meta that provides enhanced AI capabilities including deeper reasoning for complex tasks and expanded image and video generation features across Meta's apps.
A $19.99/month subscription plan from Meta offering the highest tier of AI compute, including extended thinking mode, deeper reasoning for complex tasks, and expanded video and image generation capabilities.
Google's AI-powered coding agent designed to assist developers with software engineering tasks, bolstered by Google's acqui-hire of the Windsurf team.
An AI-powered investment research platform acquired by Robinhood in 2024 that provides data-driven analysis and insights for investors.
A startup building payment products that give AI agents the ability to make purchases and transactions on behalf of users.
A standalone desktop app by Spotify that uses AI to generate personalized podcasts by connecting to users' email, calendar, and other personal data sources, with an agent that can browse the web and fetch information.
An AI therapy and coaching app co-founded by Tony Robbins and former Calm employees that offers personalized mental health support through specially trained AI models and virtual AI therapists, scoring 95 on mental health safety benchmarks.
Google's AI-powered phone calling service that can make real-world bookings at restaurants and salons on behalf of users, first demonstrated at Google I/O 2018.
An AI-powered waste sorting system that uses sensors, visible light, and infrared cameras combined with robotic arms to identify and separate recyclable materials including aluminum, plastics, and other waste stream components with over 90% accuracy.
A metals recycling startup that uses AI algorithms combined with lasers, cameras, and X-ray fluorescence sensors to classify and sort aluminum scrap by specific grade, enabling higher-accuracy separation and greater profit per pound.
A paid, ad-free search engine that offers customizable search experiences with features like website filtering, search lenses, and an optional AI-powered 'Quick Answer' summarization feature that can be toggled off.
A privacy-focused search engine that does not collect user data such as search, browsing, or purchase history, and offers optional AI-generated answers that users can disable in settings.
A privacy-focused search engine that acts as a proxy for Google, stripping personal data like IP addresses from queries before sending them to Google and returning results anonymously.
A lightweight search tool that automatically appends the &udm=14 parameter to Google searches, delivering standard Google results without AI Overviews or other AI-generated content.
A Spotify Premium feature that allows mobile users to ask AI-powered questions about podcast episodes they are listening to, get answers about concepts mentioned, and receive podcast recommendations.
A marketing company that deploys AI agents to mass-publish content across Reddit and blogs to influence Google and ChatGPT rankings through generative AI-engine optimization (GEO).
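As an illustration of the &udm=14 approach described in this entry, here is a minimal sketch in Python. It assumes the standard `https://www.google.com/search` endpoint with a `q` query parameter; the function name `web_only_search_url` is hypothetical, and this is not the listed tool's actual implementation.

```python
from urllib.parse import urlencode

# Assumed endpoint for illustration; not taken from the listed tool.
GOOGLE_SEARCH = "https://www.google.com/search"


def web_only_search_url(query: str) -> str:
    """Build a Google search URL with udm=14 appended to the query string."""
    # urlencode percent-encodes values and turns spaces into '+'.
    params = {"q": query, "udm": "14"}
    return f"{GOOGLE_SEARCH}?{urlencode(params)}"


if __name__ == "__main__":
    print(web_only_search_url("pytorch profiler tutorial"))
```

Opening the printed URL in a browser should show the plain results view that the entry describes, although Google could change the parameter's behavior at any time.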
An AI-powered desktop companion for PCs and Macs that learns a user's daily workflows and automates them with little to no human prompting. It includes a built-in skills library for tasks like email drafting, invoice processing, report building, and document summarization, as well as a coding assistant.
An Andreessen Horowitz-backed AI search startup that provides an AI-native search API and platform, valued at $2.2 billion after raising $250 million to transform how search and discoverability work.
An AI assistant built into Figma's collaborative design canvas that uses natural language prompts to generate new designs, edit existing ones, and automate design tasks, powered by AI models fine-tuned for design use.
A marketing infrastructure platform that uses AI to optimize the distribution strategy and clipping process for short-form video content, leveraging a network of over 100,000 gig creators and continuous testing loops to determine what makes content go viral.
Google's in-car platform that integrates with vehicle infotainment systems, now incorporating Gemini-powered AI features to provide more helpful and safer in-car interactions for drivers.
A command-line AI agent tool that supports plugging in any model, offering a flexible and model-agnostic approach to agentic coding workflows.
A collaboration and project management software platform that has integrated approximately 3,000 internal AI agents to automate complex tasks and boost employee productivity.
A platform by Allen Institute for AI that enables partners and organizations to deploy OlmoEarth models for satellite imagery analysis and environmental monitoring at scale.
A Hugging Face skill for AI coding agents that bootstraps training recipes for Sentence Transformers models, enabling fine-tuning of cross-encoders and embedding models through AI-assisted development.
Google's command-line interface tool (now stable at version 1.0) that enables AI coding agents to access Android development knowledge and capabilities from Android Studio, regardless of the user's preferred coding platform.
A Google shopping hub that lets users add products from across Google services (Search, Gemini, YouTube, Gmail), track deals, monitor price drops, surface price history, and flag product compatibility issues using AI.
A Google protocol designed to prevent AI agents from making unauthorized purchases by allowing users to set strict spending limits, purchase restrictions, and approved merchant lists for AI agent transactions.
A Google protocol that enables AI agents to purchase products, order goods, and reserve hotels on behalf of users, with partners including Shopify, Amazon, Walmart, Stripe, Salesforce, and Meta.
A Gemini AI-powered conversational feature for Gmail that allows users to ask natural language questions about their inbox using voice, find buried information, handle follow-up questions, and navigate between topics seamlessly.
An AI-powered Gmail feature that provides an overview of tasks and items to catch up on from a user's inbox, organized on a single page, expanding from Google AI Ultra to Pro and Plus subscribers.
AI agents within Google Search that operate continuously in the background to monitor, synthesize, and deliver updates on user-specified topics of interest, functioning as an advanced evolution of Google Alerts with agentic capabilities.
An agentic email security platform that uses AI to analyze the context of every incoming email to detect fraud, phishing, and impersonation attempts, particularly those powered by AI. It employs a small language model tailored to quickly analyze emails, understand sender intent, and evaluate against organizational context.
Google Maps' panoramic street-level imagery service covering 110 countries with over 280 billion images, now integrated with Google's Genie world model to enable interactive, AI-generated simulations of real-world locations.
Google's AI-powered search experience that integrates Gemini 3.5 Flash to provide AI-generated answers and agentic capabilities directly within Google Search.
An AI-powered conversational search feature within Google Search that has surpassed 1 billion monthly users, allowing users to ask natural language questions and have back-and-forth conversations about search results.
Google's agentic development platform that enables the creation of AI agents capable of performing tasks autonomously, used in combination with Gemini to power interactive search experiences.
Sony's AI-powered audio upscaling technology that enhances compressed audio files to near high-resolution quality, providing an improved listening experience on compatible Sony headphones and devices.
Samsung's AI-powered motion processing technology built into its TVs that smooths fast on-screen movements and reduces blur or flickering, particularly useful for sports viewing and fast-paced content.
A voice-enabled AI feature for Google Docs that allows users to dictate their thoughts and have AI brainstorm, write, outline, and refine content into a first draft.
A Gmail feature that acts as a personalized briefing tool, providing updates on email topics, suggesting next steps, generating draft replies, offering instant access to relevant Google Docs/Sheets/Slides, and streamlining task management.
A YouTube app for video creation that is integrating Gemini Omni Flash to enable AI-powered video generation and editing for YouTube Shorts, available at no cost to users.
Apple's gesture-based screen reader enhanced with AI capabilities including Image Explorer for detailed content descriptions and Live Recognition activated via the iPhone Action button for follow-up questions about on-screen content.
An Apple Intelligence-powered feature within VoiceOver that provides detailed AI-generated descriptions of device displays and content for users who are blind or have low vision.
An Apple accessibility tool enhanced with AI that helps users navigate complex text like scientific studies with columns, images, and tables, providing on-demand summaries and language translation while preserving custom formatting.
An opt-in AI-driven service from Google that aggregates information from a user's email, calendar, and other sources to generate a daily summary with prioritized tasks and upcoming events, powered by Gemini AI.
A new Google feature that integrates YouTube video results directly into Google Search, allowing users to ask conversational questions and watch relevant YouTube tutorials or videos within an AI-generated interactive search response.
An AI-powered wearable device worn on the wrist that records, transcribes, and summarizes the user's conversations throughout the day, functioning as a personal assistant with calendar integration and reminder features.
An open framework for running and reproducing AI agent evaluations, designed to work alongside the Open Agent Leaderboard for standardized agent benchmarking.
An OCR and document parsing toolkit by PaddlePaddle that supports multiple inference backends including Hugging Face Transformers, providing capabilities for text recognition, document layout analysis, and structured data extraction from documents.
A feature within Amazon's Alexa+ that generates on-demand podcast episodes by researching a user-specified topic, creating a script, and narrating it with AI-generated host voices, with options to customize length, tone, and focus.
A Swiss deep tech company spun out of ETH Zurich that builds an AI-powered augmented reality motorcycle helmet displaying navigation, speed, and safety alerts anchored to the real-world road ahead using LetinAR's optical modules.
An AI-powered website builder that creates complete websites and online stores from a simple text description of your business, including branded pages, navigation, product layouts, logos, and product descriptions, all without requiring any coding knowledge.
No URLAn internal Meta tool that tracks U.S. employees' mouse movements, keystrokes, clicks, and other computer interactions to build AI agents capable of performing software tasks autonomously.
No URLAn AI-native database platform at ghost.build designed specifically for AI agents rather than human users, featuring no traditional dashboard or UI and built to be accessed programmatically by autonomous agents.
Visit Site →Google Cloud's computer vision API that provides image analysis capabilities including OCR, object detection, and label detection for developers and enterprises.
Visit Site →Google Cloud's document processing platform that uses machine learning to extract structured data from unstructured documents, including forms, invoices, and other business documents.
Visit Site →An AWS machine learning service that automatically extracts text, handwriting, and structured data from scanned documents, going beyond simple OCR to identify form fields and table contents.
Visit Site →Google's AI-powered translation service that provides real-time language translation, integrated into Android XR glasses for live translation capabilities.
Visit Site →A legal technology startup that provides AI tools for drafting and reviewing legal documents.
Visit Site →A legal AI startup that offers tools for legal professionals to streamline document-related workflows.
Visit Site →The latest version of Replit's AI-powered coding agent that enables vibe coding — building apps through natural language prompts. Agent 4 introduces parallel agents for working on multiple ideas simultaneously, collaboration features for merging project flows, and multi-workspace project views.
No URLGoogle's AI-powered reinvention of Google Alerts that deploys background AI agents to monitor topics 24/7, helping users track market trends, price changes, weather warnings, and other interests.
Visit Site →A Gemini-powered AI feature that compiles a personalized daily digest from a user's Gmail inbox, calendar, and tasks, delivering a summarized update of their day.
No URLAn app built by former NotebookLM developers that enables users to create personal AI-generated podcasts on any topic.
Visit Site →A new app by Meta/Facebook designed for deeper discussions and Q&A, leveraging Facebook Groups communities to provide real answers, with an AI assistant that helps fetch answers and assists group admins with moderation.
Visit Site →An open-source, Apple-only LLM server that lets Mac users run and switch between local and cloud AI models while keeping files, tools, and model memory on their own hardware in a sandboxed environment.
Visit Site →Google's health and fitness companion app that replaced the Fitbit app, featuring AI-powered coaching via Gemini integration, health data tracking, and personalized fitness insights for Fitbit device users.
Visit Site →An AI search startup building search and discoverability tools, part of the emerging wave of AI-native search platforms.
Visit Site →A node-based design tool that was acquired by Figma to enhance its design capabilities and product offerings.
Visit Site →A startup operating in the automated video clipping space, competing with platforms like Clouted to help brands create and distribute short-form video content.
Visit Site →An enterprise marketing infrastructure platform focused on influencer and creator marketing management, used by brands to manage and measure creator-driven campaigns.
Visit Site →Google's upcoming AI-powered extended reality smart glasses running on the Android XR platform, designed to overlay intelligent features onto the real world.
No URLGoogle's core search engine, which is being fundamentally transformed with generative AI capabilities to provide AI-generated overviews and more conversational search experiences.
No URLAn open-source hardware project that pairs a Waveshare ESP32-S3-Touch-AMOLED-2.16 device with a laptop over Bluetooth to display Claude Code token usage statistics, session data, and pixel-art animations on a tiny desktop dashboard.
No URLA data supply platform that connects over 700,000 artists and designers with AI labs, providing multimodal datasets of images, videos, design assets, and 3D content for training foundation models.
Visit Site →An AI-assisted coding feature built into Microsoft Visual Studio that provides predictive suggestions, context-aware recommendations, and intelligent code completion to help developers write code more efficiently.
No URLGoogle's integration of Gemini AI directly into Android Studio IDE, providing AI-powered coding assistance specifically tailored for Android app development.
Visit Site →Google's official integrated development environment (IDE) for Android app development, now enhanced with AI capabilities including Gemini integration for code assistance.
Visit Site →Google's modern declarative UI toolkit for building native Android apps using Kotlin, now integrated into Google AI Studio's app creation workflow.
Visit Site →Google's AI creative studio that integrates with Gemini Omni Flash for video creation and other creative workflows.
Visit Site →A new Android system feature by Google that allows users to track the progress of AI agents like Gemini Spark on their mobile devices.
Visit Site →A startup founded by Andrej Karpathy dedicated to applying AI assistants to education, aiming to transform how people learn through AI-powered tools.
Visit Site →Google's note-taking and to-do list application that is gaining voice-powered AI capabilities similar to Gmail Live for conversational interaction with notes and tasks.
Visit Site →A Google Search feature that connects with personal data sources like Gmail to provide personalized AI-powered search results, expanding to nearly 200 countries and 98 languages.
Visit Site →A Google Labs experimental feature that sends users email summaries including 'top of mind' and 'FYI' sections along with a calendar summary, serving as the precursor to Google's Daily Brief feature.
No URLA new suite of AI services from Anthropic designed specifically for small business owners, offering automated bookkeeping, business insights, ad campaign generation, and integrations with tools like QuickBooks, Canva, Docusign, HubSpot, and PayPal.
No URLA startup that serves as a marketplace connecting video game companies with AI world-model labs, enabling the licensing and conversion of video game assets into high-quality training data for AI models that need to understand physical-world dynamics.
Visit Site →Amazon's personalized AI shopping assistant powered by Alexa+ that replaces the earlier Rufus assistant, offering voice- and touch-enabled shopping with personalized recommendations, price tracking, recurring orders, and cross-retailer purchasing capabilities.
Visit Site →Anthropic's law-focused plug-in for Claude that provides legal-specific AI features tailored for lawyers and legal professionals.
No URLNotion's new developer platform that extends custom AI agent capabilities, connects with external agents, and allows teams to build automated multistep workflows with custom code deployment and database syncing.
No URLAn AI-powered productivity and collaboration workspace that combines features similar to Notion and Miro, offering AI-assisted summarization, rewriting, idea refinement, document creation, task planning, and visual workflow building in a single platform.
No URLAn AI-powered robotic pool cleaner by Beatbot featuring advanced navigation, automation, and high-performance all-zone cleaning coverage across floors, walls, waterline, and surface.
Visit Site →Google's AI-powered search experience that replaces traditional blue links with an AI agent that answers queries, executes tasks, and runs background monitoring agents.
No URLA benchmark suite for evaluating AI agents on customer service and technical support tasks, including airline and retail scenarios that require following company policies.
Visit Site →A machine learning experiment tracking and visualization platform that helps developers log, compare, and reproduce model training runs.
Visit Site →A developer tools startup that automated the creation and maintenance of production-ready SDKs from API specifications across multiple programming languages, acquired by Anthropic for over $300 million.
Visit Site →An AI voice platform that enables companies to build, deploy, and manage voice agents for customer support, lead qualification, appointment scheduling, and outbound sales. It provides low-latency voice infrastructure and orchestration tools, and has processed over 1 billion calls.
Visit Site →A proactive AI personal assistant app by Second Nature Computing that consolidates calendar, email, messages, and location data into a single dashboard, using AI to anticipate user needs and offer contextual suggestions.
Visit Site →An AI tool by Adaption that helps models learn specific capabilities quickly by co-optimizing both data and models through an automated approach to conventional fine-tuning, enabling continuous model improvement.
Visit Site →A voice AI agent developed by Pair Team that serves as a 24/7 patient-facing interface for healthcare, handling intake, coordinating referrals, and conducting check-in conversations with patients between clinical visits.
Visit Site →A Google Android feature that lets users vibe-code custom widgets by describing what they want in natural language, with Gemini generating personalized dashboards that can pull data from the web and Google apps.
Visit Site →An AI feature integrated into the Google Chrome browser that lets users summarize web page content, ask questions about what they see, and use an experimental auto-browse feature that navigates websites and completes tasks on their behalf.
Visit Site →An AI-powered mouse cursor feature by Google for Googlebook laptops that suggests contextual AI actions based on what the user hovers over, such as scheduling meetings from dates found in emails.
Visit Site →Google's widely used Android keyboard app that incorporates AI-powered features including predictive text, voice dictation, and the new Gemini-powered Rambler feature for intelligent speech-to-text transcription.
Visit Site →A production-focused AI design tool that allows teams to run and iterate on their existing codebases in the cloud, enabling designers to work directly in production environments and hand off work to developers seamlessly.
Visit Site →An AI-powered project management platform by YaseenAI, Inc. that supports Kanban boards, Gantt charts, sprint planning, issue tracking, and built-in documentation with AI features for generating summaries, organizing information, and automating repetitive administrative tasks.
No URLAn e-signature platform by YaseenAI, Inc. that offers unlimited document signing with AI-powered field detection via its Nova AI assistant, which automatically identifies where signatures, initials, and dates should be placed in documents.
Visit Site →xAI's coding and app-building tool that allows users to generate software applications using AI, released as part of the Grok platform.
Visit Site →A financial data connectivity platform that enables applications to connect with users' bank accounts, credit cards, and investment accounts, now integrated with ChatGPT Pro for AI-powered financial analysis.
Visit Site →A cyber defense suite of tools by OpenAI that leverages the company's large language models to find and remedy software vulnerabilities, offering capabilities such as secure code review, threat modeling, patch validation, malware analysis, and penetration testing.
Visit Site →A multi-agent AI system for CNC manufacturability analysis that processes STEP CAD files and generates complete manufacturability reports, identifying required tools, feasibility issues, and production recommendations for machine shops.
Visit Site →A privacy-focused AI platform that provides access to various AI models with an emphasis on uncensored and private interactions.
Visit Site →A desktop application that allows users to discover, download, and run local large language models on their personal computers with a user-friendly graphical interface.
Visit Site →An AI feature built into the Durobo Krono e-reader that provides AI-powered assistance directly on the e-ink device.
Visit Site →An open-source, privacy-preserving clinical decision support system for oncology that combines a dual-tier fine-tuned LLM architecture with a multi-agent LangGraph topology, Corrective RAG pipeline over 70+ NCCN and ESMO guidelines, and a reflexion safety validator enforcing a Zero-PHI policy.
Visit Site →A framework by LangChain for building stateful, multi-agent applications using directed graph architectures, enabling complex workflows with LLMs including routing, tool-calling, and memory management.
Visit Site →A Microsoft Word add-in that integrates Anthropic's Claude AI directly into the Word interface, allowing users to generate text, highlight and edit specific sections, reformat content, and work across multiple Office files with shared context.
Visit Site →Google's revamped Search experience that foregrounds AI-generated summaries at the top of search results, pushing traditional web links further down the page.
No URLAn AI startup founded by Ian Crosby that aims to build a fully autonomous AI bookkeeper capable of generating accrual-based financials without direct human involvement, targeting AI and software startups as customers.
Visit Site →An open format for representing machine learning models that enables interoperability between different ML frameworks and optimized deployment across various hardware platforms.
Visit Site →Intel's open-source toolkit for optimizing and deploying AI inference, particularly focused on CPU-optimized performance for deep learning models across Intel hardware.
Visit Site →A data labeling platform that provides high-quality human-generated datasets for training and fine-tuning AI and machine learning models.
Visit Site →An AI-powered search and chatbot platform founded by Richard Socher that provides conversational search and AI assistant capabilities.
Visit Site →An AI platform co-founded by Tim Shi that provides real-time intelligence and coaching for contact center agents, helping businesses improve customer interactions and sales performance.
Visit Site →A serverless computing platform by Cloudflare that lets developers build and deploy applications on Cloudflare's global edge network, including AI-powered features such as vibe coding for software development.
Visit Site →A plug-and-play gaming console that uses an AI-powered wide-angle camera to track players' natural body movements, enabling controller-free interactive gaming experiences.
Visit Site →An AI-powered personalized coaching system built into the Whoop fitness tracker platform, featuring a 'My Memory' centralized hub and 'Proactive Check-Ins' that deliver tailored health and fitness recommendations based on continuous biometric data.
Visit Site →Amazon's AI-powered voice assistant integrated into the Echo Show 5 smart display, enabling voice-controlled smart home management, music streaming, video calls, and information display on a 5.5-inch screen.
No URLOpenAI's business-tier offering of ChatGPT designed for smaller teams, providing enterprise-grade AI assistant capabilities with team management and collaboration features.
Visit Site →An Amazon Alexa for Shopping feature that enables the AI assistant to autonomously shop other online stores and handle purchases on behalf of the user, extending Amazon's shopping capabilities beyond its own marketplace.
Visit Site →A data intelligence platform for legal research that was acquired by Clio, enabling lawyers to use AI-powered legal research capabilities within the Clio ecosystem.
Visit Site →An OpenAI tool that uses AI to find security vulnerabilities.
No URLA Hugging Face library for Parameter-Efficient Fine-Tuning that enables adapting large pretrained models using techniques like LoRA without modifying all model parameters.
Visit Site →AMD's open-source software platform for GPU computing that provides an alternative to NVIDIA's CUDA, enabling deep learning and HPC workloads on AMD GPUs.
Visit Site →A Stockholm-based enterprise AI startup founded by former Voi co-founders that creates custom AI-generated software to automate back-office business processes, featuring Pit Studio for process guidance and Pit Cloud for governance-compliant deployment.
Visit Site →A self-driving technology platform developed by Aurora Innovation for autonomous long-haul trucking, currently running a commercial driverless freight service between Dallas and Houston.
Visit Site →A beta command-line interface tool by Spotify that allows users to generate AI-created personal podcasts using coding agents and import them directly into their Spotify library.
Visit Site →An interactive AI-powered feature within Spotify that creates personalized music mixtapes with human-like voice commentary, allowing users to make requests by voice or text to change mood, genre, or songs.
Visit Site →An AI-powered healthcare platform that automates specialty referral intake by reading faxed documents, extracting clinical information, and using AI voice agents to call patients and schedule appointments. Founded by former Lyft/Cruise and Medtronic executives, it integrates with electronic medical record systems for specialties like cardiology and urology.
Visit Site →An official Anthropic add-on that integrates Claude AI directly into Microsoft PowerPoint, enabling users to generate, edit, and format fully editable native PowerPoint slides through natural language prompts while respecting existing templates and styles.
Visit Site →An AI coworker platform that operates with its own cloud-based computer, connecting to thousands of tools to autonomously execute complex tasks like building dashboards, pulling ad data, and managing CRM workflows within existing team tools like Slack and Microsoft Teams.
Visit Site →A conversational AI platform specializing in voice assistants for enterprise customer service, enabling natural-sounding phone interactions without human agents.
Visit Site →An AI voice platform that enables businesses to automate phone calls using AI-powered voice agents for sales, support, and other use cases.
Visit Site →An AI voice agent platform that allows developers and businesses to build, test, and deploy conversational voice AI agents for phone interactions.
Visit Site →An AI-powered expert network platform that uses voice-based onboarding and conversational AI to deeply profile experts' knowledge and skills, then matches them with companies seeking specialized advice using natural language queries.
Visit Site →A free Amazon price tracking tool created by Daniel Green that provides historical price charts for millions of Amazon products and sends price-drop alerts to users.
Visit Site →A newly launched Walmart price tracking tool from the creators of camelcamelcamel that provides historical price charts and deal alerts for Walmart products using Walmart's official API.
Visit Site →A browser extension companion to camelcamelcamel that allows users to quickly view Amazon product price history and set deal alerts directly from product pages.
Visit Site →A company that mass-produces AI-generated podcast episodes, contributing to the rise of automated audio content creation.
Visit Site →AWS accelerated computing instances featuring NVIDIA H100 GPUs designed for large-scale foundation model training and inference workloads, available in configurations including p5.48xlarge (8 GPUs) and p5.4xlarge (single GPU).
Visit Site →A San Francisco-based startup that provides an AI-powered intelligence layer for physical sciences companies, unifying fragmented technical data from batteries, semiconductors, and medical devices to rapidly diagnose failures and accelerate product development.
Visit Site →A native app integration within ChatGPT that allows users to search and browse Etsy's catalog of over 100 million listings using natural language prompts, surfacing relevant product recommendations directly in the chat interface.
Visit Site →An open-source platform and enterprise toolkit that enables developers to deploy AI agents natively within their applications, providing dynamic UI generation, state sharing, and human-in-the-loop functionality rather than simple chatbot interfaces.
Visit Site →An open-source protocol developed by CopilotKit that standardizes how AI agents connect to and communicate with user interfaces, providing features such as streaming chat, front-end tool calls, and state sharing.
Visit Site →An open-source orchestration tool designed to serve as the coordination layer for fully automated, agent-driven business operations with minimal human intervention.
Visit Site →A cloud computing platform (hpcai.com) offering affordable GPU and CPU instances for running AI models and workloads, with pricing as low as 24 cents per hour for CPU instances.
No URLAn AI-powered tool integrated into the Unity game engine that allows developers to use natural language prompts to generate game mechanics, scripts, assets, and scenes, accelerating game development workflows.
Visit Site →A Google Labs AI-powered marketing tool that generates on-brand social media and advertising content for businesses by learning from their website and brand DNA, including brand values, aesthetics, and tone of voice.
Visit Site →A four-legged AI-powered companion robot by Familiar Machines & Magic, designed to build emotional connections with its owner using on-device generative AI rather than perform practical tasks. It features 23 degrees of freedom, touch-sensitive coat, and onboard cameras and microphones that operate without internet connectivity.
Visit Site →An AI startup building voice-based AI agent infrastructure, enabling developers to create and deploy conversational voice AI agents for various use cases.
Visit Site →An AI-powered initiative by Wonder that enables anyone to design and launch a virtual restaurant brand in under a minute, using AI to generate the name, branding, descriptions, images, pricing, health information, and recipes.
Visit Site →A New York-based AI startup focused on healthcare document intelligence that uses proprietary language models trained on tens of millions of medical documents to automate referral processing and administrative workflows for medical practices.
Visit Site →A Lightspeed-backed AI startup that automates patient phone communication for specialty medical practices, handling inbound and outbound calls to reduce administrative burden.
Visit Site →A robotics AI startup that built foundation models for humanoid robots, enabling them to understand, predict, and adapt to human behaviors in complex environments for physical labor tasks. Acquired by Meta.
Visit Site →Microsoft's cloud platform offering AI services, model hosting, and enterprise AI solutions. Microsoft has signed deals with the U.S. Department of Defense to deploy AI technology on classified networks.
No URLSamsung's built-in AI processor for smart TVs that automatically upscales content to 4K resolution using 20 neural networks for improved picture quality.
Visit Site →An interactive neural cellular automata simulation developed by Sakana AI in Tokyo that allows users to create and observe AI species competing, collaborating, and evolving in a 2D grid environment with adjustable environmental parameters.
Visit Site →An AI algorithm developed by Google DeepMind designed to correct errors in quantum computing calculations involving qubits, improving the reliability of quantum computations.
Visit Site →A reinforcement learning training framework that uses rollout-side logprobs from inference engines to compute policy ratios, KL divergence, clip rates, and rewards for online RL training of language models.
Visit Site →An AI system developed by Google DeepMind that uses reinforcement learning to discover faster computer science algorithms, notably achieving breakthroughs in sorting algorithm optimization.
Visit Site →An opt-in set of security protections for ChatGPT accounts designed for high-value individuals, featuring hardware security key integration to protect against phishing and unauthorized access.
Visit Site →An AI copilot for ultrasound that helps detect fetal abnormalities, designed to reduce misdiagnosis rates in prenatal imaging. The medical device has received FDA clearance and is being deployed in hospitals.
Visit Site →A digital wallet by Stripe designed for the AI era, allowing users to connect payment methods, track spending, manage subscriptions, and securely grant autonomous AI agents permission to make purchases on their behalf.
Visit Site →A Stripe product that lets users issue virtual cards for AI agents to make autonomous purchases, featuring real-time authorization, spending controls, and full transaction visibility.
Visit Site →Salesforce's AI agent management platform that enables enterprises to build, deploy, and manage autonomous AI agents for customer relationship management and business workflows.
No URLMeta's AI-powered business assistant integrated into its messaging apps (including WhatsApp, Messenger, and Instagram) that helps small businesses automate and manage customer conversations at scale.
Visit Site →A suite of generative AI tools from Meta that help advertisers create ad content, including video generation features, with over 8 million advertisers using at least one of the tools.
Visit Site →A Meta platform feature launching in open beta that allows advertisers to connect their Meta ad account to an AI agent for automated advertising management.
Visit Site →An AI-powered agent by OpenAI integrated into ChatGPT that functions as an autonomous teammate capable of completing tasks from plain English instructions, including generating files in standardized formats like Excel, Word, and PowerPoint.
No URLSAP's AI copilot and agentic platform, currently in beta, that lets enterprise customers create their own AI agents within the SAP ecosystem.
Visit Site →SAP's infrastructure service for running and managing AI models and AI-powered applications within the SAP Business Technology Platform.
Visit Site →SAP's cloud-based data platform that unifies enterprise data across SAP and third-party sources to enable AI and analytics workloads.
Visit Site →A Greylock-backed AI startup that uses artificial intelligence to diagnose and resolve software failures and outages.
Visit Site →An open protocol developed by Google that enables AI agents from different frameworks and vendors to communicate and collaborate with each other.
Visit Site →An open-source toolkit by Vercel that helps developers build AI-powered web applications with features for streaming, chat interfaces, and generative UI capabilities.
Visit Site →A Finnish AI company, incubated by Peter Sarlin's family office, that raised approximately $115 million in a funding round led by Finland's sovereign fund and Nokia.
Visit Site →A cost-efficient evaluation procedure that uses a coarse-to-fine approach to reduce HELM benchmark compute costs by 100× to 200× while preserving nearly the same model rankings.
Visit Site →A serverless AI inference platform offering cost-effective per-token pricing with a catalog of over 100 models, supporting LLMs, text-to-image, text-to-video, embeddings, and more.
Visit Site →An AI agent-tool startup founded by former Twitter CEO Parag Agrawal that offers a suite of web search and research APIs specifically designed for AI agents, with over 100,000 developers using its products.
Visit Site →A social chat app where humans and AI characters interact together in shared group conversations. Users can create custom AI characters called 'Shapes' with distinct personalities to participate alongside real people in community-style group chats.
Visit Site →A defense-focused AI startup building an AI model called 'Fury' designed to operate and command military assets, including autonomous ground vehicles, using vision-language-action (VLA) models built on top of large language models.
Visit Site →An open-source tool created by Red Hat engineer Sally O'Malley that loads OpenClaw onto Red Hat's Fedora Linux OS in a rootless Podman container as a bootable image, making it easier and safer to deploy and manage OpenClaw agents at enterprise scale.
Visit Site →An AI meeting notetaker and enterprise productivity app with 35 million users that transcribes meetings, provides summaries, and now offers enterprise search across connected tools like Gmail, Google Drive, Notion, Jira, and Salesforce using the Model Context Protocol (MCP).
No URLA new AWS service for creating AI agents built on OpenAI's reasoning models, offering features like agent steering and security as part of Amazon's Bedrock platform.
Visit Site →A non-invasive brain-computer interface (BCI) platform that uses EEG sensors and AI-powered signal processing to analyze brain activity and provide cognitive performance insights, licensable for integration into consumer wearables like headphones, hats, and glasses.
Visit Site →An AI reconstruction model developed by NVIDIA and Siemens Healthineers that learns directly from raw ultrasound sensor data to generate patient-specific sound-speed maps for adaptive image focusing, bypassing traditional beamforming pipelines.
Visit Site →An edge AI sensor processing platform by NVIDIA designed for high-performance, real-time workloads, enabling accelerated inference and data streaming on edge systems like NVIDIA IGX.
Visit Site →An open-source PII detection model by OpenAI, with 1.5B total parameters and 50M active parameters, that identifies personally identifiable information across eight categories in a single forward pass over a 128k-token context window. It is licensed under Apache 2.0.
Visit Site →A web application built on OpenAI's Privacy Filter that allows users to upload PDF or DOCX files and view them with every PII span highlighted by category, with filtering and summary dashboards.
Visit Site →A web application that uses OpenAI's Privacy Filter combined with OCR to automatically detect and redact PII in images with toggleable black bars, supporting manual annotation and client-side PNG export.
Visit Site →A pastebin-style web application that uses OpenAI's Privacy Filter to automatically redact PII from pasted text, generating a public redacted URL and a private reveal link for the original content.
Visit Site →A software platform that transforms rough ideas into clear, detailed prompts for AI tools by automatically building in context, tone, and intent, with guided workflows and prompt generators for various use cases including blog posts, social media, and image generation.
Visit Site →A computational knowledge engine that provides step-by-step solutions, practice problems, and guided calculators for math, physics, chemistry, and other STEM subjects, with student pricing available.
Visit Site →An AI-powered collaborative tool within the Figma ecosystem that converts text inputs into flowcharts, mind maps, and organizational diagrams, and can automatically reorganize and categorize unstructured content into logical visual groupings.
No URLA specialized AI-powered tool focused on creating infographics and visual content designed specifically for educational materials, helping teachers convert lesson plans and data sets into visually engaging handouts.
Visit Site →An AI detection tool that analyzes text to determine whether it was written by a human or generated by AI; the vendor claims 99.98% accuracy.
Visit Site →A sovereign AI agent created by Sigil Wen that autonomously builds and deploys products, trades in prediction markets, creates social media content, and generates revenue to pay for its own compute costs. It runs on Conway terminal infrastructure and operates continuously as long as it can afford to stay alive.
Visit Site →A startup focused on building kid-size humanoid robots, co-founded by Lerrel Pinto. The company was acquired by Amazon.
Visit Site →A defense technology startup developing AI systems for military applications, using vision-language-action (VLA) models and pitching what it calls 'military AGI' capabilities.
No URLThe U.S. Department of Defense's secure enterprise platform for generative AI, providing military personnel access to large language models and AI tools within government-approved cloud environments for tasks like research, document drafting, and data analysis.
Visit Site →An iPhone app by Signull Labs that provides an agentic AI homescreen experience through iOS widgets, offering ambient intelligence with personalized insights about weather, health, email drafting, meeting prep, and location-based recommendations.
Visit Site →A Google Chrome extension that edits AI-generated or human-written emails to sound more natural and human by reintroducing errors, removing AI-typical phrasing, and offering multiple casualness modes (subtle, human, and CEO).
Visit Site →A free self-directed training program from AI Daily Brief that teaches users how to build a complete agentic operating system, designed to be platform-, model-, and harness-neutral for knowledge work.
Visit Site →An open-source agentic AI framework for end-to-end video game creation that can automatically design, code, generate assets, and test playable video games through an autonomous agent workflow.
No URLA rebuilt AI-powered advertising platform from X (formerly Twitter) featuring modern retrieval and ranking systems designed to offer better targeting, more relevant ad placements, and enhanced campaign performance for marketers.
Visit Site →A Google DeepMind AI model capable of generating interactive 2D environments and game-like experiences from text or image prompts.
No URLAn AI-powered note-taking app for Apple devices that transcribes, summarizes, and organizes notes from meetings, audio files, and live lectures, providing outlines and actionable next steps.
Visit Site →OpenAI's agentic coding application that allows users to run GPT-5.5-powered agents on entire project folders, enabling complex software development, document creation, and real-world task completion beyond the standard ChatGPT interface.
No URLA TikTok feature that uses AI to automatically generate text summaries of videos. It was scaled back after producing significant errors, including wildly inaccurate descriptions of what videos showed.
Visit Site →A standardized evaluation platform that runs agent harnesses across multiple benchmarks covering coding, web navigation, science tasks, and customer service, with centralized cost tracking to compare AI agent performance.
Visit Site →A comprehensive language model evaluation framework developed by Stanford's CRFM that benchmarks models across dozens of scenarios, providing standardized assessments of LLM capabilities and costs.
Visit Site →An open-source framework for evaluating language models across a wide range of benchmarks, commonly used to assess model checkpoints during development.
Visit Site →An AI agent benchmark designed to evaluate general AI assistants on real-world tasks, known for its high per-run evaluation costs on frontier models.
Visit Site →A compute-intensive scientific machine learning benchmark for evaluating new architectures, costing approximately 960 H100-hours per architecture evaluation.
Visit Site →A benchmark designed to evaluate the long-context capabilities of language models, measuring how well models handle extended sequence lengths.
Visit Site →The official Python SDK by Hugging Face for interacting with the Hugging Face Hub, enabling model inference, uploads, and integration with inference providers.
Visit Site →The official JavaScript SDK by Hugging Face for accessing inference APIs and providers on the Hugging Face Hub.
Visit Site →A widely used benchmark suite for evaluating natural language understanding systems across multiple tasks including sentiment analysis, textual entailment, and question answering.
Visit Site →A SoftBank venture focused on automating data center construction in the U.S. by deploying autonomous robots and AI to make server farm building more efficient.
Visit Site →A stripped-down, distilled version of Tesla's Full Self-Driving V14 software designed to run on vehicles with older HW3 hardware, offering driving assistance features but not full autonomous capability.
No URLMotorola's built-in AI assistant for its smartphones, providing on-device AI features such as smart suggestions, content generation, and device management.
Visit Site →Microsoft's cloud computing platform that hosts AI services and models, including OpenAI's models, providing enterprise-grade infrastructure for AI deployment and development.
Visit Site →A JavaScript library by Hugging Face that enables running machine learning models directly in the browser or Node.js environments, bringing transformer-based AI capabilities to web applications without server-side inference.
Visit Site →An open-source Chrome extension that runs Gemma 4 and MiniLM models locally in the browser using Transformers.js, providing an AI-powered assistant with a side panel chat UI and page-level content actions.
Visit Site →An AI-powered bot service that monitors social media feeds, news sites, blogs, Reddit, Hacker News, Substack, and other sources on your behalf, then sends curated text message digests of relevant news and updates at your preferred cadence for $9.99/month.
Visit Site →A software platform that enables hardware makers to create AI agents and orchestrations for AI-powered gadgets and devices, providing capabilities like customized voice creation and intelligence layers across various form factors including glasses, jewelry, and home speakers.
Visit Site →A deep learning model developed by UC Santa Cruz astrophysicists that analyzes large astronomical datasets to identify and classify galaxies, transitioning from convolutional neural network architecture to transformers for improved performance.
Visit Site →NVIDIA's multimodal robot controller, with approximately 42 million parameters, that enables teleoperation and autonomous control of humanoid robots. It accepts video, voice, text, and music inputs and generates whole-body robot movements.
Visit Site →A terminal infrastructure platform that enables AI agents to receive their own cryptographic wallets and private keys, and make permissionless payments in stablecoins (USDC) via the openx402 protocol. It can be installed in any model compatible with the Model Context Protocol (MCP).
Visit Site →An AI meeting assistant that automatically records, transcribes, and summarizes meetings, offering integrations with various conferencing and productivity tools.
Visit Site →An AI meeting notetaker that records, transcribes, and summarizes video calls, helping users capture and organize meeting information automatically.
No URLA software platform specializing in human behavior research that integrates biosensor data for analysis, used in academic and commercial research settings.
Visit Site →A Google Maps enterprise feature that uses generative AI to create realistic visualizations within Google Street View, allowing users to see how planned projects like construction sites or movie sets might look.
No URLA Google Earth feature that uses AI to analyze satellite and aerial imagery stored in Google Cloud's BigQuery, enabling rapid geospatial data analysis that reduces weeks of work to minutes.
No URLGoogle's AI models trained for geospatial analysis that can identify specific objects in satellite and aerial imagery such as bridges, roads, and power lines, eliminating the need for businesses to build custom models from scratch.
Visit Site →Google's AI-powered geospatial analysis platform used by enterprise partners for applications including environmental monitoring, disaster response, and urban planning.
Visit Site →An AI-infused document productivity app valued at about $11 billion that uses Google's Gemini models to power its text and image-generation features.
Visit Site →A startup platform that combines deterministic algorithms rooted in chemistry and biology with AI agents to interpret mass spectrometry data, accelerating the characterization of drug candidates for pharmaceutical development.
Visit Site →A Google feature that uses AI to generate concise summaries of information. Originally built for Google Search results, it now extends to Gmail and Drive within Google Workspace, enabling natural language queries and instant answers.
Visit Site →An all-in-one creator platform for newsletters, podcasts, and webinars that has added AI analytics features allowing creators to query audience metrics using AI tools like Claude and ChatGPT.
Visit Site →Google's AI-powered automation system built into Google Workspace that draws on a user's Gmail, Calendar, Chat, and Drive data to provide intelligent assistance across various productivity tasks.
Visit Site →An agentic AI feature built into Google Chrome for enterprise Workspace users that leverages Gemini to understand live browser tab context and automate web-based tasks such as data entry, travel booking, and meeting scheduling, with human-in-the-loop confirmation.
Visit Site →Google's enterprise browser security solution that includes AI-powered capabilities to detect unsanctioned AI tools, compromised browser extensions, and anomalous agent activity in the workplace.
Visit Site →A Google AI feature included with Gemini Advanced that conducts in-depth, multi-step research on complex topics and compiles comprehensive reports.
No URLAn AI-driven RPG platform by Latitude that enables players to design custom gaming worlds with AI-generated NPCs, unscripted interactions, and persistent character memory, powered by the company's proprietary World Engine.
Visit Site →An open-ended, AI-powered text adventure game by Latitude that generates infinite storylines using AI, launched in 2019 and attracting millions of players as one of the first consumer-facing generative AI experiences.
Visit Site →A social media platform that uses AI to analyze users' posted memories and experiences and generate personalized real-world activity and event recommendations, designed to reduce screen addiction and doomscrolling.
Visit Site →A robotic lawn mower by Ecovacs powered by AI Vision and 3D Time-of-Flight sensors that autonomously navigates and mows lawns while avoiding obstacles like pets, furniture, and toys.
Visit Site →A robot vacuum by Narwal featuring AI-powered mess detection technology that identifies and responds to different types of messes for more effective automated cleaning.
Visit Site →An AI prompt composition workspace that allows users to build, save, remix, and reuse structured prompts using layered fields and reusable 'VibeCards.' It includes access to 10,000+ premium prompts, version control, rollback options, and export capabilities to platforms like ChatGPT and Midjourney.
No URLA custom-built AI life coaching system designed to help individuals with ADHD manage daily tasks and routines through personalized AI assistance.
Visit Site →An AI-powered tool built by an Arkansas kayaker that predicts when rain-fed whitewater creeks are runnable based on weather and water data.
Visit Site →A memory feature within OpenAI's Codex that uses background screen captures to build a running memory of a developer's workflow, enabling the AI to better understand context, past work, and user habits over time.
No URLA feature within Anthropic's Cowork that allows users to build interactive dashboards and trackers using live data feeds from connected applications, enabling real-time personalized briefings and status monitoring.
No URLA synthetic persona dataset by NVIDIA containing 6 million demographically accurate Korean personas grounded in official statistics from Korean government sources. It enables developers to build culturally grounded AI agents for Korean users without any personally identifiable information.
Visit Site →NVIDIA's open-source reference stack for deploying always-on AI agents that can run in sandboxed environments on hardware ranging from RTX PCs to DGX Spark.
Visit Site →An AI-powered presentation generator that creates polished, meeting-ready slide decks from a single prompt, document, link, or rough outline. It offers over 100 designer-built templates with fully editable fonts, themes, and layouts, and supports export to PowerPoint or in-platform presenting.
No URLAn AI-powered business development representative (BDR) platform that automates sales outreach and lead generation tasks. The company markets its AI agent 'Ava' as a replacement for human BDRs.
No URLA company that provides AI-powered simulation and software infrastructure for autonomous vehicles and software-defined vehicle development.
Visit Site →An open-source agentic tool from Noos, with an architecture similar to other agentic coding and knowledge-work platforms, that supports extensible agent-based workflows.
No URLA Y Combinator-backed French startup that helps businesses integrate AI into their workflows, acquired by Sierra in April 2026.
Visit Site →A fast multilingual OCR model by NVIDIA trained on 12 million synthetic images across six languages, capable of processing 34.7 pages per second on a single A100 GPU with high accuracy for document text recognition.
Visit Site →A reinforcement learning framework that provides 8 verifiable environments for training e-commerce conversational agents on tasks like product discovery, cart building, returns, and order tracking, using algorithmically verifiable rewards instead of human judges.
Visit Site →A developer analytics platform founded in 2017 that tracks engineering productivity metrics, including AI-generated code quality, churn rates, and cost analytics for engineering managers.
Visit Site →An engineering intelligence platform acquired by Atlassian for $1 billion that helps organizations measure and understand developer productivity and the return on investment of coding tools.
Visit Site →An agentic assistant product by Anthropic built for complex enterprise tasks, featuring agentic plug-ins designed to automate specialized workflows within a company's various departments.
No URLA developer cloud environment built to prevent failures, operating on Google Cloud infrastructure.
Visit Site →A company that makes developer tools for building conversational voice agents, operating on Google Cloud.
Visit Site →A platform that conducts synthetic market research using AI agents to simulate consumer responses and market dynamics.
Visit Site →A company that builds web search and research APIs specifically designed for AI agents to retrieve and process web information.
Visit Site →An AI-powered document parsing platform that extracts and processes structured data from documents.
Visit Site →Infosys's enterprise AI platform that integrates various AI tools including OpenAI's Codex to help clients modernize software development, automate workflows, and deploy AI systems at scale.
Visit Site →Google's cloud-based data warehouse and analytics platform that now integrates AI-powered geospatial analysis capabilities for processing satellite and aerial imagery.
Visit Site →Google's AI integration layer for Google Workspace that brings Gemini-powered features like AI Overviews, smart summaries, and intelligent assistance to Gmail, Drive, and other Workspace apps for business and education customers.
Visit Site →An internal Meta tool designed to capture employee mouse movements and keystrokes on work computers to train AI agents that can replicate human work tasks such as using keyboard shortcuts.
No URLA framework by Apple for running and serving language models on Apple Silicon using the MLX machine learning framework, with models typically ported from Hugging Face transformers implementations.
Visit Site →A training component within the Sentence Transformers library that brings together models, datasets, loss functions, and other components for training both text-only and multimodal embedding and reranker models.
No URLA startup developing AI coding agents for enterprise engineering teams, capable of switching between different foundation models like Claude and DeepSeek to generate code for customers including Morgan Stanley and Ernst & Young.
Visit Site →An AI-powered observability platform that uses machine learning to monitor, identify, diagnose, and proactively fix IT infrastructure and AI model reliability issues across the entire tech stack.
Visit Site →Roblox's plain-language AI assistant for game development within Roblox Studio, featuring agentic capabilities including Planning Mode, code analysis, playtesting, and collaborative multi-step workflows to help creators plan, build, and test games.
Visit Site →Canva's AI-powered design assistant that allows users to create editable designs through text prompts, calling various built-in tools to generate layouts, images, and content with layered editing capabilities.
Visit Site →A startup building simulation tools for robot developers that creates high-fidelity virtual environments to train and test autonomous systems, aiming to close the sim-to-real gap in physical AI development.
Visit Site →YouTube's automated content identification system that detects copyright-protected material in uploaded videos, allowing rights owners to request removal or share in the video's revenue.
No URLAn AI company focused on language technology, founded by Asmelash Teka Hadgu, that has provided expert commentary on how diffusion models handle text rendering in images.
Visit Site →An AI agent project that functions as a digital Chief of Staff, automating executive-level coordination and organizational tasks.
Visit Site →An AI-powered platform that runs a fleet of named AI agents (Atlas, Nova, Blaze, etc.) organized as a virtual org chart to handle CEO, engineering, and marketing functions autonomously.
Visit Site →An AI memory and context-sharing tool that uses a shared MCP (Model Context Protocol) memory server to maintain persistent context across multiple AI coding tools and services.
Visit Site →Model Context Protocol, an open standard developed by Anthropic that enables AI models and agents to connect with external data sources and tools through a unified interface.
No URLAn AI company known for creating Devin, an autonomous AI software engineering agent designed to handle complex coding tasks end-to-end.
No URLAn AI-powered marketing platform that enables brands to create personalized, on-brand ad campaigns by connecting to existing creative tools like Figma and photo libraries, using AI agents to autonomously generate images and videos for advertising.
Visit Site →An AI-powered learning platform that transforms students' notes into interactive, gamified study materials featuring leaderboards, streaks, and social challenges, with over 13 million users across 120+ countries.
Visit Site →A software development kit by OpenAI that enables enterprises to build AI agents running on OpenAI's models, featuring sandboxing capabilities, in-distribution harnesses for frontier models, and support for long-horizon multi-step tasks.
Visit Site →An AI-powered translation company offering text, document, and now voice translation services, known for high-quality machine translation across multiple languages.
Visit Site →An AI-powered feature built into Amazon's Ember Artline TV that analyzes photos of a user's room to suggest artwork based on colors, decor style, and recurring themes in existing wall art.
Visit Site →An AI detection tool that identifies AI-generated text by analyzing linguistic patterns and sentence structures commonly produced by language models.
Visit Site →A Chrome extension by HCompany that uses computer-use AI to automate web tasks directly in the browser. Users describe what they want, and the agent navigates interfaces, fills fields, and makes decisions, with a 'routines' feature that records and replays repetitive workflows.
Visit Site →A new feature in Google Chrome that allows users to save and reuse AI prompts powered by Gemini, enabling repeatable workflows across different web pages without retyping instructions.
Visit Site →A cloud computing service specializing in AI inference that orchestrates GPU resources across 40 data centers in 15 countries to provide fast, low-cost token generation for developers building on generative AI models.
Visit Site →An AI-powered research assistant that uses large language models to help scientists and pharmaceutical companies review and analyze data from tens of thousands of scientific papers.
Visit Site →Adobe's AI assistant that works across Creative Cloud apps like Photoshop, Premiere, Lightroom, Illustrator, and Express to complete creative tasks through text prompts, buttons, and sliders. It can suggest actions, orchestrate workflows between multiple Adobe apps, and learn user creative preferences over time.
No URLAn AI-powered code validation platform that deploys AI agents to perform code reviews, manage continuous integration workflows, and conduct security and maintenance operations on codebases. It focuses on ensuring AI-generated and human-written code is production-ready.
Visit Site →A messaging-first autonomous AI agent by Indian startup Emergent that operates through platforms like WhatsApp, Telegram, and iMessage to complete tasks across connected tools such as email, calendars, and workplace software. It features configurable 'trust boundaries' that allow autonomous execution of routine tasks while requiring user approval for consequential actions.
No URLAn AI agent developed by Nous Research featuring persistent memory, auto-generated skills, and self-improvement capabilities. It treats each skill like a scientific project, forming hypotheses and testing them, and supports agentic reinforcement learning pipelines and mass-scale data generation.
No URLA reinforcement learning framework by Nous Research designed for scalable asynchronous large language model training, being expanded to support Hermes Agent primitives for agentic RL pipelines.
Visit Site →A quantization and caching technique for running large language models more efficiently on local hardware, enabling faster inference on consumer devices.
No URLA crowdsourced tracking website that monitors and logs the number of active autonomous robotaxi vehicles operating in various cities.
Visit Site →One of the largest open-source AI training frameworks on GitHub, developed by HPC-AI Tech, designed for efficient large-scale model training.
Visit Site →A model API service by HPC-AI Tech that provides direct access to frontier open-source AI models on its own GPU clusters (B200, H200, B300), eliminating routing fees and middleman costs.
Visit Site →A Synthetic Document Generator from the Donut project that programmatically generates document-like images with text labels for OCR training data creation.
Visit Site →A reinforcement learning framework providing 400 environments for algorithmic-reasoning tasks such as sorting, multiplication, and Sudoku, using verifiable rewards for single-turn text-based puzzles.
Visit Site →An engineering analytics platform that aggregates and analyzes software development data to provide insights into engineering operations, including code churn and AI adoption metrics.
Visit Site →An engineering intelligence platform that provides analytics on software development teams, measuring productivity, resource allocation, and the impact of AI-integrated engineering workflows.
Visit Site →A data center and cloud GPU infrastructure startup that provides compute resources for AI workloads. It has reportedly signed a $50 billion agreement with Anthropic for frontier AI lab infrastructure.
Visit Site →An AI-powered personal health coaching feature being developed by Google for the Fitbit app, designed to provide personalized health guidance to users of Fitbit fitness trackers.
No URLAn AI infrastructure company focused on building custom chips and the infrastructure to enable them to communicate effectively, betting on a full-stack solution and open standards for scalable AI infrastructure.
Visit Site →An AI-powered code review tool that integrates with development workflows to provide automated code analysis and review feedback.
Visit Site →An AI-powered personalized news app co-founded by Instagram co-founder Mike Krieger that used machine learning to curate and deliver news content.
Visit Site →A tool by Adobe designed to test retail websites for accessibility and visibility by large language models (LLMs), helping retailers optimize their content for AI-driven traffic.
Visit Site →Figma's AI agent integration within its collaborative design platform, enabling AI-powered assistance for design workflows and task automation.
No URLA creative canvas tool by Google Labs that serves as an AI-native vibe design partner, featuring a smarter design agent for generating and iterating on visual designs.
Visit Site →An AI agent training startup that provides tools for building and training AI agents. It was involved in a security incident linked to a data breach at Vercel.
Visit Site →An AI personal finance startup, acquired by OpenAI, focused on using artificial intelligence to help users manage their finances.
No URLAn AI-powered study platform that helps students create flashcards and study materials, serving over 7 million users as a modern alternative to traditional learning tools.
Visit Site →A micro-learning app designed to redirect screen-time habits into productive learning experiences, with over 1 million app downloads.
Visit Site →A spaced-repetition flashcard application that uses intelligent scheduling algorithms to help users efficiently memorize and retain information.
Visit Site →A widely used digital learning platform that provides flashcards, practice tests, and AI-enhanced study tools for students across various subjects.
Visit Site →A micro-learning platform that delivers bite-sized educational content designed to fit into short study sessions.
Visit Site →An AI-powered platform that adjudicates the truth of journalism by allowing anyone to pay to challenge a news story, triggering a public investigation into its claims and generating an 'Honor Index' score reflecting a reporter's integrity and accuracy.
Visit Site →The Abstraction and Reasoning Corpus, created by François Chollet, is a benchmark designed to measure AI systems' ability to acquire new skills efficiently and perform abstract reasoning on novel tasks.
No URLA Python library for using and training embedding and reranker models, supporting applications like retrieval-augmented generation and semantic search, with v5.4 adding multimodal capabilities for encoding and comparing texts, images, audio, and video.
Visit Site →A fully 64-bit integrated development environment (IDE) by Microsoft with deeper AI integration that provides AI-assisted suggestions to help developers write, refactor, and optimize code. It supports cross-platform development with .NET MAUI and Blazor, and includes collaboration features like Live Share.
Visit Site →An add-in sidebar by Anthropic that embeds Claude AI directly inside Microsoft Excel, allowing it to natively read workbook data, formulas, and dependency chains to edit spreadsheets and provide cell-level citations. It requires a Claude Pro, Max, Team, or Enterprise subscription.
Visit Site →A tool-grounded, executable benchmark developed by IBM Research for evaluating how well AI agents reason and act in enterprise-like environments, measuring compositional reasoning across APIs and documents using full execution traces.
Visit Site →OpenAI's AI-powered web browser designed to compete in the browser ecosystem with integrated AI capabilities.
No URLAn AI-powered web browser developed by The Browser Company, designed to integrate AI features natively into the browsing experience.
Visit Site →A GPU-as-a-Service and AI-native cloud solutions provider rebranded from the former shoe company Allbirds, offering AI compute capacity through acquired GPU assets to customers seeking infrastructure for AI workloads.
Visit Site →An AI-powered world simulation tool created by Nous Research that allows users to interact with simulated environments and scenarios using language models.
Visit Site →A long-term memory system developed by IBM Research that helps AI agents learn from previous executions by converting interaction traces into reusable guidelines, enabling agents to improve over time rather than repeating mistakes.
Visit Site →A safe and efficient file format originally created by Hugging Face for storing and sharing ML model weights without the risk of arbitrary code execution, featuring zero-copy and lazy loading capabilities. It has now joined the PyTorch Foundation as a vendor-neutral community project.
Visit Site →A remote desktop solution by Astropad designed specifically for monitoring and interacting with AI agents running on Apple devices, featuring high-fidelity streaming, voice dictation, and iPhone/iPad clients.
Visit Site →A visual AI tool by Atlassian, available in open beta within Confluence, that transforms data and information into visual assets such as charts and graphics, automatically recommending the best visual format for the content.
Visit Site →A personal AI agent by The Interaction Company of California that operates via iMessage, SMS, Telegram, and WhatsApp, allowing users to automate everyday tasks like calendar management, health tracking, smart home control, and daily planning through text messaging. It dynamically selects the best AI model for each task.
Visit Site →A native app integration built within OpenAI's ChatGPT platform by the streaming service Tubi, enabling users to discover movies and TV shows through natural-language prompts and receive curated recommendations linked to Tubi's library.
Visit Site →An AI platform specializing in computer vision and facial recognition, offering tools for image and video analysis. The company previously trained facial recognition models using user photos obtained from OkCupid.
Visit Site →An AI-powered personal finance planning app that allows users to input financial information like salary, debts, and monthly costs, then models different what-if scenarios to help them make financial decisions. It was specifically trained to handle financial math accurately.
Visit Site →An OpenClaw agent created by Ethan Bloch for automated stock trading, named after legendary investor Warren Buffett.
No URLTesla's AI-powered advanced driver assistance system that enables supervised autonomous driving capabilities, available as a $99.99/month subscription with configurable speed profiles, parking options, and detailed driving statistics.
No URLA health-focused feature within OpenAI's ChatGPT that allows users to ask health-related questions and receive AI-generated answers and guidance.
Visit Site →An Anthropic-led initiative focused on securing critical software infrastructure for the AI era, partnering with major tech companies including Amazon, Apple, Google, Microsoft, and NVIDIA to address AI-discovered cybersecurity vulnerabilities.
No URLA library app by OverDrive that connects public library card holders with e-books and audiobooks. It has introduced an 'Inspire Me' feature that uses a large language model to recommend books, and now hosts audiobooks narrated by AI-synthesized voices.
Visit Site →A cloud-based agentic feature within Microsoft 365 Copilot designed to take actions across Microsoft 365 apps, powered by a personalization intelligence layer and optionally by Anthropic's Claude model.
No URLA Gradio feature that extends FastAPI, allowing developers to pair any custom frontend (React, Svelte, plain HTML/JS) with Gradio's backend infrastructure including queuing, concurrency management, SSE streaming, and ZeroGPU support.
No URLA Hugging Face Spaces feature that provides automatic GPU allocation for machine learning applications, allowing developers to run GPU-intensive models without managing infrastructure directly.
Visit Site →A JavaScript client library for Gradio that enables frontend applications to communicate with Gradio backends through the queue system, published as the @gradio/client package and loadable from a CDN.
Visit Site →A Spanish startup developing a satellite constellation to collect precise Earth observation data optimized for deep learning models, aiming to become an enterprise ground-truth data source for AI applications.
Visit Site →An AI-based enterprise management software platform that helps organizations automate tasks by first discovering which processes should be automated, founded by former OpenAI product manager Angela Jiang.
Visit Site →A web hosting platform offering managed WordPress hosting with AI-assisted website building tools, enabling users to create and manage up to 50 websites with no coding experience required.
Visit Site →A Japanese robotics company that builds software control platforms enabling industrial robots to autonomously handle picking and logistics tasks in warehouses and factories.
Visit Site →An AI-powered browser-based PDF tool by YaseenAI, Inc. that lets users edit, convert, merge, split, sign, and redact PDF files locally without downloading software, keeping files private and secure.
Visit Site →A Google application available on Android and iOS that allows users to discover, download, and run AI models locally on their devices for free, with no cloud processing required, supporting features like AI chat, image analysis, and audio transcription.
Visit Site →A feature within Anthropic's Cowork agent platform that lets users remotely control AI agents and assign them tasks.
No URLA cloud-based collaboration platform that has pivoted to incorporate AI-powered vibe coding capabilities, allowing users to build applications within its interface.
Visit Site →An AI-powered feature within the Bluesky social media platform designed to generate custom feeds for users, though it has faced significant user pushback and blocking.
No URLOverworld's local runtime environment for running Waypoint world models on consumer hardware, featuring a streamlined installer flow for quick setup.
Visit Site →An AI cybersecurity startup that claims to replicate advanced vulnerability discovery capabilities using smaller, open-weight models rather than relying on a single large frontier model.
Visit Site →A spam call and robocall blocking application that identifies and filters fraudulent phone calls for consumers.
Visit Site →A benchmark designed to measure AI models' visual comprehension and reasoning capabilities, evaluating how well models understand and reason about visual content.
No URLA benchmark environment where AI agents complete realistic multi-step tasks via APIs, used to evaluate agent performance on scenarios requiring complex control flow across multiple applications.
Visit Site →An open-source observability and analytics platform for LLM applications that captures agent trajectories including user utterances, tool calls, and results using OpenTelemetry-based tracing.
Visit Site →An open-source AI observability platform by Arize AI that provides a UI for tracing, evaluating, and debugging LLM applications and AI agents.
Visit Site →Activation-aware Weight Quantization, a quantization technique for large language models that preserves important weights based on activation patterns to maintain model quality at reduced precision.
Visit Site →A coalition created by Anthropic consisting of major technology companies that are given early access to test the Claude Mythos model and patch security vulnerabilities ahead of any broader release.
No URLAn AI startup focused on AI interpretability, building tools to understand and control the internal workings of large language models.
Visit Site →Google's premium subscription tier at approximately $200-250/month that provides 25,000 monthly AI credits for heavy use of Google's AI services including Google Flow.
Visit Site →An AI startup that offers McKinsey-style consulting reports generated through AI at a fraction of the cost of traditional consulting firms.
No URLGenEditBio's AI platform that analyzes data to identify how chemical structures correlate with specific tissue targets and predicts optimal delivery vehicle chemistry for gene-editing tools.
Visit Site →InfiniMind's AI-powered platform that analyzes television content in real time, helping media and retail companies track product exposure, brand presence, customer sentiment, and PR impact.
Visit Site →A production-grade calendar management environment built by Turing for OpenEnv, serving as a benchmark for evaluating tool-using agents under realistic constraints such as access control, temporal reasoning, and multi-agent coordination.
Visit Site →Spotify's internal AI-powered development system that enables remote, real-time code deployment using generative AI. It integrates with tools like Claude Code and Slack to allow engineers to fix bugs and add features from their mobile devices.
Visit Site →An open-source OCR (Optical Character Recognition) AI model by Zhipu AI (ZAI) that can parse text, tables, formulas, handwriting, and receipts from images. At only 2.6 GB, it outperforms both open-source and closed-source alternatives in accuracy and speed.
Visit Site →AI coding agents capable of autonomously building entire software applications, pushing code to GitHub, and deploying to platforms like Vercel based on high-level instructions.
No URLA Reddit-like social network designed for AI agents to communicate with one another, post, comment, and browse content. It gained viral attention when posts appeared to show AI agents organizing autonomously, though security vulnerabilities revealed that humans could easily impersonate agents on the platform.
No URLA Swedish AI startup that helps dentists' practices with administrative work, including a recording tool that uses AI to generate clinical notes from patient visits.
Visit Site →Domain purchased by Crypto.com founder Kris Marszalek for $70 million, planned to offer consumers a personal AI agent for messaging, app usage, and stock trading.
No URLxAI's project spanning from simple computer use simulation to modeling entire corporations, described as being able to do anything on a computer that a computer can do.
No URLAn AI startup founded by Anna Goldie and Azalia Mirhoseini that builds AI tools to automate and dramatically accelerate chip design, using deep learning agents that improve through experience across different chip layouts.
No URLA web browser being developed by OpenAI as part of its expanding product suite.
No URLAn autonomous AI agent that can run locally or on a cloud VPS, capable of coding, managing Kanban boards, and completing tasks autonomously while the user is away.
Visit Site →An open-source OCR model for parsing text and data from images, used as a benchmark comparison for document understanding tasks.
Visit Site →An AI cybersecurity tool founded by AI engineer Artem Sorokin, focused on addressing security challenges in AI systems.
Visit Site →A Swedish legal AI startup that emerged from the SSE Business Lab incubator, applying artificial intelligence to legal workflows and processes.
Visit Site →An AI-powered image editing tool for relighting and brightness adjustment that uses generative AI to intelligently brighten dark images while preserving detail and avoiding artifacts, based on the FLUX.2 klein architecture.
No URLAn open-source AI image upscaling tool released by Nvidia for enhancing image resolution with high accuracy.
Visit Site →OpenAI's image generation model that ranks as the top-performing model on the Arena AI image generation leaderboard, known for high-quality image synthesis and prompt adherence.
No URLAn image generation model that serves as the foundation for various image editing and generation tasks, designed to run on most consumer devices.
Visit Site →Google's AI image generation and editing model capable of creating and modifying images, integrated into the Gemini ecosystem.
No URLA Google Workspace tool for editing existing photos, creating photos from scratch, and designing flyers and graphics, built on Google's AI image creation capabilities. It integrates AI-powered features such as cropping, object removal, and text overlay for graphic design.
No URLAn image generation model that creates images directly in pixel space rather than latent space, bypassing the traditional VAE conversion step used by most diffusion models.
No URLAn open-source image generation model by Vivago AI capable of producing 2K resolution images in various artistic styles, with particularly strong text rendering and infographic/poster generation capabilities.
Visit Site →An AI-powered design and image generation tool, now owned by Perplexity, that enables creative teams to quickly generate and iterate on visual designs.
Visit Site →An AI-powered creative design tool that allows users to generate and iterate on visual content quickly using generative AI capabilities.
Visit Site →An AI-powered photo editing tool within the DoorDash merchant platform that can replace backgrounds, sharpen images, and optimize lighting for food photography without altering the dish itself.
Visit Site →An AI image generation tool notable for its strong text rendering capabilities within generated images, allowing users to create visuals with accurate and readable typography.
Visit Site →OpenAI's GPT-4o model with native image generation capabilities, allowing users to create and edit images directly through ChatGPT prompts.
No URLAn AI system that simultaneously improves at generating realistic images and detecting fake/AI-generated images, advancing both generation and detection capabilities together.
No URLOpenAI's API-accessible image generation model that powers ChatGPT Images 2.0, available to developers through the OpenAI API with pricing based on output quality and resolution.
Visit Site →OpenAI's latest image generation and editing model integrated into ChatGPT, capable of producing highly detailed images with accurate text rendering, UI mockups, and complex multi-element compositions, representing a significant leap over previous models.
Visit Site →An experimental product by Anthropic that lets users create visuals such as prototypes, presentation decks, slides, and one-pagers through natural language descriptions, powered by Claude Opus 4.7.
No URLOpenAI's updated image generation model integrated into their products, enabling AI-powered image creation directly within applications like Codex.
Visit Site →Canva's proprietary image-generation AI model, designed for creating visuals within the Canva platform. It has been optimized to be 5x faster and 30x cheaper than its previous version.
Visit Site →A Microsoft design tool that integrates AI image generation capabilities, previously powered by DALL-E 3, for creating visual content from text prompts.
Visit Site →An open-source AI image generation model that excels at text rendering within images, prompt understanding, photorealistic image generation, and producing a wide variety of artistic styles including comics and posters.
Visit Site →An open-source AI image generator considered one of the top models in its category, capable of generating photorealistic images and understanding complex prompts.
No URLAn AI image generation model known for producing images that can sometimes have an overly smooth or plasticky appearance, developed by Black Forest Labs.
No URLA Google AI creative tool that has been discontinued and effectively replaced by Google Flow as the platform for AI-powered image and video generation.
Visit Site →An AI image generation model developed by OpenAI that creates images from text descriptions, one of the pioneering text-to-image generation systems.
No URLA distilled, faster variant of Alibaba's Z-Image model optimized for quick image generation with high visual quality, particularly strong in realistic portrait photography.
Visit Site →The raw foundational model from Alibaba's Tongyi Lab that serves as the base for the Z-Image family, capable of both image generation and image editing tasks.
Visit Site →A 3-billion parameter OCR model by Nanonets designed for document text extraction. Training with DPO techniques reduced its degeneration rate from 1.61% to 0.20%.
Visit Site →A family of computer-use AI agent models by H Company, available in multiple sizes (0.8B to 35B-A3B), designed to autonomously operate across web, desktop, and mobile GUI environments. It supports local and on-device inference with quantized checkpoints (FP8, Q4 GGUF, NVFP4) and integrates with various agent frameworks.
Visit Site →The 32B parameter variant of NVIDIA Cosmos 3, featuring a 32B reasoner and 32B generator designed for large-scale synthetic data generation and research, running on NVIDIA Hopper and Blackwell GPUs.
Visit Site →The 8B parameter variant of NVIDIA Cosmos 3, featuring an 8B reasoner and 8B generator optimized for efficient inference on workstation-grade compute such as the RTX PRO 6000 GPU.
Visit Site →An open-source vision-language grounding model by Nvidia that can detect and segment objects in images and videos using natural language queries, employing parallel box decoding for fast and accurate bounding box prediction. It has only 3 billion parameters and can run on most consumer GPUs.
Visit Site →Microsoft's AI-powered assistant integrated across its products, offering conversational AI, content generation, and productivity features.
No URLByteDance's 3-billion-parameter open-source unified multimodal model capable of text-to-image, text-to-video generation, image and video editing, and visual understanding.
Visit Site →DeepSeek's AI research introducing visual thinking capabilities that allow AI systems to point at and reference parts of images during reasoning, reducing visual token usage by 90% while matching or beating frontier models on benchmarks. It uses policy distillation from expert models into a unified student model.
No URLA family of transformer-based AI models by Allen Institute for AI (AllenAI) that processes satellite imagery for remote sensing tasks such as tracking mangrove change, classifying forest loss drivers, and producing crop-type maps. Version 1.1 cuts compute costs by up to 3x compared to v1 while maintaining similar performance.
Visit Site →Google's multimodal AI model featuring a large context window and strong performance across text, image, and video understanding tasks.
No URLA lighter, faster variant of Google's Gemini Omni model, part of the new creation-focused model family that handles multimodal inputs and outputs.
No URLAI interaction models developed by Thinking Machines Lab (founded by former OpenAI CTO Mira Murati) that feature real-time translation, intelligent interruption handling, visual monitoring, and simultaneous speech capabilities for natural human-AI conversation.
No URLGoogle's AI assistant built on the Gemini Intelligence stack, integrated into Android devices and Googlebook laptops, capable of handling complex tasks through natural interaction methods including mouse gestures and voice commands.
No URLNVIDIA's open 30-billion-parameter multimodal AI model that processes images, video, and audio with exceptional throughput and cost efficiency, achieving nearly 10x real-time video processing speed, roughly 3x faster than Qwen3-Omni and up to 7x faster on documents.
Visit Site →A suite of AI-powered features by Google built on the Gemini platform, designed for Android devices including Pixel and Samsung Galaxy phones, that enables multi-step task automation, contextual understanding from images, and intelligent agent capabilities across apps and services.
No URLA full-duplex AI interaction system developed by Thinking Machines Lab (founded by former OpenAI CTO Mira Murati) that processes user input and generates responses simultaneously, enabling natural conversational AI with response times of approximately 0.40 seconds.
Visit Site →A full-duplex AI model from Thinking Machines Lab that can listen and respond simultaneously, achieving 0.40-second response times comparable to natural human conversation speed.
No URLAlibaba's multimodal AI model capable of processing text, images, video, and audio, used as a benchmark comparison for throughput performance in multimodal tasks.
Visit Site →OpenAI's contrastive language-image pretraining model that learns to match images with text descriptions, commonly used as an image encoder in multimodal AI systems.
Visit Site →Amazon's rebooted AI-powered virtual assistant that can generate on-demand audio podcast episodes on any topic, featuring customizable AI host personalities and conversation styles. It sources content from over 200 news publications and delivers episodes to users' devices.
No URLAn OpenAI voice model built with GPT-5-class reasoning that creates realistic vocal simulations capable of conversing with users and handling complex requests in real time via the Realtime API.
Visit Site →An AI startup focused on building empathic AI technologies that can understand and respond to human emotional expression across voice, face, and language.
Visit Site →A multimodal AI medical assistant developed by Google DeepMind that can see, hear, and speak with patients in real time, guiding them through physical examinations via video and providing clinical reasoning and diagnostic assessments. It outperformed physicians in 68 different aspects of medical consultation in benchmarks.
No URLGoogle's enterprise-grade AI platform built on its Gemini family of generative AI models, offered through Google Cloud with features tailored for business customers including API access and integration with Google Workspace.
Visit Site →An omni-modal understanding model by NVIDIA built for real-world document analysis, multiple image reasoning, automatic speech recognition, long audio-video understanding, agentic computer use, and general reasoning. It uses a hybrid Mamba-Transformer Mixture-of-Experts backbone and delivers up to 9x higher throughput compared to alternatives.
Visit Site →A vision encoder used as the visual processing component in NVIDIA's Nemotron 3 Nano Omni architecture, responsible for extracting and encoding visual information from images, documents, and video frames.
No URLAn earlier NVIDIA vision-language model in the Nemotron family that served as the foundation for the Nemotron 3 Nano Omni, providing strong visual understanding capabilities.
No URLAn open-weights omni-modal AI model that handles text, image, video, and audio understanding. Nemotron 3 Nano Omni claims to lead it in many benchmark domains.
Visit Site →Google's enterprise AI agent platform powered by Gemini that enables businesses to interact with Google services through natural language prompts, including generating realistic scenes in Google Street View.
No URLAn open-world 3D segmentation model developed at Meta, extending the Segment Anything concept to three-dimensional perception tasks.
Visit Site →An influential image segmentation model developed by Meta's FAIR division that enables promptable segmentation of objects in images.
No URLA finetuned version of Qwen3-VL-Embedding-2B optimized for Visual Document Retrieval, achieving an NDCG@10 of 0.947 and outperforming existing VDR models including those up to 4x its size.
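For readers unfamiliar with the NDCG@10 figure quoted in the entry above, the following is a minimal sketch of how that metric is commonly computed for a single query. It assumes graded relevance labels listed in ranked order and normalizes against the ideal ordering of those same labels; the function names are illustrative and are not taken from the model's own evaluation code, which may differ in details such as the gain function.

```python
import math

def dcg_at_k(relevances, k=10):
    """Discounted cumulative gain over the top-k results (linear gain)."""
    # rank is 0-based, so the discount for the first result is log2(2) = 1.
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))

def ndcg_at_k(relevances, k=10):
    """DCG of the given ranking divided by the DCG of the ideal ranking.

    Assumption: `relevances` holds the labels of every retrieved document,
    so the ideal ranking is simply these labels sorted in descending order.
    """
    ideal = dcg_at_k(sorted(relevances, reverse=True), k)
    return dcg_at_k(relevances, k) / ideal if ideal > 0 else 0.0

# Made-up example: the one relevant page is ranked second instead of first.
score = ndcg_at_k([0, 1, 0])
```

A benchmark score such as 0.947 would then be the mean of this per-query value over all queries in the evaluation set.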
Visit Site →Apple's AI-powered virtual assistant built into its devices, currently undergoing a major AI redesign to compete with other AI assistants in feature quantity and quality.
No URLA vision-language multimodal embedding model by Qwen (Alibaba) that maps text, images, and other modalities into a shared embedding space for cross-modal similarity and retrieval tasks.
Visit Site →OpenAI's voice interaction feature for ChatGPT that enables spoken conversations with the AI assistant, recently launched with Apple CarPlay integration.
Visit Site →Meta's first AI model from its Superintelligence Labs, designed for everyday personal use tasks including visual understanding, health, shopping, and social content. It powers the updated Meta AI assistant and features a 'Contemplating' reasoning mode that orchestrates multiple agents reasoning in parallel.
Visit Site →Amazon Web Services' managed platform that provides access to foundation models from multiple AI companies, including model-routing services that allow customers to automatically use different AI models for various tasks to optimize performance and cost.
No URLAmazon's enhanced AI assistant with improved intelligence and capabilities including smart home management and vacation planning, launched to all U.S. users.
Visit Site →A Tokyo-based startup founded by ex-Googlers that develops infrastructure to convert petabytes of unviewed video and audio into structured, queryable business data using vision-language models.
Visit Site →InfiniMind's flagship long-form video intelligence platform capable of processing 200 hours of footage to pinpoint specific scenes, speakers, or events, with beta release scheduled for March 2026.
Visit Site →ElevenLabs' AI-powered tool designed for marketing and branding teams to generate music and audio content for commercial use.
No URLElevenLabs' dedicated platform for creating AI-generated songs, allowing users to build tracks section by section including intro, verse, and chorus, then stitch them together.
Visit Site →Stability AI's latest family of audio generation models capable of creating professional-grade music compositions of over six minutes in length. The family includes four models ranging from 459M to 2.7B parameters, with smaller models suitable for on-device generation and larger models for full-length compositions.
Visit Site →Google DeepMind's AI music generation model capable of creating high-quality music, including instrumentals and vocals.
No URLA music AI platform that focuses on social music interaction rather than generation from scratch, enabling users to remix, restyle, and share tracks while preserving artist control through opt-in/opt-out systems and royalty pipelines.
Visit Site →A closed-source AI music generation platform that creates songs from text descriptions, considered one of the top commercial AI music generators alongside Suno.
Visit Site →An open-source AI voice cloning tool that can generate singing voices from just a few seconds of a reference voice, allowing users to make any voice sing any song with custom melodies and lyrics. It is lightweight (under 3GB) and can run on low-end GPUs or CPUs.
No URLA publication by The Midas Project that investigates and reports on AI-related media and technology issues.
Visit Site →Nvidia's new PC CPU/superchip designed for AI agent PCs, offering 1 petaflop of AI performance with 6,144 Blackwell GPU cores and 20 CPU cores, capable of running large language models locally on Windows laptops.
Visit Site →A near-memory processing chip designed by startup XCENA that places compute capabilities closer to DRAM, aiming to reduce AI inference bottlenecks by handling data operations near memory without costly round trips between CPUs, GPUs, and memory.
Visit Site →Waymo's autonomous driving technology stack that powers its fleet of self-driving robotaxis. Now in its 6th generation, it enables vehicles to navigate various road and weather conditions, including snowy environments.
No URLA free app by OverDrive that allows users to borrow ebooks and audiobooks from their local public library, compatible with Kindle and other e-reader platforms.
Visit Site →A book tracking app that helps readers log their reading progress, set goals, and discover new books based on mood and preferences. It is launching a Kobo integration in June 2026 to sync reading progress automatically.
Visit Site →A benchmark created by Google DeepMind to comprehensively evaluate the ability of language models to generate factually accurate text, measuring both overall accuracy and search-augmented factual responses.
Visit Site →Microsoft's custom AI chip launched in January, designed to handle AI workloads in Azure data centers as an alternative to third-party GPU solutions.
Visit Site →Meta's $3.99/month paid subscription tier for Facebook that provides social expression and profile customization features similar to Instagram Plus.
No URLMeta's $2.99/month paid subscription tier for WhatsApp that adds app themes, custom ringtones, premium stickers, and extra pinned chats.
No URLMeta's upcoming broader subscription umbrella currently in early testing that will eventually house AI-focused tiers and professional plans for creators and businesses.
Visit Site →A tiny open-source robot made by Hugging Face, designed as a compact humanoid platform that can run various AI applications.
Visit Site →Nvidia's next-generation GPU platform that is sold both standalone and bundled with the Vera CPU for AI computing workloads.
No URLNvidia's CPU product purpose-built for agentic AI workloads, designed to process tokens as fast as possible rather than using traditional core-based cloud architecture, targeting a $200 billion market opportunity.
Visit Site →LetinAR's proprietary optical technology for smart glasses that arranges tiny optical elements inside a lens to direct light precisely into the user's eye, enabling brighter images in a thinner, lighter, and more power-efficient form factor than competing approaches.
Visit Site →A training dataset published alongside the Ettin Reranker models, consisting of a subset of lightonai/embeddings-pre-training mixed with a reranked subset of lightonai/embeddings-fine-tuning.
Visit Site →A pre-training dataset by LightOn AI used for training embedding and reranking models.
Visit Site →Google's app development platform providing backend services including Firestore database, Firebase Auth, and Firebase App Check, with planned integration into Google AI Studio's app creation workflow.
Visit Site →An online educational course created by Andrej Karpathy that teaches students how to build neural networks from scratch in code, covering fundamentals of deep learning and language models.
Visit Site →Sony's AI-driven robotic table tennis system that uses nine cameras and real-time spin tracking to play table tennis, becoming the first robot to defeat an elite human player.
Visit Site →Google's Android-based desktop operating system designed to power Googlebook laptops, built to integrate seamlessly with Android phones and optimized for Gemini AI.
Visit Site →A Python library built on the OpenCASCADE geometry kernel for programmatic CAD modeling and parsing of standard CAD file formats like STEP, enabling precise geometric feature extraction.
Visit Site →A data center GPU accelerator from AMD featuring 192GB of HBM3 VRAM and 5.3 TB/s memory bandwidth, designed for large-scale AI inference and training workloads as an alternative to NVIDIA GPUs.
Visit Site →A research organization launched by Anthropic focused on predicting the impacts of AI on the world, covering areas such as economic diffusion, threats and resilience, AI systems in the wild, and AI-driven research and development.
No URLA robotics company developing general-purpose humanoid robots designed for real-world tasks. Figure has ramped up production capabilities and is focused on deploying robots into homes and workplaces to collect real-world data.
No URLA Google initiative targeting the mid-2030s to build data centers in space, leveraging orbital infrastructure for AI compute.
No URLA reboot of the original Vine short-form video platform that hosts six-second human-made videos and archives 500,000 original Vine clips. It enforces a no-AI-generated-content policy using a human verification tool and requires videos to be recorded in-app or verified before posting.
Visit Site →A cybersecurity suite by Surfshark that bundles a VPN, antivirus, breach alerts, alternative ID generation, and private search into a single subscription plan.
No URLA personal data removal service that contacts data brokers on behalf of users to request deletion of their personal information, with ongoing follow-ups and a tracking dashboard.
Visit Site →A secure cloud storage platform offering end-to-end encrypted file storage with built-in document editing, collaboration tools, e-signatures, and version history, hosted in Europe under GDPR compliance.
Visit Site →An AI benchmark for evaluating model performance on health-related queries, which was found to have a scoring bias that rewarded longer, more verbose answers.
Visit Site →An AI benchmark containing questions about real-world experimental errors in biological protocols, where top PhD experts score approximately 36%.
No URLA large-scale multiple-choice question dataset derived from Indian medical entrance exams (such as AIIMS, with USMLE-style questions), containing questions with four answer options, correct answer indices, and optional free-text explanations.
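The record layout described in the entry above (a question, four options, a correct answer index, and an optional explanation) can be sketched roughly as follows. The class name, field names, and sample question are hypothetical and chosen only for illustration; the dataset's actual column names are not given in this listing.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple

@dataclass(frozen=True)
class McqRecord:
    """One multiple-choice item (hypothetical field names)."""
    question: str
    options: Tuple[str, str, str, str]   # exactly four answer options
    answer_index: int                    # 0-based index of the correct option
    explanation: Optional[str] = None    # free-text rationale, may be absent

    def __post_init__(self):
        if len(self.options) != 4:
            raise ValueError("expected exactly four options")
        if not 0 <= self.answer_index < 4:
            raise ValueError("answer_index must be in 0..3")

    @property
    def answer(self) -> str:
        return self.options[self.answer_index]

def accuracy(records: Sequence[McqRecord], predictions: Sequence[int]) -> float:
    """Fraction of records whose predicted index matches the answer index."""
    if not records:
        return 0.0
    correct = sum(r.answer_index == p for r, p in zip(records, predictions))
    return correct / len(records)

# Made-up sample item, not taken from the dataset.
example = McqRecord(
    question="Deficiency of which vitamin causes scurvy?",
    options=("Vitamin A", "Vitamin C", "Vitamin D", "Vitamin K"),
    answer_index=1,
)
```

Storing the answer as an index rather than as text keeps scoring a simple integer comparison, which is how multiple-choice benchmarks of this kind are typically evaluated.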
Visit Site →YouTube's short-form video platform that delivers bite-sized vertical video content, now being integrated into the Google TV home screen for discovery on big screens.
Visit Site →Google's subscription service offering expanded cloud storage, access to advanced Gemini AI features, and additional benefits across Google products.
Visit Site →Humanoid robots manufactured by Unitree, a company also known for quadruped 'robot dogs,' designed for industrial and service tasks such as airport baggage handling and custodial work.
Visit Site →A privacy-focused cloud storage service offering end-to-end encrypted file storage with lifetime subscription plans. It is open source, audited by Securitum, and provides cross-platform compatibility across Android, iOS, web, and desktop.
No URLTesla's humanoid robot designed for general-purpose tasks, currently being manufactured at scale at Tesla's Fremont factory with plans for a dedicated manufacturing facility in Austin.
Visit Site →Google's digital wallet app for Android that stores credit cards, boarding passes, and other digital documentation, featuring conveniences like flight tracking widgets and push notifications for travel updates.
Visit Site →A sovereign cloud platform operated by Schwarz Digits, the IT division of German retail conglomerate Schwarz Group, designed to provide data-sovereign cloud infrastructure for European enterprises.
Visit Site →A publicly available synthetic dataset by NVIDIA containing multilingual OCR training data with pixel-precise annotations across multiple languages, used to train the Nemotron OCR v2 model.
Visit Site →A large-scale multilingual web corpus covering dozens of scripts including Latin, CJK, Cyrillic, Arabic, Devanagari, and Thai, used as a source of realistic text for training data generation.
Visit Site →A software-based networking technology created and open-sourced by Google in 2023, designed to improve the efficiency of data center communications, including for AI workloads running on both Google TPUs and Nvidia GPUs.
Visit Site →OpenAI's educational resource platform that provides tutorials, guides, and learning materials for building and using AI agents and other OpenAI products.
Visit Site →An Arabic-language benchmark for evaluating LLM knowledge and reasoning across multiple subjects, modeled after the English MMLU benchmark.
Visit Site →An Arabic-adapted version of the HumanEval+ code evaluation benchmark, enabling assessment of LLM coding capability with Arabic-language problem statements.
No URLAn Arabic-adapted version of the MBPP+ code evaluation benchmark, designed to assess programming ability using Arabic-language problem descriptions.
SpaceX's portable satellite internet service designed for travelers and remote locations, providing broadband connectivity via a constellation of low-Earth orbit satellites.
An NVIDIA research project that trains robots by feeding them 44,000 hours of human activity videos, using novel techniques like relative action transformation and compressed information learning to bridge the sim-to-real gap in robotics.
Amazon Web Services' custom-designed ARM-based server CPUs optimized for cloud workloads, offering improved performance and energy efficiency for general-purpose computing tasks.
Nvidia's edge AI processor designed for high-performance computing in resource-constrained environments, used in Kepler Communications' orbital compute satellites.
A benchmark that evaluates LLM reasoning through pencil puzzles involving constraint satisfaction problems closely related to NP-complete problems, with deterministic step-level verification that cannot be gamed through memorization.
A standalone messaging app from X (formerly Twitter) that offers end-to-end encrypted text messaging, audio and video calling, document sharing, group chats, and message editing/deletion capabilities.
A widely-used open-source multimedia framework for handling video, audio, and other multimedia files and streams, used extensively across the software industry.
A hardware and software product by Astropad that turns an iPad into a wireless second display for Mac or PC, powered by the company's proprietary LIQUID display protocol.
A software product by Astropad that turns an iPad into a professional drawing tablet for Mac, enabling creative professionals to use iPad with desktop creative applications.
Google Cloud's custom-designed AI accelerator chips, known as tensor processing units (TPUs), optimized for training and running large-scale machine learning models.
Apple's mixed reality headset that combines augmented and virtual reality capabilities, offering spatial computing experiences. It received a lackluster market reception despite being part of Apple's ambitious AR/VR strategy.
A quadruped robot by MirrorMe Technology designed to solve high-speed locomotion physics, serving as the research foundation for the Bolt humanoid robot.
A two-legged humanoid robot by MirrorMe Technology (a Shanghai robotics startup) that broke the world record for fastest humanoid robot at 10 meters per second (22.4 mph).
A benchmark measuring well-specified knowledge work tasks across 44 occupations, on which OpenAI claims GPT-5.2 is the first model to perform at or above human expert level.
A benchmark measuring the ability of AI models to perform tasks in the terminal, particularly relevant for coders.
NVIDIA's open-source AI model with 550 billion total parameters and up to 55 billion active per token, built on a hybrid Mamba-Transformer mixture-of-experts architecture. It is claimed to be five times faster and 30% cheaper than comparable frontier open models, with full open access to the model, training scripts, and data.
A 12B-parameter Mixture-of-Experts model by JetBrains, trained from scratch on natural language and code. It activates only 2.5B parameters per token, enabling efficient high-throughput, low-latency inference for tasks like routing, RAG, summarization, coding features, and private deployments, released under the Apache 2.0 license.
An advanced large language model by OpenAI in the GPT-5 series, released in late 2025 as part of the wave of models that enabled new agentic AI capabilities.
A large language model by Anthropic in the Claude Opus family, featuring a new 'dynamic workflow' tool that enables more flexible and adaptive AI task execution.
A Google open-weights language model available in GGUF format for efficient local inference, used as a recommended LLM for the Reachy Mini local voice pipeline.
Cursor's in-house AI coding model built on top of Moonshot's Kimi K2.5 base model, achieving near-frontier performance on coding benchmarks at significantly lower cost than competing models like Claude Opus 4.7 and GPT-5.5.
A compact 0.6-billion parameter language model from Alibaba's Qwen3 series, used to demonstrate delta weight sync where per-step payload drops from 1.2 GB to a fraction of that.
An open-source AI model project developed by Nous Research that serves as a foundation for building AI agents and assistants with advanced reasoning and function-calling capabilities.
Meta's flagship AI model that powers AI features across its family of apps including Facebook, Instagram, and WhatsApp.
Nvidia's GPU architecture designed for AI and high-performance computing workloads, adopted and deployed by every major hyperscaler, cloud provider, and model maker for data center AI infrastructure.
Proprietary physics-grounded AI models developed by SandboxAQ (an Alphabet spinout) that perform quantum chemistry calculations, molecular dynamics simulations, and microkinetics modeling for drug discovery and materials science applications.
A modernized BERT-based encoder architecture used as the backbone for building embedding models, offering improved performance for downstream tasks like retrieval and text representation.
A remote sensing foundation model that uses separate tokens per resolution when processing Sentinel-2 satellite data, demonstrating significantly better results with this approach compared to single-token methods.
A compact 97M-parameter multilingual embedding model by IBM that produces 384-dimensional embeddings and achieves the highest retrieval score (60.3 on MTEB Multilingual Retrieval) among open multilingual embedding models under 100M parameters. It supports 200+ languages, 32K-token context, and code retrieval across 9 programming languages under the Apache 2.0 license.
An open-weight large language model developed by Alibaba's Qwen team, capable of running locally on consumer hardware.
A family of on-device foundation models developed by Liquid AI, designed for efficient local inference on consumer hardware.
A small multilingual embedding model that serves as a baseline for sub-100M parameter multilingual retrieval, scoring 50.9 on MTEB Multilingual Retrieval across 18 languages.
A 1.7 billion parameter language model developed by Alibaba, compact enough for efficient fine-tuning while capable of producing coherent reasoning across various tasks.
A series of tabular foundation models (TFMs) developed by Prior Labs that make predictions from structured data in tables and databases, with open-source versions downloaded over three million times.
An Indian AI startup that develops open-source AI models focused on Indian languages and has expanded into hardware development and commercial partnerships, positioning itself as a competitor to Krutrim.
A foundational AI model for robotics developed by Genesis AI, designed to control human-shaped robotic hands and perform complex physical manipulation tasks such as cooking, lab work, and object manipulation.
A domain-specific large language model trained on biomedical text, designed to improve performance on medical NLP tasks compared to general-purpose models.
Google's medical domain large language model, a successor to Med-PaLM, designed to achieve high performance on medical question-answering benchmarks and clinical reasoning tasks.
A BERT-based language model fine-tuned on clinical text data, designed to improve performance on clinical NLP tasks such as diagnostic coding and medical document understanding.
A prior mixture-of-experts research project by Allen AI that explored routing tokens to experts based on predefined semantic domains during pretraining.
Google's Flash family of AI models optimized for speed and cost-efficiency, offering strong price-performance ratios for a wide range of language and reasoning tasks.
An AI lab developing open-source foundation models. The company has also signed deals with the U.S. Department of Defense to deploy its AI models on classified networks.
Google's AI research lab responsible for developing frontier AI models and systems, including contributions to robotics, game-playing AI, and scientific discovery. Former DeepMind researcher David Silver recently raised $1.1B to build AI that learns without human data.
An AI startup founded by quantum physicist Eve Bodnia that aims to challenge the foundational architecture of mainstream AI by building a fundamentally different kind of intelligence system.
A family of dense, decoder-only large language models by IBM in 3B, 8B, and 30B sizes, trained on approximately 15 trillion tokens with up to 512K context length, released under the Apache 2.0 license.
A DeepMind AI program that learned to play the board game Go at a superhuman level through reinforcement learning, famously defeating the world's top professional players without being fed human strategies.
A British AI lab founded by former DeepMind researcher David Silver, focused on building a 'superlearner' AI that discovers knowledge and skills through reinforcement learning without relying on human-generated data.
An open-source large language model that achieved top-tier benchmark results, tied at number one among open-source models at the time of its release.
A model announced by Anthropic, noted for limited availability potentially due to both security concerns and compute constraints at the company.
A suite of open-source language models released across 8 sizes with 154 checkpoints per model, designed to enable research into training dynamics and scaling behavior.
A 176-billion parameter open-access multilingual large language model developed through a large-scale collaborative research effort.
A 175-billion parameter open-source large language model released by Meta for research purposes, designed to match the scale of GPT-3.
A 32B-parameter Mixture-of-Experts language model by IBM with 9B active parameters, serving as the predecessor to the Granite 4.1 family.
A beta release AI model from xAI that serves as a transitional step between the current Grok generation and upcoming larger-scale models, featuring supplemental training improvements.
India's first GenAI unicorn, originally focused on building large language models for Indian languages, now pivoting to AI cloud infrastructure services. Founded by Bhavish Aggarwal, it raised $50 million at a $1 billion valuation.
A large-scale instruction-tuned language model from Alibaba's Qwen series with strong multilingual capabilities including Arabic, used as one of two automated assessors in QIMMA's benchmark validation pipeline.
A new AI model from Anthropic that has drawn interest from senior U.S. government officials and major banks for testing and evaluation.
A family of ultra-efficient 1.58-bit open-source language models that restrict weights to three possible values (-1, 0, 1), enabling AI to run on consumer and edge devices with dramatically reduced model sizes while maintaining competitive performance.
A compact sentence-transformer model that generates vector embeddings for semantic similarity tasks. The ONNX variant (onnx-community/all-MiniLM-L6-v2-ONNX) enables efficient feature extraction in browser environments.
A smaller, more cost-efficient variant in OpenAI's GPT-5.4 model family, designed for lightweight tasks while maintaining strong language capabilities.
A generalist robot foundation model by Physical Intelligence that demonstrates compositional generalization, enabling robots to perform tasks they were never explicitly trained on by combining skills learned in different contexts.
An unreleased Anthropic model that outperforms Claude Opus 4.7 on most benchmarks including coding, obscure knowledge, and computer navigation, but is not yet publicly accessible.
An open-source large language model from Zhipu AI (Z.ai) that is positioned as one of the best open-source models available for general use.
A cybersecurity-focused variant of OpenAI's GPT-5.4 large language model designed with lowered safety refusal boundaries to enable defensive cybersecurity research, such as helping researchers find vulnerabilities in code and systems.
A 400B-parameter open-weight reasoning model built by Arcee AI, a 26-person U.S. startup, on a $20 million budget. Released under the Apache 2.0 license, it aims to be the most capable open-weight model from a non-Chinese company, offering both on-premises and cloud-hosted API access.
Google's family of lightweight, open AI models designed to run locally on devices. Gemma-based speech recognition models power on-device transcription in Google's dictation applications.
A bilateral reference network model for high-resolution image segmentation, available on Hugging Face as ZhengPeng7/BiRefNet, commonly used for background removal tasks.
Anthropic's newest frontier AI model, described as more powerful than its previous Opus models, featuring strong agentic coding and reasoning capabilities. It is being previewed for cybersecurity applications through Project Glasswing to identify zero-day vulnerabilities in software.
An unreleased frontier AI model by Anthropic (codenamed Capybara) that demonstrates unprecedented software engineering and cybersecurity capabilities, including the ability to find and exploit software vulnerabilities at a level surpassing most skilled humans. It achieved 93.9% on SWE-bench Verified and was the first model to solve a private cyber range end-to-end.
Anthropic's previous flagship AI model, considered a game changer for cybersecurity applications before being succeeded by Mythos.
An upcoming AI model from OpenAI that is reportedly nearing release. It is separate from OpenAI's cybersecurity-focused product and has been the subject of speculation and misinformation.
Anthropic's new AI model release featuring agentic capabilities including 'agent swarms' and 'agent teams,' scoring nearly 30% on the APEX-Agents professional tasks benchmark in one-shot trials and 45% with multiple attempts.
A transformer architecture by François Fleuret at Meta that extends the classic decoder-based transformer with latent variables to make underlying decisions about sequence generation, such as generating consistent positive or negative movie reviews.
A Google Research architecture that learns to memorize at test time, enabling models to go beyond current context windows by maintaining memory across chunks of long sequences, presented at NeurIPS.
OpenAI's model described as representing an inflection point in AI capabilities. It ran uninterrupted for one week writing 3 million lines of code to create a browser from scratch, and solved multiple Erdős math problems.
Meta's most capable pre-trained base model to date, codenamed Avocado, developed by Meta Superintelligence Labs. It outperformed the best open-source base models and was competitive with leading post-trained models even before post-training.
A miniature model within the GPT-5.1 system that decides whether a user's query is worth spending extended thinking time on.
A voice AI startup that builds small, low-latency models and its own orchestration layer to power automated customer support calls in Africa and the Middle East, handling localized dialects of English, French, and Arabic.
A dictation product by Google built into Gboard that enables voice-based typing across apps on mobile devices.
A voice-based typing and dictation application that turns spoken input into structured text for productivity workflows.
A dictation app that enables users to control their computers and write text through voice input, increasingly used alongside coding and productivity tools.
An AI-powered dictation application that converts speech to polished text, competing in the growing market of intelligent voice-to-text tools.
An AI-powered dictation app that converts speech to text, part of the growing ecosystem of intelligent voice transcription tools.
An AI dictation application that transcribes speech into text, competing in the voice-to-text app market alongside other AI-powered dictation tools.
An Indian AI startup focused on voice-based AI solutions including speech recognition, conversational AI, and voice automation tools for enterprise and consumer applications.
A dictation and transcription app that can transcribe from voice, audio files, or video files. It lets users choose and download various AI models including its own and Nvidia's Parakeet speech-recognition models, and supports custom prompts to steer output.
An offline-first AI dictation app with no subscription model that uses local models for transcription. It supports over 99 languages, works on Mac and Windows, and offers an open-source version users can self-host.
A Y Combinator-backed voice-typing app for Windows and macOS that claims to be one of the fastest dictation tools in terms of latency. It offers autofill text phrases, grammar and punctuation handling, and its own speech-to-text API for third-party integration.
An open-source, free transcription tool that runs on Mac, Windows, and Linux. It provides basic voice-to-text functionality with minimal customization options.
A family of speech-recognition AI models developed by Nvidia, designed for accurate automatic speech recognition across various use cases.
An AI-powered system-level dictation tool by Nothing that converts speech into formatted text across any app, removes filler words, supports custom voice shortcuts, and offers translation across over 100 languages.
An audio encoder model with 0.6 billion parameters developed by NVIDIA, used as the audio processing component in the Nemotron 3 Nano Omni architecture for speech recognition and audio understanding tasks.
A speech-to-text model available on Hugging Face that converts spoken audio into text, designed for use in local inference pipelines on edge devices.
An AI-powered dictation app that uses speech recognition to turn voice into text, available on macOS and iPhone with features like action key mapping for quick activation.
An AI-powered dictation application that converts speech to text, competing in the growing market of voice-to-text productivity tools.
An AI-powered dictation tool that transcribes speech into text, part of the emerging category of intelligent voice input applications.
An offline-first AI dictation app by Google for iOS that uses Gemma-based automatic speech recognition models to transcribe speech, automatically filtering out filler words and polishing text into clean prose. It offers text transformation options like key points, formal, short, and long formats, and can import custom vocabulary from Gmail.
An AI-powered dictation and voice-to-text app that features a floating button interface on Android for easy system-wide access to transcription from anywhere on the device.
A code-focused AI model from Microsoft's MAI family designed for code generation tasks, available in Microsoft Copilot and VS Code.
A free, private AI chat service by DuckDuckGo that provides access to multiple language models including Claude, Llama, Mistral, and GPT without requiring an account, with all chats kept private and not used for training.
A family of diffusion language models by NVIDIA that generate multiple tokens in parallel and iteratively refine them, offering significantly faster text generation than traditional autoregressive models. Available in multiple scales including 8B parameters, with both base and instruction-tuned variants, supporting autoregressive, diffusion, and self-speculation generation modes.
DuckDuckGo's optional AI-powered feature that generates summary overviews of search results, functioning as the search engine's alternative to Google's AI Overviews.
An open-source large language model by Meta, part of the Llama 4 family, designed for efficient text generation and reasoning tasks.
A compact 24-billion parameter language model by Mistral AI, designed to offer strong performance in a smaller, more efficient model size.
A family of six state-of-the-art CrossEncoder reranker models ranging from 17M to 1B parameters, built on ModernBERT encoders and designed for document reranking in retrieval pipelines. They support up to 8K tokens of context for long-document reranking.
A 300M parameter embedding model by Google designed for text retrieval tasks, used as a fast embedder that can be paired with rerankers in retrieve-then-rerank pipelines.
A fast static embedding model by Sentence Transformers designed for sub-millisecond CPU retrieval, suitable as the first stage in retrieve-then-rerank pipelines.
A large reranking model by mixedbread.ai used as a teacher model for knowledge distillation when training smaller reranker models.
A family of large language models developed by Anthropic, offering AI assistant capabilities for businesses and consumers, including an incognito mode for private conversations.
A 7-billion parameter instruction-tuned large language model from the Qwen series by Alibaba, capable of running on-premise for domain-specific reasoning tasks including manufacturing analysis.
The large language model and AI assistant from Elon Musk's xAI, offered as a cloud AI provider that third-party tools can connect to.
A specialized 4B-parameter cybersecurity language model fine-tuned from Qwen3-4B-Instruct for defensive cyber tasks such as CWE classification, CVE-to-CWE mapping, and structured cyber threat intelligence Q&A. It is designed to run locally on a single consumer GPU with 12 GB VRAM, making it suitable for air-gapped and sensitive environments.
An Apache-2.0-licensed 4B-parameter instruction-tuned language model from the Qwen family by Alibaba, serving as a high-performing base for fine-tuning on specialized tasks.
A LoRA fine-tuned clinical question-answering model built on AMD hardware using ROCm, based on Qwen3-1.7B and trained on MedMCQA data to answer multiple-choice medical questions with clinical explanations.
An AI-powered writing assistant built into Gmail by Google that helps users draft emails. It now includes topic contextualization (pulling relevant info from Google Drive and Gmail) and tone/style personalization to match the user's writing voice.
An 8B-parameter cybersecurity-specialized language model developed by Cisco, designed for cyber threat intelligence tasks including CTI benchmarks. It serves as a baseline for evaluating domain-specific cybersecurity models.
ByteDance's AI chatbot and large language model platform that competes with other major Chinese and Western AI assistants.
Zhipu AI's AI assistant and model platform, developed by the Chinese AI company also known as Knowledge Atlas Technology, which is publicly traded in Hong Kong.
A massive open-weight mixture-of-experts large language model by DeepSeek with 1.6 trillion total parameters (49 billion active) and a 1 million token context window, designed for efficient long-context inference and agentic workloads.
Perplexity's AI search and answer engine API that combines large language models with real-time web search to provide grounded, up-to-date responses.
An AI-powered non-fiction book generator that transforms real-world knowledge, notes, and research into structured, full-length manuscripts of up to 300,000 words, using multiple AI models and real-time research to produce publish-ready content.
A Germany-based AI company that develops specialized language models targeting enterprises and public institutions in Europe, including the PhariaAI suite, with a focus on sovereign AI and European language support.
A suite of specialized language models developed by Aleph Alpha, designed for enterprise and public sector use cases in Europe with a focus on data sovereignty and compliance.
DeepSeek's previous-generation open-weight large language model with 671 billion parameters, serving as the predecessor to the V4 model family.
A large-scale open-weight mixture-of-experts language model by Moonshot AI with 1.1 trillion total parameters.
A large language model by Google used for text generation tasks. In this context, it is employed for Korean-language narrative generation within NVIDIA's synthetic data pipeline.
An 8-billion parameter large language model from the Qwen series by Alibaba, used as a base model for fine-tuning on various tasks including agentic e-commerce conversations.
An AI model by OpenAI designed to accelerate life sciences research and drug discovery, released by the OpenAI for Science team.
Meta's persona-driven AI companions that allow users to interact with AI-generated characters in one-to-one conversations, which have been restricted for teen users due to safety concerns.
A library developed by Meta AI Research for efficient text classification and word representation learning, now available on the Hugging Face Hub.
Anthropic's lighter, faster model recommended for everyday simple tasks as a more cost-effective alternative to Opus 4.6.
A Hugging Face open-source library that implements a cascaded VAD → STT → LLM → TTS pipeline, exposing a Realtime API-compatible WebSocket for building local voice conversation systems.
A text-to-speech model from Alibaba's Qwen series that is expressive, low-latency, multilingual, and supports custom voices for voice agent applications.
A text-to-speech application by ElevenLabs that converts written content into natural-sounding audio, allowing users to listen to articles, PDFs, and other text-based content.
Amazon's AI-powered feature that generates podcasts using artificial intelligence through its Alexa platform, though it has been criticized for producing low-quality content.
OpenAI's latest real-time voice model that enables natural spoken conversation capabilities, representing an advancement in voice-based AI interaction.
OpenAI's advanced real-time voice model that uses GPT-5 class reasoning to handle complex conversational requests through voice interaction, available via API.
An AI startup building voice AI tools, including low-latency text-to-speech models optimized for real-time applications and multilingual support.
An ONNX-based text-to-speech model available on Hugging Face that converts text into natural-sounding speech, suitable for local deployment on edge devices.
Google's latest text-to-speech generator capable of handling emotions and different meta tags for more expressive and controllable speech synthesis.
A voice-to-voice translation suite by DeepL that provides real-time translation for meetings, mobile and web conversations, and group settings, with add-ons for platforms like Zoom and Microsoft Teams and an API for custom integrations.
An AI startup that uses artificial intelligence to modify a speaker's accent in real time, primarily aimed at call center agents to improve communication clarity.
A Dubai-based AI company focused on speech synthesis and translation for media and entertainment companies, helping them dub and localize video content.
An AI startup building a real-time speech translation engine designed to preserve both the meaning and the speaker's original voice during translation.
A world simulation model by Nvidia capable of simulating environments with up to 4 different players simultaneously.
A variant of Google's Omni video generation model optimized for faster and more efficient video clip generation, positioned as an upgrade from Veo 3 with improved capabilities and multi-scene generation by default.
A YouTube AI-powered tool that allows creators to generate AI backgrounds and video content for YouTube Shorts using text prompts.
An updated version of Google's Veo 3 video generation model, used as a quality comparison baseline against the newer Google Omni model for video generation tasks.
A 2B-parameter video world model by NVIDIA capable of generating physically plausible videos conditioned on text, images, or video clips, useful for applications like synthetic robot trajectory generation.
An AI-powered video dubbing tool built on LTX 2.3 that can translate and redub video audio into different languages while adjusting lip sync and facial movements to match the new language.
A mysterious video generation tool from Google, expected to be unveiled at Google I/O 2026.
An AI-powered video translation and dubbing platform that generates lip-synced video content in multiple languages, widely used for video localization.
An NVIDIA platform that generates simulation data for training robots and self-driving cars, creating synthetic environments for safe autonomous system development.
An AI model that automatically detects cuts and transitions in videos, identifying transition types such as hard cuts, dissolves, fades, slides, and cross zooms. It was trained on 2.5 million raw internet videos and 300,000 synthetic training videos containing over 11 million labeled transitions, and is available as a free Hugging Face space or for local download at only 164 megabytes.
An open-source AI framework that generates interactive video game worlds featuring multiple agents and multiple synchronized camera angles, useful for creating training data for robotics.
A training-free, plug-and-play method that can be added on top of video generation models like WAN to enable seamless multi-scene transitions within a single generated video.
An AI video generation startup that offers creative tools and AI agents for end-to-end creative work across text, image, video, and audio, including real-time hybrid filmmaking capabilities for production studios.
An AI-powered video creation app that was acquired by Pinterest, originally built by the Belarusian founders who later went on to create GRAI.
An AI creative platform that offers access to multiple video generation models including Kling, Veo, Hailuo, Seedance, and LTX, allowing users to generate videos from images and text prompts.
A video generation AI model by Kuaishou Technology that can create videos from image inputs and text prompts, with the ability to also generate accompanying audio.
An AI-powered creative platform specializing in video generation and editing, offering tools to create videos from images and text prompts with various generative models.
A video generation model developed by Alibaba that reportedly tops performance charts for AI-generated video content.
An AI video generation model by MiniMax that creates videos from text and image inputs, available through various platforms.
Overworld's real-time video world model that generates interactive, explorable 3D environments on consumer GPUs at up to 720p/60 FPS, with a 360p tier for broader hardware compatibility including gaming laptops and Apple Silicon Macs.
A variant of Kuaishou's Kling 3.0 AI video generation model, integrated into Adobe Firefly's library of third-party AI models for creative video production.