Hypothesis

8 Matching Annotations

May 2026
x.com x.com

https://x.com/adcock_brett/status/2054973511572271172

1
1. fxp007 19 May 2026
  
  in Public
  
  The robots are reasoning directly from camera pixels
  
  大多数AI系统需要预处理数据或使用复杂的中间步骤，但作者声称他们的机器人直接从相机像素进行推理，这挑战了人们对计算机视觉系统架构的普遍理解，暗示了一种更高效的处理方式。
  
  non-consensus ai computer-vision
Visit annotations in context

Tags

non-consensus

computer-vision

ai

Annotators

fxp007

URL

x.com/adcock_brett/status/2054973511572271172
geohot.github.io geohot.github.io

https://geohot.github.io//blog/jekyll/update/2026/04/23/us-win-ai.html

1
1. fxp007 01 May 2026
  
  in Public
  
  The good world is where everyone has AI, and not as a revokable privilege through an API, but through hard possession.
  
  作者提出了一个关于AI普及的愿景，即每个人都应该拥有AI，而不是将其作为一种可以撤销的API特权。
  
  vision-for-ai AI-accessibility
Visit annotations in context

Tags

AI-accessibility

vision-for-ai

Annotators

fxp007

URL

geohot.github.io//blog/jekyll/update/2026/04/23/us-win-ai.html
Apr 2026
blog.google blog.google

https://blog.google/innovation-and-ai/products/gemini-app/gemini-app-now-on-mac-os/

1
1. fxp007 17 Apr 2026
  
  in Public
  
  We're building the foundation for a truly personal, proactive and powerful desktop assistant, with more news to share in the coming months.
  
  这段声明揭示了Google的长期愿景——不仅是提供AI工具，而是创建一个主动、个性化的桌面助手。这种从被动响应到主动预测的转变代表了AI发展的前沿方向，可能预示着未来操作系统与AI的深度融合。
  
  ai-vision future-tech
Visit annotations in context

Tags

future-tech

ai-vision

Annotators

fxp007

URL

blog.google/innovation-and-ai/products/gemini-app/gemini-app-now-on-mac-os/
x.com x.com

https://x.com/teortaxesTex/status/2042017378054086973

1
1. fxp007 16 Apr 2026
  
  in Public
  
  would have succeeded if it had vision and agentic loop
  
  令人惊讶的是：作者暗示GLM-5.1的失败可能源于缺乏视觉能力和智能代理循环，这揭示了当前AI发展的关键瓶颈——多模态整合和自主决策能力可能是未来AI突破的关键所在。
  
  surprising ai-capabilities vision-agency
Visit annotations in context

Tags

ai-capabilities

surprising

vision-agency

Annotators

fxp007

URL

x.com/teortaxesTex/status/2042017378054086973
Jan 2026
www.youtube.com www.youtube.com

LLMs Are About to Be Replaced - Yann LeCun’s V-JEPA Changes Everything!

1
1. stopresetgo 08 Jan 2026
  
  in Public
  
  for - Yann Lecun - paper - Yann Lecun - AI - LLMs are dead - language is optional for reasoning - to paper - VL-JEPA: Joint Embedding Predictive Architecture for Vision-language - https://hyp.is/eSxi8OxGEfCF7QMFiWL9Fg/arxiv.org/abs/2512.10942
  
  Comment - That language and reasoning are separate is obvious. - If we look at the diversity of life and its ability to operationalize goal seeking behavior, that already tells you that - Michael Levin's research on goal-seeking behavior of organisms and the framework of multi-scale competency architecture validates Lecun's insight - Orders of magnitude fewer efficiency of Lecun's team's prototype compared to LLM also validates this
  
  Yann Lecun paper - Yann Lecun - AI - LLMs are dead AI - LLMs are dead to paper - VL-JEPA: Joint Embedding Predictive Architecture for Vision-language
Visit annotations in context

Tags

Yann Lecun

paper - Yann Lecun - AI - LLMs are dead

to paper - VL-JEPA: Joint Embedding Predictive Architecture for Vision-language

AI - LLMs are dead

Annotators

stopresetgo

URL

youtube.com/watch
May 2025
en.wikipedia.org en.wikipedia.org

Chinese room - Wikipedia, the free encyclopedia

1
1. l3slie 04 May 2025
  
  in Public
  
  Gottfried Leibniz made a similar argument in 1714 against mechanism (the idea that everything that makes up a human being could, in principle, be explained in mechanical terms. In other words, that a person, including their mind, is merely a very complex machine).
  
  anatomy of a landscape / atrocity exhibition
  
  vision ai
Visit annotations in context

Tags

vision

ai

Annotators

l3slie

URL

en.wikipedia.org/wiki/Chinese_room
Apr 2020
opencv.org opencv.org

About

1
1. raj_reddy 24 Apr 2020
  
  in Public
  
  OpenCV (Open Source Computer Vision Library) is an open source computer vision and machine learning software library. OpenCV was built to provide a common infrastructure for computer vision applications and to accelerate the use of machine perception in the commercial products. Being a BSD-licensed product, OpenCV makes it easy for businesses to utilize and modify the code. The library has more than 2500 optimized algorithms, which includes a comprehensive set of both classic and state-of-the-art computer vision and machine learning algorithms. These algorithms can be used to detect and recognize faces, identify objects, classify human actions in videos, track camera movements, track moving objects, extract 3D models of objects, produce 3D point clouds from stereo cameras, stitch images together to produce a high resolution image of an entire scene, find similar images from an image database, remove red eyes from images taken using flash, follow eye movements, recognize scenery and establish markers to overlay it with augmented reality, etc. OpenCV has more than 47 thousand people of user community and estimated number of downloads exceeding 18 million. The library is used extensively in companies, research groups and by governmental bodies. Along with well-established companies like Google, Yahoo, Microsoft, Intel, IBM, Sony, Honda, Toyota that employ the library, there are many startups such as Applied Minds, VideoSurf, and Zeitera, that make extensive use of OpenCV. OpenCV’s deployed uses span the range from stitching streetview images together, detecting intrusions in surveillance video in Israel, monitoring mine equipment in China, helping robots navigate and pick up objects at Willow Garage, detection of swimming pool drowning accidents in Europe, running interactive art in Spain and New York, checking runways for debris in Turkey, inspecting labels on products in factories around the world on to rapid face detection in Japan. It has C++, Python, Java and MATLAB interfaces and supports Windows, Linux, Android and Mac OS. OpenCV leans mostly towards real-time vision applications and takes advantage of MMX and SSE instructions when available. A full-featured CUDAand OpenCL interfaces are being actively developed right now. There are over 500 algorithms and about 10 times as many functions that compose or support those algorithms. OpenCV is written natively in C++ and has a templated interface that works seamlessly with STL containers.
  
  ai vision machine learning documentation
Visit annotations in context

Tags

vision

ai

documentation

machine learning

Annotators

raj_reddy

URL

opencv.org/about/
Sep 2016
www.nytimes.com www.nytimes.com

A Lesson of Tesla Crashes? Computer Vision Can’t Do It All Yet

1
1. scottpony 20 Sep 2016
  
  in Public
  
  The Tesla accident in May, researchers say, was not a failure of computer vision. But it underscored the limitations of the science in applications like driverless cars despite remarkable progress in recent years, fueled by digital data, computer firepower and software inspired by the human brain.
  
  Testing annotations. Interesting statement.
  
  self driving cars tesla AI computer vision
Visit annotations in context

Tags

AI

self driving cars

computer vision

tesla

Annotators

scottpony

URL

nytimes.com/2016/09/20/science/computer-vision-tesla-driverless-cars.html

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL

Tags

Annotators

URL