Sign Up [1] | Advertise [2] | View Online [3]
TLDR
TOGETHER WITH [Incogni] [4]
TLDR AI 2024-09-20
YOUR ONLINE PRIVACY MATTERS. TAKE BACK CONTROL WITH INCOGNI (SPONSOR)
[4]
If you don't mind having your personal data available to every
spammer, scammer, and bad actor who's willing to pay for it, skip this
ad.
Still here? Check out Incogni [4], the hassle-free way to protect your
data privacy:
* Incogni scans people search sites for your personal information
and sends removal requests on your behalf.
* Within about 14 days, your records are off the dark corners of the
internet.
* Every 10 days, Incogni does it all over again.
* You stay in the loop with regular privacy reports.
Take back control. Reduce spam, scam, and cyber risk.
Get 60% off Incogni with code TLDRAI [4] (30 day money back guarantee)
HEADLINES & LAUNCHES
SNAP IS INTRODUCING AN AI VIDEO-GENERATION TOOL FOR CREATORS (2
MINUTE READ) [5]
Snapchat has announced a new AI video-generation tool for select
creators that enables video creation from text and soon image prompts.
The tool, powered by Snap's foundational video models, will be
available in beta on the web. Snap aims to compete with companies like
OpenAI and Adobe but has not shared output examples yet.
APPLE INTELLIGENCE IS NOW AVAILABLE IN PUBLIC BETAS (2 MINUTE READ)
[6]
Apple has released public betas of iOS 18.1, iPadOS 18.1, and macOS
Sequoia 15.1 that feature new Apple Intelligence tools like text
rewriting and photo cleanup. Only the iPhone 15 Pro, iPhone 16, iPhone
16 Pro, and M1 iPads and Macs support these AI features. Final
versions are expected in October.
CRUISE ROBOTAXIS RETURN TO THE BAY AREA NEARLY ONE YEAR AFTER
PEDESTRIAN CRASH (2 MINUTE READ) [7]
Cruise is resuming operations in Sunnyvale and Mountain View, starting
with human-driven vehicles for mapping, with plans to progress to
supervised AV testing later this fall. This follows a settlement and
leadership change after an October 2023 crash. Cruise has issued
software updates and signed a partnership with Uber for robotaxi
services starting in 2025.
RESEARCH & INNOVATION
V-STAR: TRAINING VERIFIERS FOR SELF-TAUGHT REASONERS (31 MINUTE READ)
[8]
V-STaR is a novel approach to improving large language models that
utilizes both correct and incorrect solutions generated during
self-improvement to train a verifier, which then selects the best
solution at inference time. The method has shown significant
improvements in accuracy on code generation and math reasoning
benchmarks compared to existing approaches, potentially offering a
more efficient way to enhance LLM performance.
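As a rough illustration of the idea described above, the sketch below
labels generated solutions as correct or incorrect (the kind of data a
verifier could be trained on) and then picks the highest-scoring
candidate at inference time. Everything here is a hypothetical
stand-in: the function names, the toy correctness check, and the toy
scoring function are assumptions for illustration, not the paper's
code or training procedure.

```python
from typing import Callable, List, Sequence, Tuple


def label_solutions(
    solutions: Sequence[str], is_correct: Callable[[str], bool]
) -> List[Tuple[str, bool]]:
    """Pair each generated solution with a correctness label.

    Both correct and incorrect solutions are kept, since both are
    useful signal for training a verifier.
    """
    return [(s, is_correct(s)) for s in solutions]


def select_best(
    candidates: Sequence[str], verifier_score: Callable[[str], float]
) -> str:
    """Return the candidate the verifier scores highest (best-of-k)."""
    if not candidates:
        raise ValueError("need at least one candidate")
    return max(candidates, key=verifier_score)


# Hypothetical stand-ins: a real setup would check answers against
# ground truth or unit tests and score them with a trained verifier.
def toy_is_correct(solution: str) -> bool:
    return solution.strip().endswith("= 4")


def toy_verifier_score(solution: str) -> float:
    return 1.0 if toy_is_correct(solution) else 0.0


candidates = ["2 + 2 = 5", "2 + 2 = 4", "2 + 2 = 22"]
training_data = label_solutions(candidates, toy_is_correct)
best = select_best(candidates, toy_verifier_score)
print(best)
```

In practice the scoring function would be a learned model; the
selection step itself stays this simple.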
FAST 3D GENERATION FROM SINGLE IMAGES (31 MINUTE READ) [9]
Vista3D is a new framework that generates 3D models from a single
image in just 5 minutes. Using a two-phase approach, it quickly forms
rough geometry before refining the details, capturing both visible and
hidden aspects of objects for more complete 3D reconstructions.
HEART MONITORING FROM FACIAL VIDEOS (GITHUB REPO) [10]
PhysMamba is a new framework designed for remote heart monitoring via
facial videos, addressing challenges in capturing long-range
physiological signals.
ENGINEERING & RESOURCES
AIAI BOSTON: THE EAST COAST'S MOST SIGNIFICANT SUMMIT FOR APPLIED
AI'S BUILDERS & EXECS (SPONSOR) [11]
Uniting engineering teams & tech leadership unleashing the LLM
revolution, AIAI Boston returns on October 16-18.
3 co-located summits. 500+ attendees. CXO speakers from Runway,
NVIDIA, Takeda, Optum.
LEADERS: apply [12] for your Chief AI Officer Summit pass.
ENGINEERS: explore [13] Generative AI Summit & Computer Vision
Summit.
GOT OCR (GITHUB REPO) [14]
GOT-OCR 2.0 is a general-purpose optical character recognition (OCR)
model that reads text from images with strong performance. This
version also dramatically improves in-the-wild OCR.
FISH SPEECH (GITHUB REPO) [15]
Powerful voice generation and single-shot voice cloning. Completely
open source and easy to get running.
1X GENIE (GITHUB REPO) [16]
Genie is a video generation model for world model systems. 1X
Technologies has open-sourced a version that mirrors the one it
trained internally.
MISCELLANEOUS
OPENAI SAYS IT'S FIXED ISSUE WHERE CHATGPT APPEARED TO BE MESSAGING
USERS UNPROMPTED (3 MINUTE READ) [17]
A Reddit user reported that OpenAI's ChatGPT initiated a conversation
unprompted, leading to speculation about new engagement features.
OpenAI acknowledged the issue and issued a fix, attributing it to a
glitch with unsent messages. Debate continues over the authenticity of
the incident, with similar reports from other users.
ANNOUNCING PIXTRAL 12B (8 MINUTE READ) [18]
Pixtral 12B excels in multimodal tasks while maintaining
state-of-the-art performance on text-only benchmarks, and supports
variable image sizes in a 128K token context window. Its architecture
includes a new 400M parameter vision encoder and a 12B parameter
multimodal decoder based on Mistral Nemo. Pixtral outperforms many
open and closed models in multimodal reasoning and instruction
following without compromising on text capabilities.
SCALING: THE STATE OF PLAY IN AI (13 MINUTE READ) [19]
LLMs like ChatGPT and Gemini are becoming increasingly capable as
they scale up in size, data, and computing power, leading to improved
performance across various tasks. Current Gen2 models like GPT-4 and
Claude 3.5 are leading the market, with upcoming Gen3 models expected
to further escalate capabilities and costs. The discovery of a new
scaling law in AI, pertaining to increased "thinking" during
inference, promises further advancements in AI performance beyond just
model training.
QUICK LINKS
OVERLAP (PRODUCT LAUNCH) [20]
Overlap (YC S24) is a new AI-powered iOS app that curates the best
short video clips on literally any topic you're interested in - built
for those quick work or study breaks.
MISTRAL LAUNCHES A FREE TIER FOR DEVELOPERS TO TEST ITS AI MODELS (2
MINUTE READ) [21]
Mistral AI has launched a free tier to let developers fine-tune and
build test apps with its models and slashed API prices by over 50%.
A PROMPTABLE RETRIEVAL MODEL (GITHUB REPO) [22]
Promptriever is the first retrieval model that can be prompted like a
language model.
If you have any comments or feedback, just respond to this email!
Thanks for reading,
Andrew Tan & Andrew Carr
Links:
------
[1]
https://tldr.tech/ai?utm_source=tldrai
[2]
https://advertise.tldr.tech/?utm_source=tldrai&utm_medium=newsletter&utm_campaign=advertisetopnav
[3]
https://a.tldrnewsletter.com/web-version?ep=1&lc=df5c4ca8-734c-11ef-b5ad-9577e7a7de79&p=fd6c50ee-7739-11ef-a98a-c12e9d91840d&pt=campaign&t=1726838822&s=93fe2c338de52a92dfaaf295d6fb71f70965da2c91b02fcd9500e483b4b28812
[4]
https://get.incogni.io/aff_c?offer_id=1151&aff_id=16286
[5]
https://techcrunch.com/2024/09/17/snap-is-introducing-an-ai-video-generation-tool-for-creators/?utm_source=tldrai
[6]
https://www.theverge.com/2024/9/19/24249206/apple-intelligence-ios-18-1-public-beta?utm_source=tldrai
[7]
https://techcrunch.com/2024/09/19/cruise-avs-return-to-bay-area-year-after-pedestrian-crash/?utm_source=tldrai
[8]
https://arxiv.org/abs/2402.06457?utm_source=tldrai
[9]
https://arxiv.org/abs/2409.12193v1?utm_source=tldrai
[10]
https://github.com/chaoqi31/physmamba?utm_source=tldrai
[11]
https://world.aiacceleratorinstitute.com/location/caioboston/?utm_source=tldrai
[12]
https://world.aiacceleratorinstitute.com/location/caioboston/
[13]
https://world.aiacceleratorinstitute.com/location/boston/
[14]
https://github.com/Ucas-HaoranWei/GOT-OCR2.0?utm_source=tldrai
[15]
https://github.com/fishaudio/fish-speech?utm_source=tldrai
[16]
https://github.com/1x-technologies/1xgpt/tree/main/genie?utm_source=tldrai
[17]
https://futurism.com/openai-chatgpt-initiating-conversations?utm_source=tldrai
[18]
https://mistral.ai/news/pixtral-12b/?utm_source=tldrai
[19]
https://www.oneusefulthing.org/p/scaling-the-state-of-play-in-ai?utm_source=tldrai
[20]
https://www.ycombinator.com/companies/overlap?utm_source=tldrai
[21]
https://techcrunch.com/2024/09/17/mistral-launches-a-free-tier-for-developers-to-test-its-ai-models/?utm_source=tldrai
[22]
https://github.com/orionw/promptriever?utm_source=tldrai
[23]
https://refer.tldr.tech/21532aea/2
[24]
https://hub.sparklp.co/sub_a89cbcf98f89/2
[25]
https://advertise.tldr.tech/?utm_source=tldrai&utm_medium=newsletter&utm_campaign=advertisecta
[26]
https://a.tldrnewsletter.com/unsubscribe?ep=1&l=eedf6b14-3de3-11ed-9a32-0241b9615763&lc=df5c4ca8-734c-11ef-b5ad-9577e7a7de79&p=fd6c50ee-7739-11ef-a98a-c12e9d91840d&pt=campaign&pv=4&spa=1726837268&t=1726838822&s=4b3cdc6f756fd040ec3c8902cfb75b39e56069990f99dd162951f4c2ce98323d
[27]
https://tldr.tech/ai/manage?email=tldr%40synchro.net