Abdoulaye - Abdoulaye Diack

Colab Updates: Julia Support and Gemini Data Science

March 9, 2025

•

Abdoulaye

Google Colab has been updated with interesting new features. Julia is now supported natively, so no more need for workarounds! Plus, the Gemini Data Science agent is now more widely accessible. This agent lets you query data through simple prompts, like asking for trend visualizations or model comparisons. It aims to reduce the time…
10Gbps Over 1km: Taara’s Incredible Silicon Photonics Breakthrough

March 1, 2025

•

Abdoulaye

I find this simply incredible. This new Taara chip is smaller than a fingernail, yet it can transmit data at 10 gigabits per second over a 1KM DISTANCE! 🤯🤯🤯 “In tests at the Moonshot Factory labs, our team has successfully transmitted data at 10 Gbps (gigabits per second) over distances of 1 kilometer outdoors…
Managing ML Projects: A Guide for Beginners and Professionals

March 1, 2025

•

Abdoulaye

How do you manage ML projects? 🤔 A question I hear often!Working in research over the years, I often got asked about the day-to-day of managing machine learning projects. That’s why I’m excited about Google’s new, FREE “Managing ML Projects” guide which I can now point to going forward. it’s only 90 minutes but…
SigLIP 2: Multilingual Vision-Language Encoders Released

February 22, 2025

•

Abdoulaye

Google DeepMind has released SigLIP 2, a family of Open-weight (Apache V2) vision-language encoders trained on data covering 109 languages, including Swahili. The released models are available in four sizes: ViT-B (86M), L (303M), So400m (400M), and g (1B). Why is this important? This release offers improved multilingual capabilities, covering 109 languages, which can…
SMOL: New Open-Source Dataset for Low-Resource Language Machine Translation

February 22, 2025

•

Abdoulaye

🎉 My colleagues and members of the language community have released SMOL, a new open-source dataset (CC-BY-4) designed for machine translation research. SMOL includes professionally translated parallel text for over 115 low-resource languages, with a significant representation of over 50 African languages. This dataset is intended to provide a valuable resource for researchers working…
Small Language Models: Notes from the past couple of weeks 🤖🤯

January 17, 2025

•

Abdoulaye

The past few days have brought interesting developments in small language models that could expand mobile computing and low-resource environment applications. Here’s what caught my attention: • Microsoft’s Phi was made fully open source (MIT license) and has been improved by Unsloth AI. 🚀🔓 Blog: https://unsloth.ai/blog/phi4 • Kyutai Labs based in Paris 🇫🇷 introduced…
Pastra – A Practical Guide to the Gemini Multimodal Live API

January 8, 2025

•

Abdoulaye

Google’s Gemini Multimodal Live API provides developers with tools to build AI applications that process and respond to real-time multimodal input (audio, video, and text). Heiko Hotz, a Gemini expert at Google, has created a project called Pastra, a comprehensive guide to help developers get started with this technology. What the guide covers: Getting…