AI and All Data Weekly - 02 December 2024

Β·

3 min read

#166 - 02-December-2024

image_fx_ (43)

https://bsky.app/profile/paasdev.bsky.social

The Coolness this week

🌐 Andrew Ng's AI Suite OSS
🌐 Open Jupyter Notebooks from github
🌐 Snowpipe Streaming with Kafka Auto Schema
🌐 SenseCAP Watcher AI Physical Device
🌐 cool projects like sdkman and debezium
🌐 Using AgentKit for orchestration
🌐 Super charged voice
🌐 Agents with Memory
🌐 OpenInterpreter lets you run code locally
🌐 Extract Structured Data from Documents
🌐 Unifiy Streaming, Batch, AI
🌐 Automated AI Web Researcher in Ollama
🌐 Cross Platform Screen Sharing
🌐 Collaboration for AI Engineers
🌐 AutoRestTest is a complete testing software for automated API testing that combines the utility of graph theory, Large Language Models (LLMs), and multi-agent reinforcement learning (MARL) to parse the OpenAPI Specification and create enhanced comprehensive test cases.
πŸš€ Graphs not Silos
🐿️ FLUSS: Streaming Storage
🐿️Fluss -> Flow for Flink Real Time Analytics
🌐 TableFlow - iceberg / kafka
❄️ Snowflake Cortex AI + Slack
🌐 Big Pile of Snowflake Queries Dataset
⌨️ 5 Days to GenAI with Kaggle
πŸ™‹πŸ»β€β™‚οΈ Data Engineering Trends
πŸ™‹πŸ»β€β™‚οΈ Flink SQL with AI
πŸ™‹πŸ»β€β™‚οΈ Segmentation Masks detect - SAM2
🌐 Ray Data Scalable Datasets for ML
πŸ•ΆοΈ Ollama Functions as Tools
πŸ•ΆοΈ Ollama - cool functions with Ollama python
πŸ•ΆοΈ Postbot3000 - give it a try
πŸ•ΆοΈ Full way to grab your website with LLM
πŸ•ΆοΈ Very Interesting Web3 Stuff
🚘 Cool Limo Startup in Jersey with AI
🌐 Anthropic Open Source Model Context Protocol
πŸ–₯️ MCP: First Server
πŸ™‹πŸ»β€β™‚οΈ Open Interpreter
πŸ•ΆοΈ Cool Google Tricks
πŸ•ΆοΈ SQL Talk
πŸ•ΆοΈ Google AI Studio
πŸ•ΆοΈ GO and Java - Type Safety
πŸ•ΆοΈ AG2 Agents
πŸ•ΆοΈ Microsoft's updated AUTOGEN Agents
✏️ LLAMAINDEX resume cookbook
πŸ” Airflow with Snowflake
πŸ•ΆοΈ Clean Your Mac with a Shell Script
🐍 Big Friendly Bluesky Extract
πŸ“‘ LLM Observability OSS
πŸ“ NV Ingest from NVIDIA for PDF
πŸ•ΆοΈ NodeJS Editor
πŸ“‘ SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory
πŸ“‘ Cool new markdown like language
πŸ•ΆοΈ Open Source Browser API for AI

New Models

πŸ•ΆοΈ Sparse-Llama-3.1-8B-2of4
πŸ•ΆοΈ NVIDIA Hymba
❄️ Snowflake Arctic Instruct
🀯 QWQ
🀯 Natural Language to SQL
🀯 olmo2 models

Interesting Datasets

🌐 [tulu3 datasets https://huggingface.co/collections/allenai/tulu-3-datasets-673b8df14442393f7213f372

Upcoming

🐍 Dec 5: Global PyData: Virtual: https://global2024.pydata.org/cfp/talk/L9JXKS/
πŸ’»
Dec 19: Conf42 IoT 2024: Virtual: https://www.conf42.com/Internet_of_Things_IoT_2024_Tim_Spann_opensource_build

Recent Tim Stuff

πŸ’» XTremePython 2024 - LLM
πŸ’» PyData NYC
πŸ’» Advanced RAG Techniques @ All Things Open Raleigh 2024
πŸ’» Building Real Time LLM Models
πŸ’» Big Data Conference EU Talk on Open Source Real-Time AI
πŸ’» CloudX AI Real-Time
πŸ’» BuildStuff - Adding Generative AI
πŸˆβ€β¬› Conf42 Prompt Engineering
πŸ₯‘ 06 Nov 2024 AI Alliance Talk in Manhattan
πŸ’» 08 Nov 2024 PyData NYC slides

Apps, Demos, Examples, Models, Notebooks and Projects

🐍 RAG 101
🐦 Milvus Knowledgebase
πŸ‘» AIM Ghosts
πŸš• Unstructured Data - Ghosts - Part 1
✍🏼 Multimodal RAG is not Scary Ghosts
✍🏼 Advanced RAG Techniques

Technologies

Python

Java

Snowflake

Streamlit

AWS

Google Cloud

Azure

CODE + COMMUNITY

Β© 2020-2024 Tim Spann https://www.youtube.com/@FLaNK-Stack (AI + Vectors + LLM + Streaming + IoT)

image_fx_ (44)

Β