AI and All Data Weekly - 02 December 2024
AI+Data Weekly (AI, Data, Iceberg, Polaris, Streamlit, Flink, Kafka, Python, Java, NiFi)
#166 - 02-December-2024
https://bsky.app/profile/paasdev.bsky.social
The Coolness this week
Andrew Ng's AI Suite OSS
Open Jupyter Notebooks from github
Snowpipe Streaming with Kafka Auto Schema
SenseCAP Watcher AI Physical Device
cool projects like sdkman and debezium
Using AgentKit for orchestration
Super charged voice
Agents with Memory
OpenInterpreter lets you run code locally
Extract Structured Data from Documents
Unify Streaming, Batch, AI
Automated AI Web Researcher in Ollama
Cross Platform Screen Sharing
Collaboration for AI Engineers
AutoRestTest is a complete tool for automated API testing that combines graph theory, Large Language Models (LLMs), and multi-agent reinforcement learning (MARL) to parse the OpenAPI Specification and generate comprehensive test cases.
Graphs not Silos
FLUSS: Streaming Storage
Fluss -> Flow for Flink Real Time Analytics
TableFlow - iceberg / kafka
Snowflake Cortex AI + Slack
Big Pile of Snowflake Queries Dataset
5 Days to GenAI with Kaggle
Data Engineering Trends
Flink SQL with AI
Segmentation Masks detect - SAM2
Ray Data Scalable Datasets for ML
Ollama Functions as Tools
Ollama - cool functions with Ollama python
Postbot3000 - give it a try
Full way to grab your website with LLM
Very Interesting Web3 Stuff
Cool Limo Startup in Jersey with AI
Anthropic Open Source Model Context Protocol
MCP: First Server
Open Interpreter
Cool Google Tricks
SQL Talk
Google AI Studio
GO and Java - Type Safety
AG2 Agents
Microsoft's updated AUTOGEN Agents
LLAMAINDEX resume cookbook
Airflow with Snowflake
Clean Your Mac with a Shell Script
Big Friendly Bluesky Extract
LLM Observability OSS
NV Ingest from NVIDIA for PDF
NodeJS Editor
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory
Cool new markdown like language
Open Source Browser API for AI
New Models
Sparse-Llama-3.1-8B-2of4
NVIDIA Hymba
Snowflake Arctic Instruct
QWQ
Natural Language to SQL
olmo2 models
Interesting Datasets
tulu3 datasets: https://huggingface.co/collections/allenai/tulu-3-datasets-673b8df14442393f7213f372
Upcoming
Dec 5: Global PyData: Virtual: https://global2024.pydata.org/cfp/talk/L9JXKS/
Dec 19: Conf42 IoT 2024: Virtual: https://www.conf42.com/Internet_of_Things_IoT_2024_Tim_Spann_opensource_build
Recent Tim Stuff
XTremePython 2024 - LLM
PyData NYC
Advanced RAG Techniques @ All Things Open Raleigh 2024
Building Real Time LLM Models
Big Data Conference EU Talk on Open Source Real-Time AI
CloudX AI Real-Time
BuildStuff - Adding Generative AI
Conf42 Prompt Engineering
06 Nov 2024 AI Alliance Talk in Manhattan
08 Nov 2024 PyData NYC slides
Apps, Demos, Examples, Models, Notebooks and Projects
RAG 101
Milvus Knowledgebase
AIM Ghosts
Unstructured Data - Ghosts - Part 1
Multimodal RAG is not Scary Ghosts
Advanced RAG Techniques
Technologies
CODE + COMMUNITY
© 2020-2024 Tim Spann https://www.youtube.com/@FLaNK-Stack (AI + Vectors + LLM + Streaming + IoT)