FLaNK Stack Weekly for 27 November 2023

Kafka, NiFi, Flink, LLM

27-November-2023

The FLaNK Federation is building...

Happy Thanksgiving!

Please give to https://www.glwd.org/

FLaNK Stack Weekly

Tim Spann @PaaSDev

https://pebble.is/PaaSDev

https://vimeo.com/flankstack

https://www.youtube.com/@FLaNK-Stack

https://www.threads.net/@tspannhw

https://medium.com/@tspann/subscribe

Get your new Apache NiFi for Dummies!

https://www.cloudera.com/campaign/apache-nifi-for-dummies.html

https://ossinsight.io/analyze/tspannhw

CODE + COMMUNITY

Please join my meetup groups: NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

This is Issue #113

https://github.com/tspannhw/FLiPStackWeekly

https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

https://www.cloudera.com/solutions/dim-developer.html

https://www.cloudera.com/products/dataflow/nifi-dataflow-calculator.html?utm_source=twitter&keyplay=data-flow&utm_campaign=FY24-Q2_Content_Globl_Nifi_SS_Tool_Promos&cid=UNGATED&utm_medium=social-organic&pid=11590424099

https://community.cloudera.com/t5/Community-Articles/New-Cloudera-AMP-with-Amazon-Bedrock-Integration-Now/ta-p/377071?utm_medium=social-organic&pid=11547807644

Project News

NiFi 2.0.0-M1 https://nifi.apache.org/project-documentation.html

https://cwiki.apache.org/confluence/display/NIFI/Release+Notes#ReleaseNotes-Version2.0.0-M1

NiFi 1.24 is out soon with some of the new processors from 2.0.0-M1 and some patches

Parquet, not butter!

https://cwiki.apache.org/confluence/plugins/servlet/mobile?contentId=278466652#content/view/278466652

Articles

https://www.slideshare.net/bunkertor/building-realtime-travel-alerts

https://www.slideshare.net/bunkertor/jconworld-continuous-sql-with-kafka-and-flink

https://www.slideshare.net/bunkertor/endss23tspannintegrating-llm-with-streaming-data-pipelines

https://blog.cloudera.com/5-key-takeaways-from-flink-forward-2023/

https://www.decodable.co/blog/change-data-capture-breaks-encapsulation-does-it-though

https://docs.cloudera.com/cem/2.0.0/release-notes/topics/cem-whats-new.html

https://www.datanami.com/2023/11/17/cloudera-and-nvidia-partner-to-expand-ai-capabilities/?utm_source=BigDATAwire+Newsletter&utm_medium=email-BOTH&utm_campaign=OpenAI+Chaos%3B+Data+Intelligence+at+Databricks%3B+Batch+Plus+Stream&utm_term=7132E3795801H7R&oly_enc_id=7132E3795801H7R

https://netflixtechblog.com/1-streamlining-membership-data-engineering-at-netflix-with-psyberg-f68830617dd1

https://microsoft.github.io/generative-ai-for-beginners/#/

https://blog.roboflow.com/gpt-4-vision-alternatives/

https://ahranemahaganapathy.medium.com/embarking-on-the-data-flow-journey-unveiling-apache-nifis-architecture-and-basics-0af7bfad3654

https://www.linkedin.com/feed/update/urn:li:activity:7090927558904459264/?updateEntityUrn=urn:li:fs_feedUpdate:(V2,urn:li:activity:7090927558904459264)

https://opensource.net/get-started-with-technical-writing/

https://towardsdatascience.com/the-new-best-python-package-for-visualising-network-graphs-e220d59e054e

Videos

https://www.youtube.com/watch?v=psnRObquBfw&t=638s&pp=ygUJVGltIFNwYW5u

Events

On Demand https://events.dzone.com/dzone/Data-Pipelines-Investigating-the-Modern-Day-Stack?utm_bmcr_source=LinkedIn

Open Source Finance Forum. Virtual. https://resources.finos.org/znglist/osff-2023-virtual-presentations/?c=cG9zdDo5OTEzOTk%3D&utm_campaign=OSFF+NYC+2023&utm_content=269713979&utm_medium=social&utm_source=linkedin&hss_channel=lcp-18473937

December 12-14, 2023: OSACon. Online. https://osacon.io/

April 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/

Cloudera Events https://www.cloudera.com/about/events.html

More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

Code

  • https://github.com/tspannhw/FLaNK-Halifax
  • https://github.com/tspannhw/CoC2023
  • https://github.com/tspannhw/PaK-Stocks
  • https://github.com/tspannhw/FLaNK-EveryTransitSystem
  • https://github.com/tspannhw/FLaNK-Ice
  • https://github.com/tspannhw/FLaNK-SaoPauloBrazil
  • https://github.com/tspannhw/FLaNK-ContinuousSQL
  • https://github.com/tspannhw/FLaNK-OpenAi

Models

  • https://github.com/jmorganca/ollama
  • https://github.com/QwenLM/Qwen-VL
  • https://github.com/THUDM/CogVLM
  • https://github.com/haotian-liu/LLaVA
  • https://github.com/SkunkworksAI/BakLLaVA
  • https://huggingface.co/Intel/neural-chat-7b-v3-1

Tools

  • https://github.com/Acly/krita-ai-diffusion
  • https://krita.org/
  • https://github.com/comfyanonymous/ComfyUI
  • https://docs.runpod.io/reference/runpod-apis
  • https://vast.ai/
  • https://github.com/sdan/vlite
  • https://github.com/PKU-YuanGroup/Video-LLaVA
  • https://github.com/dgarnitz/vectorflow
  • https://github.com/doronz88/pymobiledevice3
  • https://www.terminalizer.com/install
  • https://github.com/Augur1989/Augur89/tree/LCARS-Resource-Monitor
  • https://github.com/facebookresearch/llama-recipes/
  • https://github.com/akto-api-security/akto
  • https://owasp.org/www-community/api_security_tools
  • https://github.com/arainho/awesome-api-security
  • https://api-guesser.netlify.app/
  • https://github.com/run-llama/llama_index
  • https://surge-synthesizer.github.io/
  • https://animotion.dev/
  • https://docs.aws.amazon.com/awsaccountbilling/latest/aboutv2/using-free-tier-api.html
  • https://github.com/Christoph-Lauer/Sonogram-Visible-Speech
  • https://tytel.org/helm/

© 2020-2023 Tim Spann

https://github.com/tspannhw/SpeakerProfile

Tim Spann is a Principal Developer Advocate for Zilliz and Milvus. He works with Milvus, Towhee, Attu, GPTCache, Generative AI, HuggingFace, Python, Java, A