Skip to main content

Command Palette

Search for a command to run...

FLaNK Stack for 3 July 2023

Flink-NiFi-Kafka

Published
1 min read
FLaNK Stack for 3 July 2023

3-July-2023

HOLIDAY

FLiPN-FLaNK Stack Weekly

Tim Spann @PaaSDev

My friend wrote an awesome new book on streaming, I highly recommend picking up a copy!

https://leanpub.com/streamprocessingwithapacheflink/c/ucQ5dLcZYAo2

CODE + COMMUNITY

Please join my meetup group NJ/NYC/Philly/Virtual.

http://www.meetup.com/futureofdata-princeton/

https://www.meetup.com/futureofdata-newyork/

https://www.meetup.com/futureofdata-philadelphia/

This is Issue #92

https://github.com/tspannhw/FLiPStackWeekly

https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

Videos

https://www.youtube.com/watch?v=8NrK69WrRq0&ab_channel=PlainSchwarz

Talks

https://www.slideshare.net/bunkertor/meetup-streaming-data-pipeline-development-258709707

https://www.slideshare.net/bunkertor/big-data-fest-building-modern-data-streaming-apps

https://www.youtube.com/live/1xFha8va7pg?feature=share

Articles

https://medium.com/@george.vetticaden/accelerating-ai-data-pipelines-building-an-evernote-chatbot-with-apache-nifi-2-0-and-generative-ai-9d977466ff4c

https://exceptionfactory.com/posts/2023/07/01/streamlining-apache-nifi-cluster-state-migration/

https://medium.com/cloudera-inc/building-a-stateful-streaming-intrusion-detection-system-with-sql-stream-builder-4667c87f347f

https://medium.com/@tspann/cdc-not-cat-data-capture-e43713879c03

https://medium.com/@tspann/functions-anywhere-faas-ee92ecedb248

https://community.cloudera.com/t5/What-s-New-Cloudera/Cloudera-Streaming-Analytics-CSA-1-10-introduces-new-built/ba-p/373443

https://blog.cloudera.com/fraud-detection-with-cloudera-stream-processing-part-1/

https://siliconangle.com/2023/06/27/cloudera-expands-apache-iceberg-support-private-clouds/

https://debezium.io/blog/2023/06/22/towards-exactly-once-delivery/

https://medium.com/data-engineering-chariot/friends-dont-let-friends-use-json-in-their-data-lakes-e8321f4028c3

https://dev.to/thedanicafine/so-you-want-to-speak-at-a-technical-conference-responding-to-a-cfp-54m6

https://www.databricks.com/company/newsroom/press-releases/announcing-delta-lake-30-new-universal-format-offers-automatic

https://www.vox.com/climate/23769186/bad-air-quality-index-wildfires-pollution

https://marcushellberg.dev/java-ecosystem-trends-report-2023

https://hazelcast.com/blog/enriching-kafka-applications-with-contextual-data/

https://dzone.com/articles/streaming-change-data-capture-data-two-ways

Documentation

https://docs.cloudera.com/csa/1.10.0/how-to-ssb/topics/csa-ssb-kafka-kudu-join.html

https://docs.cloudera.com/runtime/7.2.17/index.html

Events

https://attend.cloudera.com/ameropendatalakehousewithcdpon?lid=7vxyhds3tlv7

July 19, 2023: 2-Hours to Data Innovation: Data Flow https://www.cloudera.com/about/events/hands-on-lab-series-2-hours-to-data-innovation.html

October 18, 2023: 2-Hours to Data Innovation: Data Flow https://www.cloudera.com/about/events/hands-on-lab-series-2-hours-to-data-innovation.html

Cloudera Events https://www.cloudera.com/about/events.html

More Events: https://www.linkedin.com/pulse/schedule-2023-tim-spann-/

Code

https://github.com/cloudera/CML_AMP_LLM_Chatbot_Augmented_with_Enterprise_Data/tree/main

NiFi Code

https://github.com/georgevetticaden/evernote-ai-chatbot

Tools

  • https://saurabhs.org/advanced-macos-commands
  • https://github.com/poloclub/wizmap
  • https://high-qr-code-generator.com/
  • https://github.com/salesforce/xGen
  • https://erichartford.com/openorca
  • https://neal.fun/password-game/
  • https://github.com/Kanaries/graphic-walker
  • https://orbstack.dev/
  • https://github.com/apache/parquet-format/blob/master/Encryption.md
  • https://github.com/Stability-AI/generative-models
  • https://github.com/CASIA-IVA-Lab/FastSAM
  • https://github.com/imgly/background-removal-js
  • https://github.com/ooguz/papyrus
  • https://github.com/configu/configu
  • https://www.pinecone.io/
  • https://github.com/orf/gping
  • https://rust-lang.github.io/mdBook/

© 2020-2023 Tim Spann

More from this blog

Unstructured Data Unleashed

199 posts

https://github.com/tspannhw/SpeakerProfile

Tim Spann is a Principal Developer Advocate for Zilliz and Milvus. He works with Milvus, Towhee, Attu, GPTCache, Generative AI, HuggingFace, Python, Java, A