29-January-2024
FLaNK Stack Weekly
Tim Spann @PaaSDev
Get your new Apache NiFi for Dummies!
cloudera.com/campaign/apache-nifi-for-dummi..
ossinsight.io/analyze/tspannhw
Trial: console.us-west-1.cdp.cloudera.com/trial/re..
CODE + COMMUNITY
Please join my meetup group NJ/NYC/Philly/Virtual.
http://www.meetup.com/futureofdata-princeton/
https://www.meetup.com/futureofdata-newyork/
https://www.meetup.com/futureofdata-philadelphia/
This is Issue #122
https://github.com/tspannhw/FLiPStackWeekly
https://www.cloudera.com/solutions/dim-developer.html
Articles
Apache NiFi and Amazon Textract for Machine Learning medium.com/@tspann/apache-nifi-and-amazon-t..
Apache NiFi and Amazon Transcribe for Machine Learning medium.com/@tspann/apache-nifi-and-amazon-t..
Building a Library of Python Processors medium.com/@tspann/building-a-library-of-py..
Harnessing the Power of Apache NiFi and Amazon Polly for Machine Learning medium.com/@tspann/harnessing-the-power-of-..
Building LLM Pipelines with Pinecone, HuggingFace, Python and Apache NiFi medium.com/@tspann/llm-pipelines-with-pinec..
Writing A Gen AI Processor with Python medium.com/@tspann/writing-a-generative-ai-..
Raspberry Pi 5 Setup medium.com/@tspann/i-setup-too-many-sbcs-d6..
Codeless Generative AI Pipelines with Chroma Vector DB & Apache NiFi medium.com/@tspann/codeless-generative-ai-p..
Using NiFi to Augment and Enrich LLM Results with Real-Time Contextual Data medium.com/@tspann/augmenting-and-enriching..
ReadyFlow with WatsonX community.cloudera.com/t5/Community-Article..
AWS Open Source community.aws/content/2bJFKCPKPttVH0yPHPPt3..
Checkpoint Chronicle December 2023 decodable.co/blog/checkpoint-chronicle-dece..
ADSB with NiFi researchgate.net/publication/352469660_Near..
NiFi Security exceptionfactory.com/posts/2021/07/21/singl..
4 Wars orf AI latent.space/p/dec-2023
DocLLM for PDF medium.com/@basics.machinelearning/discover..
GRPC and Protobuf are growing infoq.com/news/2023/12/linkedin-grpc-protob..
Multi-Layered Cache infoq.com/news/2023/10/doordash-multilayere..
CDF Updates community.cloudera.com/t5/What-s-New-Cloude..
LLM infoq.com/articles/large-language-models-ll..
Top 10 Challenges to GenAI datanami.com/2024/01/22/top-10-challenges-t..
Data Engineering in 2024 datanami.com/2024/01/23/data-engineering-in..
EdgeAI docs.omniverse.nvidia.com/dev-guide/latest/.. catalog.ngc.nvidia.com/orgs/nvidia/containe.. developer.nvidia.com/blog/generate-syntheti..? developer.nvidia.com/blog/how-to-build-visi.. developer.nvidia.com/blog/bringing-generati.. developer.nvidia.com/blog/getting-started-o.. developer.nvidia.com/blog/bringing-generati.. jetson-ai-lab.com
Fine Tuning LLM philschmid.de/fine-tune-llms-in-2024-with-trl
Use Markdown in Google support.google.com/docs/answer/12014036
Videos
Seven Videos on Real-Time Streaming medium.com/@tspann/seven-videos-on-real-tim..
Unlocking Financial Data with Real-Time Pipelines (OSACon 2023) youtube.com/watch?v=Q7gF7m4yFi4&ab_chan..
Looking at the New Features of Apache NiFi (Halifax Community over Code) youtube.com/watch?v=_orD9aAXk48&ab_chan..
Utilizing Real-Time Transit Data for Travel Optimization (Halifax Community over Code) Sunday Oct 8 2023, Canada youtube.com/watch?v=OWQmeF-UeEc&ab_chan..
Continuous SQL with Kafka and Flink | Timothy Spann (EN) youtube.com/watch?v=IGs0k240zhU&ab_chan..
Events
Feb 8, 2024: NYC. https://www.meetup.com/new-york-open-source-data-infrastructure-meetup/events/297484047/
18:00 - 18:30 Welcome: Networking & snacks 18:30 - 18:35 Kickoff: Welcome Aiven 18:35 - 19:00 A Guide to Product Experimentation (Erin Mikail Staples, LaunchDarkly) 19:00 - 19:30 Building Real-time Pipelines: A Case Study with Transit Data (Tim Spann, Cloudera) 19:30 ~ 21:00 Food & networking
Feb 2024: Webinar cloudera.com/about/events/webinars/stay-ahe..
Feb 20, 2024: 12-1PM EST. Virtual. Azure Data Tech Groups: DBA Fundamentals Group meetup.com/dba-fundamentals-group/events/29..
Feb 28, 2024: NYC. Cloudera Meetup. Flink meetup.com/futureofdata-princeton/events/29..
March 5, 2024: Princeton. Meetup. GenAI. meetup.com/applied-generative-artificial-in..
March 15, 2024: Princeton. IT Professional Conference at Trenton Computer Festival IEEE Information Technology Professional Conference on Friday, March 15th, 2024 princetonacm.acm.org/tcfpro
April 2024: XtremeJ 2024. Virtual. https://xtremej.dev/2023/schedule/
Cloudera Events https://www.cloudera.com/about/events.html
More Events: linkedin.com/pulse/schedule-2024-tim-spann-..
Code
- github.com/tspannhw/FLaNK-python-watsonx-pr..
- github.com/tspannhw/FLaNK-CDW
- github.com/tspannhw/FLaNK-VectorDB
- github.com/tspannhw/FLaNK-RPI5
- github.com/tspannhw/FLaNK-EdgeAI
- github.com/kevinbtalbert/NiFi-Flows-Demos
- github.com/DataSQRL/apirag
- github.com/tspannhw/FLaNK-python-ExtractCom..
Models
Tools
- github.com/timfraedrich/OutRun
- github.com/build-on-aws/get-the-news-rss-at..
- github.com/langroid/langroid
- github.com/aws-samples/apache-flink-near-on..
- github.com/IncomeStreamSurfer/chatgptassist..
- konpyutaika.github.io/nifikop
- github.com/stas00/ml-engineering
- github.com/LiheYoung/Depth-Anything
- github.com/marklogic/nifi
- github.com/viraniaman94/sendenv
- learn.microsoft.com/en-us/azure/cosmos-db/f..
- developers.google.com/edu/python
- github.com/InstantID/InstantID
- github.com/Corgea/retriever
- github.com/weaviate/weaviate
- github.com/LiheYoung/Depth-Anything
- github.com/qdrant/qdrant
- github.com/rajnandan1/kener
- towardsdatascience.com/running-local-llms-a..
- github.com/huggingface/trl
- harlequin.sh
- jupysql.ploomber.io/en/latest/quick-start.h..
- querybook.org
- wix-incubator.github.io/quix/docs/about
- fugue-tutorials.readthedocs.io
- github.com/async-profiler/async-profiler
- heynote.com
- github.com/theOGognf/finagg
- github.com/InstantID/InstantID
- github.com/huggingface/datatrove
- bernsteinbear.com/blog/scrapscript
- github.com/reorproject/reor
- memgpt.readme.io/docs/index
- github.com/origin-energy/java-snapshot-test..
- github.com/BishopFox/cloudfoxable
- pypi.org/project/Wikipedia-API
- github.com/nutlope/pdftochat
- github.com/NVlabs/Deep_Object_Pose/blob/mas..
- github.com/danvega/todos-http-client
- thenewstack.io/what-you-can-do-with-vector-..
- milvus.io/docs/example_code.md
- newark.com/sbc-powered-drones-for-aerial-in..
- github.com/linkedin/rest.li
- farfetch.github.io/kafkaflow
- thenewstack.io/opentofu-1-6-general-availab..
- huggingface.co/blog/gcp-partnership
- github.com/kanton-bern/hellodata-be
- github.com/assafelovic/gpt-newspaper
- softwaredoug.com/blog/2024/01/24/are-we-at-..
- zed.dev/download
- github.com/vnglst/pong-wars
- github.com/rasbt/LLMs-from-scratch
- github.com/Mihaiii/llm_steer
- cloudevents.github.io/sdk-java/kafka.html
- github.com/lamini-ai/prompt-engineering-ope..
- github.com/lamini-ai/llm-classifier
© 2020-2024 Tim Spann