Pinned · Paul Scalli in CodeX · Real-Time Video Streaming with Kafka · Building an end to end Machine Learning Video Pipeline · 3 min read · Apr 16, 2021
Pinned · Paul Scalli in Towards Dev · Managing the Machine Learning Lifecycle with Mlflow · MLOps, or “Machine Learning Operations,” is the practice of managing the production lifecycle of machine learning models. This involves… · 3 min read · Dec 13, 2022
Paul Scalli · Harnessing the Power of Feature Stores in Machine Learning · In the realm of machine learning (ML), the journey from raw data to insightful predictions is fraught with challenges. One of the most… · 3 min read · Mar 6, 2024
Paul Scalli · Unraveling the Mysteries of Data Mesh: A Guide for Data Pioneers · In the ever-evolving landscape of data management, a new paradigm has emerged, casting a spell of innovation across industries: Data Mesh… · 4 min read · Mar 6, 2024
Paul Scalli in The Pythoneers · Python Decorators: Sprinkling Magic Dust on Your Code for Efficiency · Welcome to the enchanting world of Python decorators, where every line of code holds the potential to be more powerful, more efficient, and… · 3 min read · Mar 6, 2024
Paul Scalli · Python Decorators: The Magic Touch for Your Code · Ahoy, fellow developers! 🧙‍♂️✨ Are you ready to add a sprinkle of magic to your Python code? Decorators are like the secret sauce that can… · 2 min read · Mar 6, 2024
Paul Scalli · How to package and publish your Python Library to pip · Creating and publishing a Python library can be a thrilling journey, much like crafting a new spell in the world of wizardry. In this… · 3 min read · Mar 6, 2024
Paul Scalli · Apache Spark Optimizations: Enhancing Speed, Reducing Memory Usage, and Improving Scalability · Ahoy, data wranglers and code slingers! 🏴‍☠️ Welcome aboard the Spark Optimization, where we’re about to embark on a grand adventure… · 4 min read · Mar 6, 2024
Paul Scalli · The Future of Data Lakes: An Overview of Delta Lake · Delta Lake is an open-source data lake format built on top of Apache Spark that offers robust data management capabilities. It enables… · 2 min read · Feb 8, 2023
Paul Scalli in CodeX · Real-time Processing of Streaming Data: Understanding the Concept and Available Technologies · Streaming data refers to a continuous flow of data that is generated by various sources in real-time. This data can come from various… · 2 min read · Jan 18, 2023