Pinned · Paul Scalli in CodeX · Real-Time Video Streaming with Kafka · Building an end to end Machine Learning Video Pipeline · Apr 16, 2021
Pinned · Paul Scalli in Towards Dev · Managing the Machine Learning Lifecycle with Mlflow · MLOps, or “Machine Learning Operations,” is the practice of managing the production lifecycle of machine learning models. This involves… · Dec 13, 2022
Paul Scalli · Harnessing the Power of Feature Stores in Machine Learning · In the realm of machine learning (ML), the journey from raw data to insightful predictions is fraught with challenges. One of the most… · Mar 6
Paul Scalli · Unraveling the Mysteries of Data Mesh: A Guide for Data Pioneers · In the ever-evolving landscape of data management, a new paradigm has emerged, casting a spell of innovation across industries: Data Mesh… · Mar 6
Paul Scalli in The Pythoneers · Python Decorators: Sprinkling Magic Dust on Your Code for Efficiency · Welcome to the enchanting world of Python decorators, where every line of code holds the potential to be more powerful, more efficient, and… · Mar 6
Paul Scalli · Python Decorators: The Magic Touch for Your Code · Ahoy, fellow developers! 🧙‍♂️✨ Are you ready to add a sprinkle of magic to your Python code? Decorators are like the secret sauce that can… · Mar 6
Paul Scalli · How to package and publish your Python Library to pip · Creating and publishing a Python library can be a thrilling journey, much like crafting a new spell in the world of wizardry. In this… · Mar 6
Paul Scalli · Apache Spark Optimizations: Enhancing Speed, Reducing Memory Usage, and Improving Scalability · Ahoy, data wranglers and code slingers! 🏴‍☠️ Welcome aboard the Spark Optimization, where we’re about to embark on a grand adventure… · Mar 6
Paul Scalli · The Future of Data Lakes: An Overview of Delta Lake · Delta Lake is an open-source data lake format built on top of Apache Spark that offers robust data management capabilities. It enables… · Feb 8, 2023
Paul Scalli in CodeX · Real-time Processing of Streaming Data: Understanding the Concept and Available Technologies · Streaming data refers to a continuous flow of data that is generated by various sources in real-time. This data can come from various… · Jan 18, 2023