Synopsis
In this podcast, I discuss a variety of SQL Server-related topics mixed with a sprinkling of professional development, and I talk with some of the biggest names in the industry. I want to introduce new and familiar topics and talk about them in a way you may not have considered. We are all at different stages as data professionals. I hope you will join me and Steve on the SQL trail, compañeros--you won't regret it.
Episodes
-
Episode 285: Who Is Using Microsoft Fabric
29/05/2025 Duration: 34min
Fabric personas were originally designed to break down the various functional roles within Microsoft Fabric (such as Power BI, Data Factory, Data Activator, Data Engineering, Data Science, Data Warehouse, and Real-time Analytics) into more manageable, bite-sized sections. The goal was to prevent users from feeling overwhelmed by the platform’s breadth. However, this feature has since been discontinued, as it did not effectively communicate the seamless integration between these roles. Still, the underlying concepts can be useful when thinking about how you might approach Fabric from a functional standpoint. Do you like the change to one large white canvas, or did personas have a use for you? Let us know in the comments below. We hope you enjoyed this conversation on personas in Microsoft Fabric. If you have questions or comments, please send them our way. We would love to answer your questions on a future episode. Leave us a comment and some love ❤️ on LinkedIn, X, Facebook, or Instagram. The show notes for toda
-
Episode 284: The Four-Letter Word ETL - Data Movement
04/02/2025 Duration: 58min
Once you have your data stored in OneLake, you'll be ready to start transforming it to improve its usability, accuracy, and efficiency. In this episode of the podcast, Belinda Allen takes us on a delightful journey through Data Flows, Power Query, Azure Data Factory (Pipelines), and discusses the merits of shortcuts. We also learn about a handy way to manually upload a table if you have some static data you need to update. There are many tools and techniques that can be used for data ingestion and transformations. And while some of the options we discuss will be up to individual preference, there are pros and cons to each. One of the blessings and curses of Fabric is that there are many ways of achieving the same result, so what you choose may depend on the nature of the data you have and your goals, but might also be dictated by personal experience. We hope you enjoyed this conversation with Belinda on ingesting and transforming data in Microsoft Fabric. If you have questions or comments, please send
-
Episode 283: Data Lakehouse vs Data Warehouse vs My House
02/01/2025 Duration: 48min
Microsoft Fabric offers two enterprise-scale, open-standard format workloads for data storage: Warehouse and Lakehouse. Which service should you choose? In this episode, we dive into the technical components of OneLake, along with some of the decisions you’ll be asked to make as you start to build out your data infrastructure. We mention two good articles in the podcast that could help inform your decision on the services to implement in your OneLake: "Microsoft Fabric Decision Guide: Choose between Warehouse and Lakehouse" (Microsoft Learn) and "Lakehouse vs Data Warehouse vs Real-Time Analytics/KQL Database: Deep Dive into Use Cases, Differences, and Architecture Designs" (Microsoft Fabric Blog). We hope you enjoyed this conversation on the nuances of data storage within Microsoft OneLake! If you have questions or comments, please send them our way. We would love to answer your questions on a future episode. Leave us a comment and some love ❤️ on LinkedIn, X, Faceboo
-
Episode 282: OneLake - A Deep Dive
26/11/2024 Duration: 34min
In Episode 281, we introduced Microsoft OneLake with a high-level overview. Now we're going deeper with a discussion on the Parquet format, why Microsoft went with the Delta Lake variation, and what the Delta Lake format brings to the table (no pun intended). We'll also examine some "behind the scenes" aspects of file management, and why you'll still be using the GUI to create most of your objects. OneLake is Microsoft's solution to the demand for centralizing all data in one location, eliminating the need to transfer it across multiple systems. We expect this to play out further, however, when we consider scenarios like data sovereignty, geographical data distribution, separation of subsidiary data, and even departmental budgets that may necessitate multiple instances of OneLake. We round out our OneLake deep dive with a conversation on the Direct Lake Mode option for importing data into Power BI, and Eugene shares his perspective on why everyone may not be rushing to jump on the bandwagon just yet. We hope you en
-
Episode 281: OneLake - The OneDrive for Data
22/10/2024 Duration: 31min
As you start using Fabric, having a central location for your data is crucial. OneLake acts as this unified destination, offering a single, consolidated repository for all your data. In this podcast episode, we explore the core features of OneLake and its benefits with our guest, Mariano Kovo, and discuss how it efficiently handles large amounts of data from diverse sources. We'll also dive into the importance of how your data is presented to Azure services, focusing on the Delta Parquet format. Did you know you can explore OneLake data directly through Windows Explorer? Microsoft aims to make a single copy of your data accessible across multiple services, eliminating the need for constant data movement. Shortcuts make it easier to access your data seamlessly within the OneLake environment, enhancing efficiency and accessibility. We hope you enjoy this foundational episode on Microsoft OneLake! If you have questions or comments, please send them our way. We would love to answer your questions on a future epis
-
Episode 280: A Focus on Microsoft Fabric
08/10/2024 Duration: 18min
At the Microsoft Build Conference in May 2023, Microsoft announced the new Fabric, where you could slice and dice all your data harmoniously within the environment. A few months later, Kevin, Eugene, and I discussed this evolution of the Azure Data platform in episode 267, along with our thoughts on the vision for its future, our expectations, and our predictions. Now, more than a year later, we decided it's a good time to take an in-depth look at the platform to see what goals have come to fruition, what predictions have come true, and what may have changed. In this introduction to Season 8, we'll get the conversation started. In the next 10 episodes we'll be taking a deep dive into the reality of what Microsoft Fabric is today, navigating through the nuances, complexities, and sheer vastness of the product. We'll break it down into digestible chunks focused on specific aspects such as: OneLake; Data Warehouse vs. Lakehouse vs. Eventhouse; Data Factory; Microsoft Synapse; Data Governance and Security; Data Activator; Powe
-
Episode 279: SQL Server Migrations Demystified
20/09/2024 Duration: 01h03min
If you use SQL Server, you will eventually have to migrate that instance somewhere – to a new version, a new server, the cloud . . . somewhere. Or perhaps you'll find yourself migrating from another database into SQL Server. No matter which way you slice it, SQL Server migrations can be daunting, not to mention complex and time-consuming. While we know there are risks and many things that can go wrong, the "new" Microsoft continues to put time and effort towards making successful SQL Server migrations attainable for everyone. In this episode of the podcast, we chat with Tejas Shah and Sudhir Raparla, 2 of the Microsoft Project Managers responsible for SQL Server migration tooling. They share practical perspectives on approaching your SQL migration with confidence and the tools and enhancements that will help. During the conversation, Tejas and Sudhir also take us through the 5 migration steps they want you to consider as you undertake your SQL Server migration process. Even though we’ve migrated thousands of
-
Episode 278: Running SQL Server on Azure VMs
30/07/2024 Duration: 01h04min
Can you run SQL Server on an Azure VM? Which VM is best? Is running SQL Server on a VM in Azure the right choice? Find out in this insightful episode with Anders Pedersen! With over 10 different SQL Server services now offered in Microsoft Azure, it can be difficult to know how you want to run your environment. Sometimes, the old ways are the best ways for an organization, and running SQL Server on a VM in Azure is the right fit. In this episode of the SQL Data Partners Podcast, we chat with Anders Pedersen about his experience moving their systems to Azure VMs. We discuss some of the tiering issues, the newest storage tier being rolled out, and how he manages upgrades. Join us for another informative podcast where a seasoned database administrator shares their experience of managing a SQL Server environment. Did you get any good take-aways from today's podcast or have some questions? Leave us a comment and some love ❤️ on LinkedIn, X, Facebook, or Instagram. The show notes for today's episode can be foun
-
Episode 277: PostgreSQL for the SQL Server Crowd
15/07/2024 Duration: 01h02min
Is testing out pgAdmin on your to-do list? In this episode of the podcast, we chat with Ryan Booz, a PostgreSQL advocate at Redgate, about how a SQL Server professional might begin a dive into PostgreSQL, one of the most popular open source databases in the world. Ryan came from a career background in SQL Server, but after experiencing his accidental "jump-into-the-deep-end" PostgreSQL moment, he hasn’t looked back. Naturally, open source presents DBAs and their organizations with many desirable features, but there are certain drawbacks as well. Ryan shares how he navigated his transition into PostgreSQL and raises some points to consider if you are thinking about a switch. We discuss a few of the land mines you might encounter along the way as well as terminology differences in this space. Be sure to check out Planet PostgreSQL for the most recent blog posts from the very folks that are contributing code to the PostgreSQL project. Have you shifted from SQL Server to PostgreSQL? How'd it go? Did you get any g
-
Episode 276: Dynamic SQL and Testing in Isolation
21/05/2024 Duration: 41min
Listener beware! This episode is full of danger as we tackle an interesting use case for Dynamic SQL. Dynamic SQL generally has a bad reputation in SQL Server circles, and with good reason. Dynamic SQL can open the door to many undesirable results - SQL Injection attacks being the most frightening of these. It can also be difficult to read, making maintenance problematic; however, in this episode one brave soul - Marathon's own Laura Moss - explains how she uses Dynamic SQL to help refresh a subset of production data to be used in their development environments. You know we are always suckers for an interesting use case and Laura delivers big time. While you won’t be able to plug and play her example into your environment, we hope it gets the wheels turning if you struggle to update your test environments. Have you found a way to use Dynamic SQL as a tool for good and not evil? Did you get any good take-aways from today's podcast or have some questions? Leave us a comment and some love ❤️ on LinkedIn, Twitter
-
Episode 275: Machine Learning and Power BI
16/04/2024 Duration: 44min
What kinds of problems are organizations solving with Machine Learning? In this episode, we explore a situation where a public works department was looking for more accurate information to predict future water levels based on rainfall, in order to maintain water tank storage for balancing pressure and to prevent overflow flooding. Marathon data solutions consultants Brian Knox and Andy Yao built a custom machine learning model and made the results available through Power BI reporting. We talk through some of the data hurdles the project presented, the tools they used, and how their work provided results the client could rely on. We touch on the Azure ML environment and future integrations that will come with Power BI and ML. Have you done any work in ML or predictive modeling? Did you get any good take-aways from today's podcast? Leave us some love ❤️ on LinkedIn, Twitter/X, Facebook, or Instagram. The show notes for today's episode can be found at Episode 275: Machine Learning and Power BI. Have fun on the SQL Trail!
-
Episode 274: A CMM Case Study
22/02/2024 Duration: 51min
After discussing the Capability Maturity Model in our last episode, it was fate when Andy Levy reached out and suggested a topic that amounts to a case study of his own experience with CMM. As the only data professional in his organization at the time of his hiring, Andy went from fixing problems to slowly increasing his role in the organization and participating in the planning meetings, being in the room where decisions are made and change happens. We think this episode will offer an interesting perspective for those who might be on the fence about the model and are looking for ways to increase their own visibility in an organization. Let us know what you think! What do you think of CMM? Did you get any good take-aways from today's podcast? Leave us some love ❤️ on LinkedIn, Twitter/X, Facebook, or Instagram. The show notes for today's episode can be found at Episode 274: A CMM Case Study. Have fun on the SQL Trail!
-
Episode 273: The Capability Maturity Model for Data Professionals
02/02/2024 Duration: 55min
Have you ever felt stuck in a rut, repeating the same tasks, while knowing there is room for improvement? The Capability Maturity Model may be a way for you to start contributing to those improvements. In this podcast episode, Kevin Kline from SolarWinds walks us through how we might go from simply dealing with issues as they come, to being a contributor in decisions about the future of our organization. Listen in and learn about the levels of CMM, how they relate to those of us in data professions, and how you can apply the methodologies to become a leader who drives positive change, while doing what you love. Let us know what you think! What CMM level are you at presently? Did you get any good take-aways from today's podcast? Leave us some love ❤️ on LinkedIn, Twitter/X, Facebook, or Instagram. The show notes for today's episode can be found at Episode 273: The Capability Maturity Model for Data Professionals. Have fun on the SQL Trail!
-
Episode 272: Performance Tuning Scripts
10/01/2024 Duration: 50min
Do you find yourself repeating the same actions when pulling SQL Server performance metrics? Performance tuning a troublesome SQL Server can be a challenge. Luckily the community continues to produce wonderful folks like Erik Darling who contribute their knowledge to make your life a bit easier. In this episode of the SQL Data Partners podcast, we sit down with Erik and discuss the scripts he built to gather performance metrics. While not every potential issue is captured in these scripts, they'll help you start gathering information so you can decide on the next step to take. Get the scripts. Have you used Erik’s scripts before? Let us know! The show notes for today's episode can be found at Episode 272: Performance Tuning Scripts. Have fun on the SQL Trail!
-
Episode 271: PASS Summit 2023 Wrap-Up
29/12/2023 Duration: 38min
This past November, Eugene and I attended the 2023 PASS Data Community Summit (aka PASS Summit) in Seattle, while Kevin headed down to Orlando for the Microsoft Live! 360 event. Having no remote option this year was fine for us, as it was great to reconnect with our friends and colleagues in the data community, as well as get updates live and in-person from Microsoft on all the new features rolling out. The focus of both conferences was eerily similar, and in this episode we discuss our experiences along with a picture of what 2024 will look like. Azure services continue to be a focus and Fabric is included in that space; however, I was surprised at the number of Postgres sessions. In fact, Postgres got a whopping 30 minutes out of the 90-minute Microsoft keynote. AI continued to be peppered in the conversation and Kevin gives us his take on where some of this is going. As a side reference, if you are interested in learning more about large language models, check out this video by Andrej Karpathy. The
-
Episode 270: Medallion Architecture
21/11/2023 Duration: 39min
Moving up the ranks in the holy technology wars is the medallion architecture, and boy are we interested in getting your thoughts. Not since the 2008 Olympics and Michael Phelps' hundredth-of-a-second win over Milorad Cavic has there been so much controversy around bronze, silver, and gold. This episode of the podcast has its genesis in Databricks and the methodology of getting data into a workable form for all the reporting pieces businesses love so much. We discuss our thoughts on the various layers of a medallion architecture and the implementation in Azure Delta Lake environments. Have a different take? Let us know! The show notes for today's episode can be found at Episode 270: Medallion Architecture. Have fun on the SQL Trail!
-
Episode 269: Why Do I Need a Managed Service Provider?
19/10/2023 Duration: 16min
In this episode, Carlos talks about managed services and shares some of the benefits of working with an MSP, as well as potential cons. The term managed services refers to the practice of outsourcing business administration or management responsibilities to a third party. Why would you ever want to outsource these pieces? Listen in to learn more and see if hiring an MSP would be a good decision for your company. The show notes for today's episode can be found at Episode 269: Why Do I Need a Managed Service Provider? Have fun on the SQL Trail!
-
Episode 268: AI and the Future of the Database
28/09/2023 Duration: 53min
We love hearing from our listeners!!! In this episode, a long-time listener asked about the future of AI in the data platform space. We thought this was a very interesting topic as Microsoft has been including Artificial Intelligence or AI in more and more of its marketing material. In this episode we'll dive into the definition of AI, what features are currently available, how we can leverage those technologies, and where we think this might go in the future. One of the challenges we currently face is all the buzz and excitement around AI. From a data platform vantage point, we started with analytics and training models to analyze the data. Microsoft has suddenly slapped Artificial Intelligence on some of the feature sets, which confuses the issue a bit. We are excited to have Mike Chrestensen from Duke Health as our episode guest to help us sort it all out. Mike has begun leveraging AI in his work and I think he gives some interesting thoughts on how he has used it to help his team go faster. We hope you enjoy
-
Episode 267: Microsoft Fabric
12/09/2023 Duration: 43min
All your data, all your teams, in one place. What am I? If you said Microsoft Fabric, you win! When I interned with Cisco Systems in 2000, I supported a platform called Unified Messaging. At that time, we were talking about getting your email, voice mail, and faxes all in one place. My, how the times have changed. To a certain extent, Microsoft Fabric is an extension, or wrapper, of some of the tools we have talked about in other episodes. The central idea is the ability to store your information in a data lake, and then have multiple tools at your disposal to use that data as required by the business. Power BI is the cherry on top - providing the visualizations and access to the source data that the business users like to get their hands on. In this episode we talk through the architecture and then discuss when organizations might want to adopt Microsoft Fabric. Would you like to hear more about this in a future episode? Let us know and we’ll look to circle back with longtime friend of the podcast Jona
-
Episode 266: Working with Containers
29/08/2023 Duration: 36min
We're kicking off Season 7 with containers! Spinning up a VM may not be such a big deal anymore; however, most of us still have to request one from another group and wait. Even waiting on an Azure VM can be somewhat painful. Wouldn't it be nice to forget about setting up another development environment just to test something that isn't going to stick around? Our guest today is Chuck Bryan, and he talks to us about how he is using containers to support his environments and the flexibility they provide him in his development. While the Linux containers used to get lots of love, there haven't been too many feature updates lately as much of the focus is on Azure services. What is cool to me is there are tools out there that can help us folks running Windows get up and running without having to wait on our infrastructure to upgrade to Windows Server 2016--or have Azure spend. Chuck gives us some insights on how he got started with containers. We discuss what environments might benefit from them--and which ones w