Microsoft Fabric: Shortcutting to a firewall-protected Azure Data Lake, distilled.Less is moreOct 2Oct 2
Microsoft Fabric: Diving into Lakehouse access from local machines and other remotes with Delta-RSSometimes local compute outperformsSep 15Sep 15
Data Architecture: Data capture time and event time in medallion architecture.Data arrangement has the most significant impact on your solution.Aug 25Aug 25
Spark performance: Let cache() or persist() handle your temporary data when possibleLet Spark do the workJul 19Jul 19
Microsoft Fabric: Utilize Shared SparkSessions fully with mssparkutils.notebook.run and runMultipleThere is isolated and then there is a little less isolated.Jun 6Jun 6
Spark SQL: Why the choice of language doesn’t impact performanceExploring the language-agnostic power of Apache Spark SQLMay 24May 24
Microsoft Fabric and Databricks: The low-level challenge of enforcing primary keys and foreign…Key enforment requires performanceMay 18May 18
Microsoft Fabric: Building Pseudo Identity Columns Without monotonically_increasing_id() in SparkUsing non integer keys instead of monotonically_increasing_id() is not the solution.May 1May 1