Christian Henrik ReichMicrosoft Fabric: Shortcutting to a firewall-protected Azure Data Lake, distilled.Less is moreOct 2Oct 2
Christian Henrik ReichMicrosoft Fabric: Diving into Lakehouse access from local machines and other remotes with Delta-RSSometimes local compute outperformsSep 15Sep 15
Christian Henrik ReichData Architecture: Data capture time and event time in medallion architecture.Data arrangement has the most significant impact on your solution.Aug 25Aug 25
Christian Henrik ReichSpark performance: Let cache() or persist() handle your temporary data when possibleLet Spark do the workJul 19Jul 19
Christian Henrik ReichMicrosoft Fabric: Utilize Shared SparkSessions fully with mssparkutils.notebook.run and runMultipleThere is isolated and then there is a little less isolated.Jun 6Jun 6
Christian Henrik ReichSpark SQL: Why the choice of language doesn’t impact performanceExploring the language-agnostic power of Apache Spark SQLMay 24May 24
Christian Henrik ReichMicrosoft Fabric and Databricks: The low-level challenge of enforcing primary keys and foreign…Key enforment requires performanceMay 18May 18
Christian Henrik ReichDelta and Parquet: Integer, GUID/UUID or SHA256 as ID?A study of keysMay 6May 6
Christian Henrik ReichMicrosoft Fabric: Building Pseudo Identity Columns Without monotonically_increasing_id() in SparkUsing non integer keys instead of monotonically_increasing_id() is not the solution.May 1May 1
Christian Henrik ReichMicrosoft Fabric: Accessing CDM data (from Dataverse) from SparkMotivationFeb 71Feb 71