Thursday, March 25, 2010

Presenting Virtually to Columbus PASS Chapter in May

Jeremiah Peschka (blog|twitter) sent out a call for speakers for the Columbus PASS chapter over the twitterverse last month, and invited virtual presenters to apply.  After a short back and forth about topics, we settled on using SSIS to do ETL for data warehouses.  Yes, I've done that one before - but I think it's a great idea to do a presentation more than once.  As Brent Ozar (blog|twitter) just happened to say, there are more "junior" professionals out there than "senior" ones - so tailor your presentations for your audience!  As much as I'd love to do presentations about coding complex Integration Services data flow components (and I would), I can probably count the worldwide audience for that without running out of fingers - and that includes the devs I now know on the SSIS team in Redmond.
The Abstract
Microsoft designed Integration Services with Data Warehouse ETL processes in mind, so loading one should be pretty easy, right?  It's not that bad, but there are a few things you should avoid, and some non-obvious best practices that make your ETL faster and more reliable. Assuming attendees have a basic level of familiarity with the Kimball data warehousing methodology, this talk will explain how to accomplish Kimball-style ETL using SQL Server Integration Services. We'll review the SCD Wizard and how it works for processing your dimension tables - what it's good for, what it's not, and what alternatives you have to it. We'll also examine fact table loading using a surrogate key replacement pipeline, with a focus on using the Lookup component efficiently. Finally, we’ll look at specific and easy performance tweaks, techniques and best practices that apply directly to DW loads. That includes reviewing which Data Flow components to avoid and how to still get your work done without them, and tuning inserts and updates.
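For anyone who hasn't met the term, here is a rough sketch of what "surrogate key replacement" means. It is plain Python, not SSIS, and is only an illustration of the idea: each incoming fact row has its business key swapped for the dimension's surrogate key, which is roughly the job a cached Lookup does in a data flow. The table names, column names, and the -1 "unknown member" key are all made up for this example.

```python
# Conceptual sketch only: plain Python standing in for a cached lookup.
# All table names, column names, and the -1 key are hypothetical.

UNKNOWN_KEY = -1  # assumed surrogate key of an "Unknown" dimension member


def build_lookup(dimension_rows, business_key, surrogate_key):
    """Cache the dimension once as business key -> surrogate key."""
    return {row[business_key]: row[surrogate_key] for row in dimension_rows}


def replace_keys(fact_rows, lookup, business_key, surrogate_key):
    """Swap each fact row's business key for the matching surrogate key."""
    loaded = []
    for row in fact_rows:
        new_row = {k: v for k, v in row.items() if k != business_key}
        # Rows with no dimension match are pointed at the unknown member
        # instead of being dropped.
        new_row[surrogate_key] = lookup.get(row[business_key], UNKNOWN_KEY)
        loaded.append(new_row)
    return loaded


dim_customer = [
    {"CustomerKey": 1, "CustomerCode": "ALFKI"},
    {"CustomerKey": 2, "CustomerCode": "BONAP"},
]
staged_sales = [
    {"CustomerCode": "BONAP", "Amount": 120.0},
    {"CustomerCode": "ZZZZZ", "Amount": 15.5},  # no matching dimension row
]

customer_lookup = build_lookup(dim_customer, "CustomerCode", "CustomerKey")
fact_sales = replace_keys(staged_sales, customer_lookup, "CustomerCode", "CustomerKey")
```

Building the dictionary once and probing it per row is the point of the sketch: the dimension is read a single time rather than queried for every fact row.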
The Goals
For the members of CBusPASS, your goals should be:
  • To walk away with firm knowledge that SQL Server Integration Services really can load a data warehouse, and fast.
  • To pick up at least one useful tip to improve your warehousing ETL processes.
  • To take advantage of the fact that I can't crack a whip and demand your attention like I might be able to if I were physically present - ask demanding questions and don't be intimidated!
And there are always goals for myself:
  • Improve the content and flow of the presentation, since this is the second time around.
  • Learn what it's like to present virtually.  I'd like to know if I can pull it off, what works and what doesn't - it'll come in handy for my day job.
  • Do my best to impress Jeremiah and hope he passes a comment or two the way of the PASS Community Summit program committee.  (Wink, wink!)
The Time and Place
Synchronize your watches for 6:45pm ET on May 13th - details for connecting via LiveMeeting will be up on the Columbus SQL Server User Group website.  If you aren't in Columbus, attend anyway!  I know they won't mind at all.
UPDATE 2010-05-14: I've posted the slides and demo package on SkyDrive.