For those of us who are fortunate enough to work in an environment where tools of all sorts are readily available it is easy to lose sight as to how much of a struggle certain aspects of our job would be without them.
As such I am taking a different approach to the current warehousing project I am working on. My goal is to use all open source (or MS Office for Mac) tools to design the Data Warehouse I am working on for Laundry Time.
The current scope of work is to:
- Work with client to understand warehousing needs
- Determine the required subject areas, we have already discussed:
- Party
- Address
- Inventory
- Equipment
- Payroll
- Invoices
- Source information as recurring feeds from external applications
- Develop Reference Data in a manner that can be ingested
- Build Warehouse
- Build ETL
- Validate Warehouse
- Phase 2: Deploy Automated Data Governance Tools
With this scope of work and the guiding principal of not spending any money on tools this project could get quite complex very quickly. However the first tool we need to make sure we are all on the same page is a good data modeling tool. As Oracle Data Modeler is free and has a connector for Redshift (Thats right were doing it in Redshift and going all out) this seemed to be a natural fit.
Next Steps:
- Complete Goal Mapping – Excel
- Complete Conceptual Data Model – Oracle Data Modeler
I’ll be posting updates as the work progresses.