Last year, right after the summer, in version 4.1 of Pentaho Data Integration, we introduced the notion of dynamically inserted ETL metadata (YouTube video here). Since then we have received a lot of positive feedback on this functionality, which encouraged me to extend it to a few more steps. With support for “CSV Input” and “Select Values” we could already do a lot of dynamic things. However, we can clearly do a lot better by extending our initiative to a few more steps: “Microsoft Excel Input” (which can also read ODS, by the way), “Row Normalizer” and “Row De-normalizer”.

Below I’ll describe an actual (obfuscated) example that you will probably recognize, as it is as hideous as it is simple in its horrible complexity. Let’s assume that this spreadsheet describes the number of products sold on a given date. Spreadsheets like these are usually generated automatically by some kind of pivoting program. Because of that, they contain a varying number of columns, each with a dimension value in the column header. In our sample we have one column for each date since the beginning of this year.
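To make the shape of the problem concrete, here is a minimal Python sketch of what normalizing such a pivoted sheet amounts to. It is not PDI code and not how the “Row Normalizer” step is implemented. The sample data, the `product` key column and the `normalize_rows` helper are all assumptions made up for illustration. The point it shows is that the date columns have to be discovered from the header at run time, because their number changes from file to file.

```python
import csv
import io

# Hypothetical sample: one row per product, one column per date.
# The number of date columns varies from one file to the next.
SAMPLE = """product,2011-01-01,2011-01-02,2011-01-03
Widget,5,0,7
Gadget,2,3,
"""


def normalize_rows(reader, key_field="product"):
    """Turn every date column into its own (product, date, quantity) row."""
    # Discover the date columns from the header instead of hard-coding them.
    date_fields = [name for name in reader.fieldnames if name != key_field]
    for row in reader:
        for date in date_fields:
            value = row[date]
            if value == "":
                # Skip empty cells: no sales figure was recorded.
                continue
            yield {
                "product": row[key_field],
                "date": date,
                "quantity": int(value),
            }


rows = list(normalize_rows(csv.DictReader(io.StringIO(SAMPLE))))
for r in rows:
    print(r)
```

In a fixed transformation the list of date columns would have to be typed into the step by hand and updated every day. Computing it from the header is the part that metadata injection is meant to take over.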