How to upload private data to Knoema and mash up with data from public sources Vladimir Bougay
May 25, 2015
How to upload private data to Knoema and mash up with data from
public sources
Vladimir Bougay
Table of Contents
1. Upload 1-2-32. Working with cross table/pivot table, time series
data– Advanced features: units, multiple frequencies
3. Mashing private and public data4. Working with list/record-based data5. Maintaining your data6. Corporate offering
Upload your data 1-2-3
1. Create an account if you do not have it yet
2. Click “Upload data” in the menu3. Upload your file and provide dataset
name4. Wait for upload to complete (usually 1-
2 mins)
Supported formats
Example of cross table/pivot table
Country Indicator 2009 2010 2011 2012
United States Total Revenue 6,237.50 7,219.80 7,875.00 7,055.20
United States Gross Profit 1,259.10 1,495.50 1,681.20 1,305.50
United States Net Income 315.4 380.2 411.1 -20.2
Canada Total Revenue 8,065.40 8,622.30 10,236.40 11,572.00
Canada Gross Profit 4,636.40 5,073.70 6,027.50 6,923.00
Canada Net Income 1,044.70 1,723.80 1,381.90 1,618.00
Mexico Total Revenue 1,938.60 1,943.00 2,050.50 2,195.50
Mexico Gross Profit 998 964.6 1,073.30 1,139.60
Mexico Net Income 172.5 148 28.6 11.4
Another cross table with units/multifreq. data
Country Indicator Unit 2012Q1 2012Q2 2012Q3 2012Q4 2012
United States Total Revenue USD 6,237.50 7,219.80 7,875.00 7,055.20 28,387.50
United States Gross Profit USD 1,259.10 1,495.50 1,681.20 1,305.50 5,741.30
United States Net Income USD 315.4 380.2 411.1 -20.2 1,086.50
United States Employee Count Persons 23,400.00 23,700.00 24,100.00 24,000.00 24,000.00
Canada Total Revenue USD 8,065.40 8,622.30 10,236.40 11,572.00 38,496.10
Canada Gross Profit USD 4,636.40 5,073.70 6,027.50 6,923.00 22,660.60
Canada Net Income USD 1,044.70 1,723.80 1,381.90 1,618.00 5,768.40
Canada Employee Count Persons 37,200.00 37,500.00 38,400.00 39,000.00 39,000.00
Mexico Total Revenue USD 1,938.60 1,943.00 2,050.50 2,195.50 8,127.60
Mexico Gross Profit USD 998 964.6 1,073.30 1,139.60 4,175.50
Mexico Net Income USD 172.5 148 28.6 11.4 360.50
Mexico Employee Count Persons 6,300.00 6,500.00 6,200.00 6,600.00 6,600.00
Supported date formats/frequencies
Supported data frequencies: annual, semiannual, quarterly, monthly, daily
Statistical date format 2009, 2010, 2011 – years 2009H1, 2013H2 – half-years 2009Q1, 2010Q3, 2012Q4 – quarters 2009M2, 2011M7 – months MM/DD/YYYY, DD.MM.YYYY - days
For Excel spreadsheets you can have any date format when cell format is «Date»
How to mash data?
1. Build a table/chart using data from one dataset first
2. Edit visualization -> Dataset Selection -> Browse for another dataset
3. Make selection in both datasets and get cross-dataset table/chart
How to mash data?
Private data mashed with public one
List/record-based data
Car Maker Model State Color List Price Sale Price Discount Date Sold
BMW BMW X6 Virginia Black 65400 63274.5 3% 5/25/2013
BMW BMW X1 Alabama Green 76400 73229.4 4% 1/15/2013
BMW BMW X3 Alaska Gold 24600 23124 6% 4/23/2013
BMW BMW M3 California Black 78900 73179.75 7% 2/22/2013
BMW BMW M5 California White 23700 22076.55 7% 5/25/2013
BMW BMW Z4 California Gray 56480 55903.904 1% 12/13/2013
BMW BMW M6 Florida Blue 51984 49374.403 5% 12/13/2013
BMW BMW X6 M Florida Silver 66984 64880.702 3% 5/25/2013
BMW BMW X6 M Idaho Yellow 74360 72857.928 2% 12/13/2013
Mercedes C-Class Maryland Silver 57000 54845.4 4% 6/24/2013
Mercedes S-Class Illinois Yellow 21000 18396 12% 1/15/2013
Mercedes E-Class Illinois White 96110 95869.725 0% 2/22/2013
Mercedes GLK-Class Georgia White 98550 96135.525 2% 5/25/2013
Definitions
Every row is a record, every column is a field. Types of fieldsDimension– Used to categorize your data, becomes
dimension– There is a limit for # of distinct values for
dimension fieldsMeasure– Number/currency/value data– All measure fields collapse into Measure
dimensionDate– Used to build time series from your data
Detail– Any additional information attached to every
record
Car sales visualization
Maintaining your data
Q: Where to look for your data?A: Profile [Your Name] -> Datasets -> My datasets
Q: Can I append/update data in a dataset?A: Yes. Open dataset -> Upload data. Update file structure should be compatible with dataset structure
Q: How do I delete a dataset?A: Open dataset -> Edit metadata -> Scroll to the bottom -> Delete -> Confirm
Corporate offering
Knoema has special offering for corporate clients which includes advanced data
toolsetUnlimited data uploads
Hierarchical/ordered dataAdvanced metadata Command line tools
Contact us or learn more athttp://knoema.com/products