Source Data

International trade data are taken from the "World Trade Flows: 1962-2000" dataset of Feenstra, Lipsey, Deng, Ma, & Mo. The dataset records imports and exports by country of origin and by destination, with products disaggregated to the SITC revision 2, four-digit level. The authors built it from the United Nations COMTRADE database and cleaned it by computing exports from the importing country's records whenever those were available, on the assumption that import data are more accurate than export data. This is plausible, as imports are more tightly controlled in order to enforce safety standards and collect customs fees. The authors also correct the UN data for flows to and from the United States, Hong Kong, and China. We use only the export data and do not disaggregate by country of destination. More information on this dataset can be found in NBER Working Paper #11040; the dataset itself is available at www.nber.org/data and http://cid.econ.ucdavis.edu/data/undata/undata.html

Processed Data "Data Format":

Proximity Matrix : "ProductCode1 ProductCode2 Proximity"

Note: This proximity matrix represents the product space between 1998 and 2000. It was built by calculating the proximities for each year (1998, 1999, 2000) and then averaging them. When using this data, please remember that the Product Space is, before anything else, a perspective that can be used to study the evolution of comparative advantage, countries' productive structures, etc., rather than a strictly defined mathematical concept. Feel free to use the data below to construct product spaces with whichever proximity measure you think is most adequate (correlations, mutual information, Jaccard distance, etc.).

Best Wishes

C
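As one illustration of the note above, a proximity based on Jaccard similarity can be sketched in Python. This is a minimal sketch under assumptions, not the procedure used to build the distributed matrix: the function names, the default RCA cutoff of 1.0, and the sample rows are hypothetical, and each row is assumed to carry the first four fields of the RCA matrix layout (ProductCode, CountryCode, Year, RCA).

```python
from collections import defaultdict
from itertools import combinations


def exporters_by_product(rows, year, threshold=1.0):
    """Map each product to the set of countries with RCA >= threshold in `year`."""
    exporters = defaultdict(set)
    for product, country, row_year, rca in rows:
        if row_year == year and rca >= threshold:
            exporters[product].add(country)
    return exporters


def jaccard_proximity(rows, years, threshold=1.0):
    """Average, over `years`, the Jaccard similarity of each product pair's exporter sets.

    A pair that cannot be compared in some year (one product has no
    exporters that year) contributes 0 for that year.
    """
    totals = defaultdict(float)
    for year in years:
        exporters = exporters_by_product(rows, year, threshold)
        for a, b in combinations(sorted(exporters), 2):
            shared = len(exporters[a] & exporters[b])
            union = len(exporters[a] | exporters[b])
            totals[(a, b)] += shared / union
    return {pair: total / len(years) for pair, total in totals.items()}


# Hypothetical sample rows: (ProductCode, CountryCode, Year, RCA)
sample_rows = [
    ("1110", "COL", 1998, 1.2),
    ("1120", "COL", 1998, 2.0),
    ("1110", "ESP", 1998, 1.5),
    ("1120", "ESP", 1998, 0.3),
    ("1110", "COL", 1999, 1.1),
    ("1120", "COL", 1999, 1.4),
]

proximity = jaccard_proximity(sample_rows, [1998, 1999])
print(proximity)
```

In the sample, the two products share one of two exporters in 1998 and their exporter sets coincide in 1999, so the yearly similarities being averaged are 0.5 and 1.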

RCA matrix : "ProductCode CountryCode Year RCA CountryName"

Leamer Annotations: "ProductCode Cluster Name"

Product Characteristics: "ProductCode PRODY effectivenumberofexporters WorldTrade"

ProductNames: "ProductCode ProductName"

NEW: Exports by Country and Year "CountryName ProductCode Year ExportedValue[Thousands of Dollars]" (Fixed after it was noticed that the file was incomplete. Thanks, wu zhili.)
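The RCA values in these files are not defined on this page. Assuming they follow Balassa's standard index (a country's export share in a product divided by that product's share in world exports), they could be recomputed from the exports file for a single year roughly as follows. The function name and the sample figures are hypothetical.

```python
from collections import defaultdict


def balassa_rca(exports):
    """Compute Balassa's RCA index for one year.

    `exports` maps (country, product) to an exported value; the unit
    cancels out, so thousands of dollars work as well as dollars.
    """
    country_total = defaultdict(float)
    product_total = defaultdict(float)
    world_total = 0.0
    for (country, product), value in exports.items():
        country_total[country] += value
        product_total[product] += value
        world_total += value

    rca = {}
    for (country, product), value in exports.items():
        if value == 0:
            # Nothing exported: the index is zero, and this avoids 0/0.
            rca[(country, product)] = 0.0
            continue
        country_share = value / country_total[country]
        world_share = product_total[product] / world_total
        rca[(country, product)] = country_share / world_share
    return rca


# Hypothetical export values for a single year
sample_exports = {
    ("COL", "1110"): 80.0,
    ("COL", "1120"): 20.0,
    ("DEU", "1110"): 20.0,
    ("DEU", "1120"): 180.0,
}

rca = balassa_rca(sample_exports)
print(rca)
```

A value above 1 means the product weighs more in the country's export basket than it does in world trade as a whole.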

NEW: Product Space Explorer

Download Software .zip

Installation:

Unzip all files into the same directory.

Usage:

1.- Open a DOS command prompt (you can find it in Start/Programs/Accessories).

2.- Find your way to the directory where you unzipped all the files.

3.- You can generate a product space map for a country for all years in which we have data available (1975-1980-1985/2000) by executing:

ProductSpaceParser.exe CountryISOcode RCAthreshold

CountryISOcode is the country's three-letter ISO code, for example ESP for Spain and DEU for Germany. RCAthreshold is a number defining the threshold above which a country is considered an exporter of a product.

Example: To generate product space maps for Colombia in which all products with RCA>0.5 are painted as black squares, type:

ProductSpaceParser.exe COL 0.5

4.- The output files are generated in the same directory with the .gml extension. To open them you need to download Cytoscape (just google "cytoscape" and it is the first thing you get). We recommend downloading version 2.6 or 2.4 but not 2.5, as the .gml files do not work with that version.

To visualize the product space, go to File/Import/Network (multiple file types) and point the dialogue box to one of the .gml files generated by the program.

5.- (Optional) You can generate a file with your own RCAs and feed it to the program as an extra input. For example, if you are interested in visualizing the patterns of comparative advantage of California, you can do this by generating a file with California's patterns of comparative advantage in the format:

"ProductSITCCode" "CAL" year RCA

where the quotes are required and the fields are tab delimited. You then feed it to the program as:

ProductSpaceExplorer.exe CAL 1 Cal.txt

where Cal.txt is a file prepared as:

Example of a Cal.txt file

"1110" "CAL" 1989 0.5
"1120" "CAL" 1989 1
.....
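A file in this layout could be produced with a few lines of Python. This is only a sketch: the helper names `format_rca_line` and `write_rca_file` are hypothetical, and the layout (quoted product and region codes, unquoted year and RCA, tab-delimited) is taken from the description above.

```python
def format_rca_line(product_code, region_code, year, rca):
    """Return one tab-delimited line: quoted codes, then unquoted year and RCA."""
    # The "g" format drops trailing zeros (1.0 becomes 1) and keeps
    # at most six significant digits.
    return f'"{product_code}"\t"{region_code}"\t{year}\t{rca:g}'


def write_rca_file(path, records):
    """Write (product_code, region_code, year, rca) records, one per line."""
    with open(path, "w") as handle:
        for record in records:
            handle.write(format_rca_line(*record) + "\n")


# The two rows from the example file above
records = [
    ("1110", "CAL", 1989, 0.5),
    ("1120", "CAL", 1989, 1),
]

lines = [format_rca_line(*record) for record in records]
for line in lines:
    print(line)
```

Calling `write_rca_file("Cal.txt", records)` would then create the input file in the current directory.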