Once a dataset has been loaded into Data & Insights, it is available for download in a number of different formats:
- CSV (Comma Separated Values)
- CSV (Europe)
- XML
- XLSX
If the dataset contains Geospatial information (Point, Line, Polygon, etc.), the dataset will have additional output formats*:
- KML
- KMZ
- GeoJSON
- Shapefile
*Note that exporting geospatial files containing multipolygons using the CSV button will export the data in Well Known Text format. Opening these text files in Excel may result in the text wrapping to the next line if the multipolygon string exceeds the cell character limit. If this happens, open the file with a powerful text editor.
*Exporting geospatial file types will have a max size limitation of 25 gigabytes. For exporting map data that is larger than the limit please export in a flat file format like CSV.
Downloading a dataset can be done from the primer page. Select the Export button in the upper right corner, then choose a format from the Export dataset window.
Ordering and Filtering Downloads
We cannot affect the order of rows when downloading data from an asset by clicking the Export button on the primer page. That said, data downloads from the Data & Insights Open Data Consumer API can be filtered, ordered, and queried.
Exporting Large Datasets
When a user attempts to export a large dataset with over 1M rows, they will get a warning asking them if they want to continue because the file is unlikely to open in many common desktop programs
Comments
Regarding the RDF export, I see a default export exists already, now I've made a quick test table and a couple of test rows.
However I have seen examples - eg opendata.socrata.com/dataset/My-Friends-RDF-Test-/w7f6-86zj - where the RDF is nicely tuned to external / well known RDF schemas. I don't see where this is configured yet, ... can it be done through the Web UI?
ps. you export "Web links" using a property foaf:document that doesn't exist in FOAF; that could do with tweaking somehow
Dan -
The default RDF output isn't super useful since datasets we need more semantics in order to describe the data. You'll want to configure the column to include the semantic type. If you bring up the Column Properties sidebar by clicking on the orange drop-down button in the column header, you can select the predicate you want to use from the "Advanced" section. That'll then be used to describe the data in the RDF output.
Note that you need to be the owner of the dataset in order to modify the column properties, so you should try this on a dataset you've created yourself.
Feel free to let me know if you have more questions.
Thanks,
Chris Metcalf
Director of Product Development
and Developer Evangelism
chris.metcalf (at) socrata.com
Hi Chris!
My question is somewhat related to Dan's; looking for...
Is there a tutorial, and/or can I access a sandbox (this is suggested by your answer to Dan).
Thanks!
John Erickson (TWC RPI)
Article is closed for comments.