Socrataonly supports certain file types on import. All plain text files should be converted to UTF-8 format prior to upload. Converting data into UTF-8 first ensures that the characters are properly preserved when uploaded into Socrata. This can be especially important for foreign characters that contain diacritics.
The following is a guide on how to use some common software to ensure your data is encoded in UTF-8.
HOW TO CONVERT DATASETS TO UTF-8 FORMAT
The steps below will provide instruction on how to ensure your data is in UTF-8 format.
Only the Windows version of Microsoft Excel is capable of creating UTF-8 formatted data files. If you have Microsoft Office on Windows you can use the following instructions:
File > Save As
Tools > Web Options....
Encoding > Unicode (UTF-8)
If Excel is not available, the conversion can also be achieved using free text editors such as Sublime Text or Notepad++.
SUBLIME TEXT INSTRUCTIONS:
File > Save with Encoding > UTF-8
Encoding -> Encode in UTF-8
- For additional format support, read this article: Importing, Data Types, and You!