Converting Data to UTF-8 Prior to Upload

Socrata only supports certain file types on import. All plain text files should be converted to UTF-8 prior to upload. Converting data to UTF-8 first ensures that characters are preserved correctly when uploaded into Socrata. This is especially important for non-ASCII characters, such as letters with diacritics (for example, é or ñ).

The following is a guide on how to use some common software to ensure your data is encoded in UTF-8.

HOW TO CONVERT DATASETS TO UTF-8 FORMAT

The steps below cover Microsoft Excel, Sublime Text, and Notepad++.
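
Before converting, it can help to check whether a file is already valid UTF-8. The following is a minimal sketch in Python, not part of Socrata's tooling; the function names `is_valid_utf8` and `file_is_utf8` are hypothetical and chosen for illustration only.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the given bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


def file_is_utf8(path: str) -> bool:
    """Read a file in binary mode and check that its contents are valid UTF-8."""
    with open(path, "rb") as f:
        return is_valid_utf8(f.read())
```

Note that a file containing only plain ASCII characters also passes this check, since ASCII is a subset of UTF-8. Passing the check does not prove the file was intended as UTF-8, only that its bytes form valid UTF-8.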

EXCEL INSTRUCTIONS

Only the Windows version of Microsoft Excel is capable of creating UTF-8 encoded data files. If you have Microsoft Office on Windows, you can use the following instructions:

File > Save As

Tools > Web Options...

Encoding > Unicode (UTF-8)

If Excel is not available, the conversion can also be achieved using free text editors such as Sublime Text or Notepad++.

SUBLIME TEXT INSTRUCTIONS:

Open the CSV file

File > Save with Encoding > UTF-8

NOTEPAD++ INSTRUCTIONS:

Open the CSV file

Encoding > Convert to UTF-8

File > Save
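
If none of the tools above is available, or many files need converting, the same conversion can be scripted. The sketch below uses Python and assumes the source file is encoded as Windows-1252 (`cp1252`), which is a common default for files exported on Windows; that encoding is an assumption, as are the function name `convert_to_utf8` and the file names in the usage line. Substitute the actual source encoding if it is known to be different.

```python
def convert_to_utf8(src_path, dst_path, source_encoding="cp1252"):
    """Re-encode a text file as UTF-8.

    source_encoding is an assumption about the input file; if it is wrong,
    non-ASCII characters may be converted to the wrong characters.
    """
    # newline="" keeps the original line endings untouched on read and write.
    with open(src_path, "r", encoding=source_encoding, newline="") as src:
        text = src.read()
    with open(dst_path, "w", encoding="utf-8", newline="") as dst:
        dst.write(text)


# Example usage with hypothetical file names:
# convert_to_utf8("data.csv", "data_utf8.csv")
```

Writing to a separate output file leaves the original untouched, so the result can be opened and checked before upload.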

 
