©2020 by Arturo Devesa.


ARTURO DEVESA

AI Lead | Machine Learning Engineer | Tech Manager | Innovator

I'm a technology enthusiast with over 15 years of experience working in academia, startups, and corporations. Learn more on my blog.

 

ABOUT ARTURO

I'm passionate about entrepreneurship, technology, artificial intelligence, programming, startups, teaching, research, business and innovation. I have been a keynote speaker at multiple events worldwide on AI and startups. I've been involved with Florida Atlantic University and Stanford University. I've also worked with Microsoft and Orange Telecom in various forms. I have a lot of experience, but I feel I have little, because the more I learn, the more I realize what I don't know and the more I want to keep learning. I love learning new, challenging things. My latest addition to that continuous learning is Quantum Physics and Quantum Computing for AI.

 
 

Big Data Formats

Updated: Feb 21


Smart companies don't let novices play with their most critical resource, their data.


Different file formats and how to read them in Python:

Comma-separated values

XLSX

ZIP

Plain Text (txt)

JSON

XML

HTML

Images

Hierarchical Data Format

PDF

DOCX

MP3

MP4


What is a file format?


A file format is a standard way of encoding information for storage in a file. First, the format specifies whether the file is binary or plain text (such as ASCII). Second, it defines how the information is organized. For example, the comma-separated values (CSV) format stores tabular data in plain text.
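To make the CSV example concrete, here is a minimal sketch using Python's standard csv module. The sample data and the variable names (raw_text, header, records) are made up for illustration; io.StringIO stands in for a real file opened in text mode.

```python
import csv
import io

# Hypothetical sample data: a CSV file is just text, one record per line,
# with fields separated by commas.
raw_text = "name,score\nAda,90\nLinus,85\n"

# csv.reader accepts any iterable of lines; io.StringIO plays the role
# of a file object here so the example needs no file on disk.
rows = list(csv.reader(io.StringIO(raw_text)))

# The first row holds the column names, the rest hold the records.
header, records = rows[0], rows[1:]

print(header)
print(records)
```

Note that the csv module returns every field as a string, so numeric columns such as score would still need to be converted explicitly.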


To identify a file format, you can usually look at the file extension. For example, a file named “Data” saved in CSV format will appear as “Data.csv”. The “.csv” extension tells us that it is a CSV file and that its data is stored in a tabular format. Keep in mind that the extension is only a naming convention, so it is a strong hint rather than a guarantee.
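The extension check described above could be sketched in Python roughly as follows. The EXTENSION_TO_FORMAT table and the guess_format function are hypothetical names introduced for this example, and the table covers only some of the formats listed in this post.

```python
from pathlib import Path

# Hypothetical lookup table mapping extensions to format names.
# It is illustrative only and far from complete.
EXTENSION_TO_FORMAT = {
    ".csv": "CSV",
    ".xlsx": "XLSX",
    ".zip": "ZIP",
    ".txt": "Plain Text",
    ".json": "JSON",
    ".xml": "XML",
    ".html": "HTML",
    ".pdf": "PDF",
    ".docx": "DOCX",
    ".mp3": "MP3",
    ".mp4": "MP4",
}


def guess_format(filename: str) -> str:
    """Guess a file's format from its extension alone."""
    # Path.suffix returns the final extension including the dot,
    # or an empty string when the name has no extension.
    suffix = Path(filename).suffix.lower()
    return EXTENSION_TO_FORMAT.get(suffix, "unknown")


print(guess_format("Data.csv"))
```

Because this looks only at the name, a renamed file would be misidentified; the result is a guess about the contents, not a check of them.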