Scientific Data formats

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(General)
(Social Sciences)
(7 intermediate revisions by one user not shown)
Line 51: Line 51:
 
* [[ACE (Sequence assembly)|ACE]] (Sequence assembly format)
 
* [[ACE (Sequence assembly)|ACE]] (Sequence assembly format)
 
* [[Affymetrix Raw Intensity Format]]
 
* [[Affymetrix Raw Intensity Format]]
 +
* [[AnnData Object]] (.h5ad)
 
* [[ARF (Axon Raw Format)]]
 
* [[ARF (Axon Raw Format)]]
 
* [[ARLEQUIN Project Format]]
 
* [[ARLEQUIN Project Format]]
Line 83: Line 84:
 
* [[ENCODE]] (Peak information Format)
 
* [[ENCODE]] (Peak information Format)
 
* [[FASTA and FASTQ]] (File format for sequence data, FASTQ with quality)
 
* [[FASTA and FASTQ]] (File format for sequence data, FASTQ with quality)
 +
* [[FAST5]] (.fast5)
 
* [[FuGEFlow]]
 
* [[FuGEFlow]]
 
* [[FuGE-ML]] (Functional Genomics Experiment Markup Language)
 
* [[FuGE-ML]] (Functional Genomics Experiment Markup Language)
Line 218: Line 220:
 
* [[GeoTIFF]] (Geospatial extensions to TIFF)
 
* [[GeoTIFF]] (Geospatial extensions to TIFF)
 
* [[GML]] (Geography Markup Language)
 
* [[GML]] (Geography Markup Language)
* [[HDFEOS, HD2, HD4]] (Hierarchical Data Format-Earth Observing System)
+
* [[HDF-EOS]] (Hierarchical Data Format-Earth Observing System)[https://hdfeos.org/ 1] (HD2, HD4, HD5)
 
* [[KML]] (KML (formerly Keyhole Markup Language), Version 2.2)
 
* [[KML]] (KML (formerly Keyhole Markup Language), Version 2.2)
 
* [[NDF]] (National Landsat Archive Production System (NLAPS) Data Format)
 
* [[NDF]] (National Landsat Archive Production System (NLAPS) Data Format)
Line 226: Line 228:
 
* [[MrSID]] (MrSID- Multi-resolution Seamless Image Database)
 
* [[MrSID]] (MrSID- Multi-resolution Seamless Image Database)
 
* [[TAB]] (MapInfo dataset format, must have component)
 
* [[TAB]] (MapInfo dataset format, must have component)
 +
* [[Bathymetric Attributed Grid]] (.bag)
  
 
== Mathematical ==
 
== Mathematical ==
Line 233: Line 236:
 
* [[graph6, sparse6]] (ASCII encoding of Adjacency matrices (.g6, .s6))
 
* [[graph6, sparse6]] (ASCII encoding of Adjacency matrices (.g6, .s6))
 
* [[graphML]] (Graph Markup Language)
 
* [[graphML]] (Graph Markup Language)
 +
* GraphPad Prism
 +
** [[PZM]]
 +
** [[PZF]]
 +
** [[PZFX]]
 +
** [[PRISM]]
 
* [[JMP]] (.jmp)
 
* [[JMP]] (.jmp)
 
* [[KaleidaGraph]] (.qda, .qdc)
 
* [[KaleidaGraph]] (.qda, .qdc)
Line 339: Line 347:
 
* [[SPS]] ("Syntax file" (plain text command script) for the [[SPSS]] Statistical package)
 
* [[SPS]] ("Syntax file" (plain text command script) for the [[SPSS]] Statistical package)
 
* [[SPV]] (Output file for the [[SPSS]] Statistical package - version 17 and later)
 
* [[SPV]] (Output file for the [[SPSS]] Statistical package - version 17 and later)
 +
* [[Statistix]] (.sx)
 
* [[Transana]] ([[Computer-assisted qualitative data analysis]] package)
 
* [[Transana]] ([[Computer-assisted qualitative data analysis]] package)
  

Revision as of 01:20, 1 August 2024

File Format
Name Scientific Data formats
Ontology

Mad scientist from 1940 movie

Mad scientist from 1940 movie

See also Health and Medicine for medical/biomedical data formats, and also see Engineering.

Contents

General

  • Common Data Format (CDF)
  • EAS3 (binary file format for structured data)
  • HDF (Hierarchical Data Format, originally from NCSA, now maintained by The HDF Group)
  • IGOR (.ibw)
  • NRRD (Nearly Raw Raster Data -- a simple format for n-dimensional raster data)
  • NetCDF (Network Common Data Format)
  • ROOT (CERN data-analysis package and related formats, used in their Open Data initiative)
  • SDXF (Structured Data Exchange Format)
  • Silo (a storage format for visualization developed at Lawrence Livermore National Laboratory)
  • Simple Data format (SDF) By George H. Fisher, Space Sciences Lab, UC Berkeley (A platform-independent, precision-preserving binary data I/O format capable of handling large, multi-dimensional arrays)
  • Standard Delay Format (SDF) A standard data structure for timing data
  • XDF (Extensible Data Format) [1]
  • XSIL (Extensible Scientific Interchange Language)

Astronomical and Space

Biological

Chemical

  • CCP4 (X-ray crystallography voxels (electron density))
  • CDX (ChemDraw file format)
  • CDXML (ChemDraw file format)
  • CHM (ChemDraw file format)
  • CIF (Crystallographic Information File, standardised by IUCr)
  • CML (Chemical markup language)
  • CTab (Chemical table file .mol, .sd, .sdf)
  • HITRAN (spectroscopic data with one optical/infrared transition per line in the ASCII file (.hit))
  • JCAMP (Joint Committee on Atomic and Molecular Physical Data, .dx, .jdx)
  • MOL (MDL Molfile)
  • MOP (MOPAC format)
  • MRC (voxels in cryo-electron microscopy)
  • MST ACD/ChemSketch v1 file format
  • Protein Data Bank (PDB)
  • RPT (OpenLynx) Waters OpenLynx reports
  • RXN (Reaction file format)
  • SK2 (ACD/ChemSketch v2 file format)
  • SKC (ISIS/Draw file format)
  • SMILES (Simplified molecular input line entry specification, .smi)
  • SPC (Spectroscopic Data)
  • Structure Data File (SDF)
  • TGF (ISIS/Draw reaction file format)
  • XYZ Chem Wiki

Chemical data may be distinguished in various ways, including Chemical MIME types.

Earth Sciences

Ecological

Environmental

  • HYT (AquiferTest)

Geographic and Geospatial

See also Geospatial

  • DEM (Digital Elevation Model)
  • DOQ (Digital Orthophotos)
  • e00 (ESRI ArcInfo Interchange File)
  • FGDC (Content Standard for Digital Geospatial Metadata??)
  • GeoTIFF (Geospatial extensions to TIFF)
  • GML (Geography Markup Language)
  • HDF-EOS (Hierarchical Data Format-Earth Observing System)1 (HD2, HD4, HD5)
  • KML (KML (formerly Keyhole Markup Language), Version 2.2)
  • NDF (National Landsat Archive Production System (NLAPS) Data Format)
  • SAIF (Spatial Archive and Interchange Format, Canadian)
  • SDTS (Spatial Data Transfer Standard)
  • Shapefile (ESRI, shp/shx)
  • MrSID (MrSID- Multi-resolution Seamless Image Database)
  • TAB (MapInfo dataset format, must have component)
  • Bathymetric Attributed Grid (.bag)

Mathematical

Microscopy

Neutron and X-ray Scattering

  • canSAS (tools for small-angle scattering)
  • CIF (Crystallographic Information File, standardised by IUCr)
  • NeXus (NeXus is a common data format for neutron, x-ray, and muon science)

Oceanographic, Atmospheric and Meteorological

  • GRIB (Gridded Binary)
  • BUFR (Binary Universal Format Representation)
  • IOAPI (netCDF augmented with metadata from the I/O API)
  • Meteosat data
  • PP (UK Met Office format for weather model data)

Physics

See subcategory Physics data

Scientific Signal data

  • ACQ (AcqKnowledge File Format for Windows)
  • BioSemi (BDF) data format
  • BKR (EEG data format)
  • CFWB (Chart Data File Format)
  • EDF (European data format)
  • FEF (File Exchange Format for Vital signs)
  • General Data Format for Biosignals (GDF)
  • GMS (Gesture And Motion Signal format)
  • IROCK (intelliRock Sensor Data File Format)
  • MFER (Medical waveform Format Encoding Rules)
  • REC (ATI Vision recorder file)
  • SCP-ECG (Standard Communication Protocol for Computer assisted electrocardiography)
  • SIGIF (SIGnal Interchange Format)

Social Sciences

Spectra

Miscellaneous

  • AIML (Artificial Intelligence Markup Language)
  • EMD-DF64 (used for high frequency energy monitoring)
  • IES (IESNA LM-63 Photometric Data File)
  • Jupyter Notebook (.ipynb)

Links

Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox