Files: YAML - Working With
Introduction
Working with YAML files in Clojure allows storing and exchanging data in a human-readable and easily editable format. YAML stands for “YAML Ain’t Markup Language” and is a popular data format for representing structured data. It provides a way to represent complex data structures using maps, lists, and scalars. You can easily save and load data with YAML files, making it convenient for configuration files, serialization, and interchanging data between different systems.
To work with YAML files in Clojure, you can use the https://github.com/clj-commons/clj-yaml library. This library provides functions like slurp
to read YAML data from a file and yaml/parse-string
to parse YAML data from a string. Similarly, you can use spit
and yaml/generate-string
to write YAML data to a file. These functions make reading and writing data in YAML format straightforward, allowing you to work efficiently with structured data in your Clojure programs.
Working with YAML Files
Code
example_read.yaml
name: John Doe
age: 30
email: john.doe@example.com
address:
street: 123 Main St
city: New York
state: NY
zip: '10001'
interests:
- programming
- reading
- hiking
files_yaml/core.clj
(ns files-yaml.core
(:require [clj-yaml.core :as yaml]))
;; Reading YAML Files
(def yaml-file (slurp "data_set/example_read.yaml"))
yaml-file
;; => "name: John Doe\nage: 30\nemail: john.doe@example.com\naddress:\n street: 123 Main St\n city: New York\n state: NY\n zip: \"10001\"\ninterests:\n - programming\n - reading\n - hiking\n"
(def kw-yaml-data (yaml/parse-string yaml-file))
kw-yaml-data
;; => {:name "John Doe",
;; :age 30,
;; :email "john.doe@example.com",
;; :address {:street "123 Main St", :city "New York", :state "NY", :zip "10001"},
;; :interests ("programming" "reading" "hiking")}
(class kw-yaml-data)
;; => flatland.ordered.map.OrderedMap
(type kw-yaml-data)
;; => flatland.ordered.map.OrderedMap
(select-keys kw-yaml-data [:name :age :email :interests])
;; => {:name "John Doe", :age 30, :email "john.doe@example.com", :interests ("programming" "reading" "hiking")}
(get-in kw-yaml-data [:address :street])
;; => "123 Main St"
(def non-kw-yaml-data (yaml/parse-string yaml-file :keywords false))
non-kw-yaml-data
;; => {"name" "John Doe",
;; "age" 30,
;; "email" "john.doe@example.com",
;; "address" {"street" "123 Main St", "city" "New York", "state" "NY", "zip" "10001"},
;; "interests" ("programming" "reading" "hiking")}
(class non-kw-yaml-data)
;; => clojure.lang.PersistentArrayMap
(type non-kw-yaml-data)
;; => clojure.lang.PersistentArrayMap
(select-keys non-kw-yaml-data ["name" "age" "email" "interests"])
;; => {"name" "John Doe", "age" 30, "email" "john.doe@example.com", "interests" ("programming" "reading" "hiking")}
(get-in non-kw-yaml-data ["address" "street"])
;; => "123 Main St"
;; Writing YAML Files
(spit "data_set/example_write_kw.yaml" (yaml/generate-string kw-yaml-data))
(slurp "data_set/example_write_kw.yaml")
;; => "name: John Doe\nage: 30\nemail: john.doe@example.com\naddress: {street: 123 Main St, city: New York, state: NY, zip: '10001'}\ninterests: [programming, reading, hiking]\n"
;; Note: the list of interests is rendered in different style than the original file, but the data is the same
;; the written file's parsed yaml data is the same as the original kw file's parsed yaml data
(= (yaml/parse-string (slurp "data_set/example_write_kw.yaml")) kw-yaml-data)
;; => true
(spit "data_set/example_write_non_kw.yaml" (yaml/generate-string non-kw-yaml-data))
(slurp "data_set/example_write_non_kw.yaml")
;; => "name: John Doe\nage: 30\nemail: john.doe@example.com\naddress: {street: 123 Main St, city: New York, state: NY, zip: '10001'}\ninterests: [programming, reading, hiking]\n"
;; Note: the list of interests is rendered in different style than the original file, but the data is the same
;; the written file's parsed yaml data is the same as the original non-kw file's parsed yaml data
(= (yaml/parse-string (slurp "data_set/example_write_kw.yaml") :keywords false) non-kw-yaml-data)
;; => true
;; the written files are the same for non-keyword and keyword yaml data
(= (slurp "data_set/example_write_non_kw.yaml")
(slurp "data_set/example_write_kw.yaml"))
;; => true
;; pretty printing when writing yaml files
(spit "data_set/example_write_pretty.yaml" (yaml/generate-string kw-yaml-data :dumper-options {:flow-style :block
:indicator-indent 2
:indent 6}))
example_write_kw.yaml
name: John Doe
age: 30
email: john.doe@example.com
address: { street: 123 Main St, city: New York, state: NY, zip: '10001' }
interests: [programming, reading, hiking]
example_write_non_kw.yaml
name: John Doe
age: 30
email: john.doe@example.com
address: { street: 123 Main St, city: New York, state: NY, zip: '10001' }
interests: [programming, reading, hiking]
example_write_pretty.yaml
name: John Doe
age: 30
email: john.doe@example.com
address:
street: 123 Main St
city: New York
state: NY
zip: '10001'
interests:
- programming
- reading
- hiking
Explanation
This Clojure code is about reading and writing YAML files. Here’s a more detailed breakdown of what each part does:
- Namespace and Library Requirement: The file defines the namespace
files-yaml.core
. This is a common practice in Clojure to organize code into logical groups. It also requires theclj-yaml.core
library asyaml
. This library, clj-yaml, is a Clojure library that provides functions for parsing and generating YAML, a human-readable data serialization format. - Reading YAML Files: The
slurp
function is used to read the contents of theexample_read.yaml
file into theyaml-file
variable.slurp
is a built-in Clojure function that reads a file into a string. - Parsing YAML Data: The
parse-string
function from theclj-yaml
library is used to parse the YAML data from theyaml-file
variable into a Clojure data structure. This is done twice, once with keywords (kw-yaml-data
) and once without (non-kw-yaml-data
). Keywords in Clojure are symbols used as identifiers, similar to strings, but with some performance benefits. - Inspecting the Data: The
class
andtype
functions are used to inspect the type of the parsed data. These functions help debug and understand the structure of the data. Theselect-keys
function selects specific keys from the data, and theget-in
function is used to access nested data. These functions are part of Clojure’s rich set of data manipulation functions. - Writing YAML Files: The
spit
function writes the parsed data into a new YAML file.spit
is a built-in Clojure function that writes a string to a file. This is done twice, once with the keyword data (example_write_kw.yaml
) and once with the non-keyword data (example_write_non_kw.yaml
). - Verifying the Written Data: The written files are read and parsed to verify that the written data matches the original data. This is done using the
slurp
,parse-string
functions, and the equality (=
) operator. This step is important to ensure the write operation succeeded and didn’t alter the data. - Pretty Printing: The
generate-string
function from theclj-yaml
library is used with the:dumper-options
option to print the YAML data pretty when writing it to a file (example_write_pretty.yaml
). The options specify the style of the output and the indentation levels. Pretty printing makes the output more human-readable, especially for complex data structures.