Visualizing family networks

Francisco Restivo
3 min readJun 16, 2023

--

A family is a very complex network of people. When two persons decide to live together, they start a family. When a child is born, she belongs to a family.

A family tree may be immense, unexpected, or surprising. If one family member meets and decides to live together with someone on another continent, the two families join in a single family.

There are many problems here: how to find information, organize it, and visualize it in a useful way.

Churches used to keep birth, marriage and death records for everyone after the Council of Trent, held between 1545 and 1563. Before that, only important families used to keep records of their members.

In each country, many manuscripts are available through dedicated archiving organizations, and one organization in particular, FamilySearch, provides links to most of those sites, all over the world.

Manuscripts may be difficult to read, and some attempts at automatic reading have to be considered with care.

So, there are people, alive or dead, and there are records, and the questions are to relate people in genealogical trees and to attach records to people.

People usually start by organizing their family tree, using some easy-to-use tool, paper, Excel sheets, or other, and data available in the family.

The next step uses to be to find a computer program, either to run locally or on the Web.

I prefer Web platforms, like Ancestry, MyHeritage, Geneanet, or others, and I use all of them since data can be downloaded/uploaded in the GEDCOM format, which was developed by the Church of Jesus Christ of Latter-day Saints as an aid to genealogical research.

At the moment, I have more than 2000 people in my tree, and to see the full picture becomes more and more difficult.

Two of my great-grandparents, and two generations above and below (print screen from Ancestry)

This representation is very poor and doesn’t provide a view of contemporaneity, or the big picture.

This was the motivation to look to the GEDCOM format.

This GEDCOM file has 47896 lines

Without entering into all the details, there are two basic concepts, the individual and the family. The individual can be attached to one or two families, FAMC from Child, and FAMS from Spouse.

In the picture above, the first individual, me, @12147680407@ is attached to the family created by parents FAMC @F15@ and created a family FAMS @F9@.

The next individual, my father, created the family @F15@ I am attached, and is attached to the family created by his parents, and so on.

So, the first challenge was to extract this bipartite network from a GEDCOM file.

I developed a very short program that does just that, and also the surname associated with each individual.

A very simple Python program

I used Gephi for a first look at the data I extracted.

A family is a reunion of families

It was really interesting to have this idea materialised, and this will be the starting point for a lot of work.

Please comment.

--

--

Francisco Restivo
Francisco Restivo

Written by Francisco Restivo

Husband, father, engineer, educator, dreamer.

Responses (1)