Google Summer of Code 2020

in-toto logo


I spent the last three to four months working on the open source project in-toto as part of my Google Summer of Code stipend at the Cloud Native Computing Foundation (CNCF). Followers of my blog might have already read about in-toto. If you do not know the project, I suggest you have a look on my introduction to in-toto. The introduction article has been written as part of my Google Summer of Code stipend and gives a good overview about the project and what its objectives are.

My challenge

The main objective of my Google Summer of Code stipend has been to port in-toto run functionality from the in-toto Python reference implementation to the Go implementation. The in-toto run functionality is responsible for generating in-toto link data. In-toto links are files, that represent a step in a software supply chain. Each of these steps can be signed and later verified. Adding in-toto run functionality to the Go implementation has been tracked in the following Github issues:

The following key results were to be achieved:

  • Signing generated link data via signature algorithms as specified in the in-toto specification
  • Full support for RSA-PSS, ED25519 and ECDSA.
  • Generating link files

My merged pull request, that addresses these key results can be found here:

During my journey I did a lot more than that. Implementing the above requirements led us (my mentors and me) to a few other issues. These issues were so significant that we decided to solve these issues, while working on the actual in-toto run implementation. The following listing shall give a brief insight on what I have worked on additionally.

  • Cleanup code indentation and multi-line comments:
  • Handling unhandled errors:
  • Reviving the in-toto symlink functionality PR and finishing it (this was a dependency for our in-toto run functionality):
  • Handling excess data returned by pem.Decode:
    • Issue #14
    • PR (fixied within the main objective PR #56)
  • Keeping OS interoperability via using the decoded PEM block, instead of raw PEM bytes:

The in-toto Go implementation is in direct relationship to the in-toto Python implementation and the in-toto specification. Therefore it is not surprising, that I have also addressed a few inconsistencies in the specification and the Python implementation.

  • Fixing over-importing in the in-toto Python implementation:
  • Fixing key word inconsistencies in the in-toto specification:
  • Fixing a wrong data type for the return-value in the in-toto specification:

Additional work

Additionally, the following issues have been uncovered during the development process:

  • key format inconsistency in securesystemslib: Issue #251
  • compare the way we store symlinks in our link metadata to the reference implementation: Issue #57
  • Hardware Security Module (HSM) support: Issue #61
  • Test interoperability with the Python implementation via subprocess calls: Issue #63
  • ecdsa curve sanity checks: Issue #65
  • validate functions, specifically key validations: Issue #68
  • In-toto record functionality: Issue #69
  • Do not share state in test functions: Issue #71
  • Use Go Linter for CI: Issue #74

These issues are already fixed or are on the verge of being fixed:

  • Moving our subSetCheck function to the utils.Set interface
  • Support for Go 1.15 (moving to ecdsa.SignASN1 + fixing testMain)
  • Add Logo + fix Readme (because who does not like fancy logos?!)
  • Add Github Actions support
    • Issue #62
    • PR (not merged yet) #72
  • Implement multi hash support
  • Gitignore like exclude patterns

My personal highlights

During the Google Summer of Code my personal highlights were finding and submitting a patch for a tiny bug in Go’s crypto/rsa library and attending the Kubecon 2020.

What did I learn?

During Google Summer of Code I had a lot of fun working on the CNCF project in-toto, but did I also learn something? The answer is clearly yes. Before Google Summer of Code I have contributed already to open source projects, but these contributions were mostly small bug fixes, reporting bugs or my very system and security focused work at Arch Linux. It has been a dream since long to contribute more than just a few lines of code to a project, but in the past I had difficulties to get into such a project. The Google Summer of Code was my first successful try to deep-dive into a foreign code base and to contribute more than just a few lines of code. This experience definitely increased my skills in reading foreign code, getting faster familiar with a foreign code-base and communicating with project developers. Furthermore, in-toto challenged my security skills and lead to a much wider understanding of signing algorithms such like ED25519, RSA-PSS or ECDSA and key formats such like PKCS1, PKCS8 or PEM. This project increased my security awareness in terms of cryptography significantly.

Plans for the future

Well, I think I totally fell in love with the project. Not only did I never join such a welcoming and interesting community, I also finally found a project that I think is important, interesting and challenging at the same time. Moreover I really think, that my future career goals will come one step closer with this project. The Go implementation and the near to the CNCF will definitely help me in increasing my Site Reliability Skills.

Special thanks

I do not want to finish, before honouring my four mentors Lukas PĆ¼hringer, Justin Cappos, Santiago Torres-Arias and Trishank Karthik Kuppusamy. They always reacted quickly when needed and they always gave me the right hints, when I had difficulties understanding the specification or the code base. Especially the work with Lukas was very enjoyable and I look forward to a lot more interesting discussions in future pull requests and issues. Santiago has been always there for me, when I did a few hours too much and slipped into a different timezone. The timezone difference was definitely a bonus and not a malus this time. Trishank was a person I could always count on. He supported me in many ways and gave me hints when I have encountered a problem. Furthermore we had a few interesting discussions around the in-toto specification, the securesystemslib and their relationship to TUF. I am pretty sure, this will be not the last discussions we had about the in-toto specification.


1064 Words

2020-08-31 00:03 +0000