Executor/Job: Generalize the use of artifacts
Right now, the only artifacts usable in CI-tron are a kernel, an initramfs, and containers (through the pull-through registries). This is however a little too limiting outside of x86 DUTs, where boot methods are more... diverse.
To give CI-tron users this flexibility without introducing layer violations, I think we should allow them to expose artifacts through multiple means, via the job description.
Here is an example of what it could look like:
```yaml
[...]
deployment:
  # Initial boot
  start:
    kernel:
      url: "{{ minio_url }}/test-kernel"
      cmdline:
        - b2c.container="docker://{{ pull_thru_registry }}/infra/machine-registration:latest check"
        - b2c.ntp_peer="ci-gateway" b2c.pipefail b2c.cache_device=auto
        - b2c.container="-v /container/tmp:/storage docker://10.42.0.1:8002/tests/mesa:12345"
        - console={{ local_tty_device }},115200 earlyprintk=vga,keep SALAD.machine_id={{ machine_id }}
    initramfs:
      url: "{{ minio_url }}/test-initramfs"
    storage:
      # ci-gateway:69 ?
      tftp:
        - path: "/job/config.txt"
          # Inline data editing
          data: |
            blabla
            blabla
            blabla
        - path: "/job/dtbs/blabla"
          url: "blabla"
      # http://ci-gateway:80 ?
      http:
        - path: "/config.txt"
          data: |
            blabla
            blabla
            blabla
        - path: "/job/dtbs/blabla"
          url: "blabla"
        - path: "/job/secrets"
          data: |
            MYSECRET_KEY: blabla
          lifetime:
            access_count: 1  # The resource will stop being available after N accesses (0 by default). This gets reset after every reboot
            first_read
      # The nbd server will likely have to be per job and boot, so we'll need a variable here
      nbd:
        - name: "block name"
          readonly: False
          url: https://path/to/my/disk.img
          # or define a filesystem that gets re-created for every job
          filesystem:
            type: ext4
            size: 20G
            data:
              url: https://path/to/my/tarball.tar.gz
              # or, extract a whole container
              container: "docker://alpine:latest"
          # or simply define a raw block of a wanted size
          raw:
            size: 20G
          reset_on: boot|job_start|never  # The disk will remain between jobs (not guaranteed)
          shared: False  # The disk will only be accessible to this DUT, and not be accessible to others
          expiration: 7 days  # The disk will be removed 7 days after the last job requested it
      s3:
        # TBD
      nfs:
        # TBD
```
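To make the intended semantics a bit more concrete, here is a minimal Python sketch of what a per-job HTTP artifact server could look like. None of this is an existing CI-tron API: the class names, the port, and the wiring at the bottom are assumptions. It only illustrates how inline `data` vs. `url` sources and the proposed `lifetime.access_count` (0 = unlimited, counters reset on every reboot) could behave.

```python
# Hypothetical per-job HTTP artifact server; names and port are made up.
import http.server
import urllib.request


class ArtifactEntry:
    def __init__(self, path, data=None, url=None, access_count=0):
        self.path = path
        self.data = data                  # inline payload, if provided
        self.url = url                    # upstream URL to fetch, if provided
        self.access_count = access_count  # 0 means "no limit"
        self.reads = 0

    def fetch(self):
        if self.access_count and self.reads >= self.access_count:
            return None  # resource no longer available for this boot
        self.reads += 1
        if self.data is not None:
            return self.data.encode()
        with urllib.request.urlopen(self.url) as r:
            return r.read()

    def reset(self):
        # Would be called on every reboot, as described in the job description
        self.reads = 0


class ArtifactHandler(http.server.BaseHTTPRequestHandler):
    artifacts = {}  # path -> ArtifactEntry, filled from the job description

    def do_GET(self):
        entry = self.artifacts.get(self.path)
        body = entry.fetch() if entry else None
        if body is None:
            self.send_error(404)
            return
        self.send_response(200)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)


if __name__ == "__main__":
    # Hypothetical wiring: entries would come from the parsed "storage.http" section
    ArtifactHandler.artifacts = {
        "/config.txt": ArtifactEntry("/config.txt", data="blabla\n"),
        "/job/secrets": ArtifactEntry("/job/secrets",
                                      data="MYSECRET_KEY: blabla\n",
                                      access_count=1),
    }
    http.server.HTTPServer(("", 8100), ArtifactHandler).serve_forever()
```

A server like this could run inside the job process, which is what makes per-job lifetimes and resets cheap to implement; the shared protocols discussed below are a different story.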
Here are some things to think about:
- How do we distinguish between user paths and fixed paths like the executor's own REST API?
- Which part of the executor is in charge of which protocol? Any server that is shared between jobs/DUTs (HTTP/TFTP) would have to be hosted by the executor server, while anything that can run on a per-job basis could be handled by the job process.
- How can the job process reference an artifact? AKA, how does a client reconstruct the final URL for the artifact / what host:port should they use? Should these be well-known, or should we have a variable that defines the base per protocol, like we do with minio (`{{http-artifact-base}}`, `{{tftp-artifact-base}}`, ...)? See the sketch after this list.
- How do we guarantee that jobs don't interfere with each other? I guess that would mean always treating every request coming from a DUT as being in its own namespace... except for blessed paths like the executor's machine registration API or the boot artifacts on minio.
- Do we allow users to specify the port on which they want to expose a particular server? Do we allow them to set TLS certificates, and such?
- In case a user decides to define a file that would otherwise be auto-generated by boots (`http://ci-gateway/boot/<machine_id>/boot.ipxe` or `tftp://ci-gateway/config.txt`), do we use their file or ours? If we use theirs, do we just ignore anything found in `kernel` and `initramfs`?
- What's the impact on job folders? Should we allow creating more buckets through the job description?
- For nbd/nfs/s3, figure out a way for the user to decide the data's lifetime (when to reset it): every boot, beginning of job, never? Do we allow changing them in the `continue` deployment?
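To illustrate one possible answer to the URL-reconstruction and namespacing questions above, here is a small Python sketch. It is only a thought experiment: the function names, the `/jobs/<job_id>/` prefix, and the list of blessed paths are all made up, and Python's stdlib `string.Template` (`$var`, underscores because `$`-style names cannot contain hyphens) stands in for whatever templating the job description already uses for `{{ minio_url }}`-style variables.

```python
# Hypothetical per-protocol base variables and per-job path namespacing.
from string import Template


def artifact_variables(gateway_host: str, job_id: str) -> dict:
    # Per-protocol base URLs, in the spirit of the existing {{ minio_url }}
    # variable: the executor decides the host:port, the job only sees a base.
    return {
        "http_artifact_base": f"http://{gateway_host}/jobs/{job_id}",
        "tftp_artifact_base": f"tftp://{gateway_host}/jobs/{job_id}",
    }


def render(line: str, variables: dict) -> str:
    # Stand-in for whatever templating the job description already uses
    return Template(line).substitute(variables)


def namespaced_path(job_id: str, path: str,
                    blessed=("/boot/", "/machine-registration")) -> str:
    # Every request coming from a DUT is rewritten into its job's namespace,
    # except for a small list of blessed, executor-owned paths.
    if any(path.startswith(prefix) for prefix in blessed):
        return path
    return f"/jobs/{job_id}{path}"


if __name__ == "__main__":
    variables = artifact_variables("ci-gateway", "job-1234")
    print(render("$http_artifact_base/config.txt", variables))  # http://ci-gateway/jobs/job-1234/config.txt
    print(namespaced_path("job-1234", "/job/dtbs/blabla"))       # /jobs/job-1234/job/dtbs/blabla
    print(namespaced_path("job-1234", "/boot/boot.ipxe"))        # /boot/boot.ipxe (blessed, untouched)
```

The design choice this encodes is that the executor, not the user, decides the final host:port; the job description only ever sees opaque base variables, and everything a DUT requests is rewritten into its job's namespace unless the path is blessed.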