This is not a full user-driven backport feature for now, just one
admins can use to facilitate things and debug issues with the
system. May eventually graduate to a frontend feature.
Fixes #925
- replace manual token generation by the actual `token_urlsafe`
- make the conditional right side up and more readable
- replace `match` by `fullmatch`; should not change anything since we
  end with a greedy universal match, but it is slightly more
  explicit (see the sketch below)
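A minimal illustration of the `fullmatch` point, assuming a command
pattern ending in a greedy universal match (the pattern shown here is
hypothetical, not the actual one):

```python
import re

# With a trailing greedy `.*`, match() and fullmatch() accept exactly
# the same single-line inputs; fullmatch() just makes the "consume the
# whole string" intent explicit.
PATTERN = r'delegate=(.*)'  # hypothetical command pattern

for text in ['delegate=alice,bob', 'delegate=']:
    assert bool(re.match(PATTERN, text)) == bool(re.fullmatch(PATTERN, text))
```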
Missed it during the previous pass, probably because it's in the
middle of `pull_requests.py`. It's a classic template for triggered
crons since the model is just a queue of actions for the cron.
The mergebot tests have always been pretty gentle on system load,
which is nice; however it's only when looking at the list of longest
tests that I realised / remembered the hook wait duration is 10
seconds for the benefit of github, which doesn't really matter
locally. This means on interaction / cron-heavy tests the test might
only be using on the order of 10% CPU, which is a waste of time.
TBF this is easily compensated by increasing the concurrency of the
test suite (e.g. from 16 to 32 when I switched machines), but it seems
just as sensible, if not more so, to lower the webhook wait delay to
something more reasonable. 1s seems to be a good fit here: on my new
computer at
n=16 it leads to the test suite running in 15mn at 600% CPU (which is
pretty good on a 6/12 CPU as it loads the system heavily but doesn't
completely bog it down).
Reducing it to 0.5s, the test suite takes the same duration but CPU
load increases to 770%, and errors creep up, likely a mix of
concurrency issues in the DB and dummy-central sending webhooks too
slowly as we compete with it for CPU resources (it could actually
make sense to restrict the number of threads tokio can use). Reducing
concurrency could make this work better, but I think at this point
we're in a pretty good state; it's even somewhat reasonable to run the
test suite sequentially (taking about 1h10 but being functionally
invisible in terms of load).
If a PR is closed but part of an ongoing batch, the change in status
of the batch might still be reflected on the PR:
- if a PR is closed and the batch gets staged, the PR shows up as
being staged
- if the PR is merged then the batch gets merged, the PR shows up as
merged
Fixes #914
Also remove the unused `_tagstate` helper property.
Because of the false negatives due to github's reordering of events
on retargeting, blocking merge methods can be rather frustrating for
the user, as what's happening and how to solve it isn't clear in that
case.
Keep the warnings and all, but remove the blocking factor: if a PR
doesn't have a merge method and is not single-commit, just skip it on
staging. This way, PRs which are actually single-commit will stage
fine even if the mergebot thinks they shouldn't be.
Fixes #957
Unstaged changes can be useful or necessary for some tasks, e.g. an
absolute emergency (where even faking the state of a staging is
not really desirable, if that's even possible anymore), or changes
which are so broad they're difficult to stage (e.g. t10s updates).
Add a new object which serves as a queue of patches to direct-apply,
with support for either text patches (udiff style, out of git show or
format-patch) or commits to cherry-pick. In the former case, the part
of the show / format-patch before the diff itself is used for the
commit metadata (author, committer, dates, message) whereas for the
commit version the commit itself is reused as-is.
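A rough sketch of the text-patch side; `split_patch` is a hypothetical
name, and the actual parsing of author / dates / message out of the
metadata half is not shown:

```python
# Everything before the first diff header is commit metadata (author,
# committer, dates, message, as produced by `git show` or
# `format-patch`); the rest is the patch to apply.
def split_patch(text: str) -> tuple[str, str]:
    metadata, sep, rest = text.partition('\ndiff --git ')
    if not sep:
        raise ValueError("expected a udiff-style patch")
    return metadata, 'diff --git ' + rest
```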
Applied patches are simply disabled for traceability.
Fixes #926
- fix staging reasons containing escaped quotes (would render as
` ` to the end user)
- remove extra spacing around PR title @title/popovers
- simplify a few view conditionals through judicious use of `t-elif`
and nesting
- make `staging_end` a regular field rather than a computed one, as
  it's not computed anymore, just set if and when the staging gets
  disabled
(146564a90a)
Given branch A, and branch B forked from it. If B removes a file which
a PR to A later modifies, on forward port Git generates a
modify/delete conflict (as in one side modifies a file which the
other deleted).
So far so good, except that while it does notify the caller of the
issue, the modified file is just dumped as-is into the working
copy (or commit), which essentially resurrects it.
This is an issue, *especially* as the file is already part of a
commit (rather than just a `U` local file): if the developer fixes the
conflict "in place" rather than re-doing the forward-port from
scratch, it's easy to miss the reincarnated file (and possibly the
changes that were part of it too), which at best leaves parasitic dead
code in the working copy. There is also no easy way for the runbot to
check it, as adding unimported standalone files, while rare, is not
unknown, e.g. utility scripts (to say nothing of JS where we can't
really track the usages / imports at all).
To resolve this issue, post-process modify/delete conflicts during
conflict generation to insert artificial conflict markers: the file
should then be syntactically invalid, so linters / checkers should
complain, and the minimal check has a step looking for conflict
markers which should fire and prevent merging the PR.
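A sketch of that post-processing, with hypothetical names; the gist is
that the surviving content gets wrapped in markers rather than dumped
bare:

```python
from pathlib import Path

# Wrap the kept side of a modify/delete conflict in artificial
# conflict markers: the file becomes syntactically invalid (linters
# complain) and the conflict-marker check refuses to merge the PR.
def mark_modify_delete(workdir: Path, filename: str, target_branch: str) -> None:
    path = workdir / filename
    modified = path.read_text()
    path.write_text(
        f"<<<<<<< MODIFIED (ours)\n"
        f"{modified}"
        f"=======\n"
        f">>>>>>> DELETED on {target_branch} (theirs)\n"
    )
```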
Fixes #896
Add warnings when trying to send comments / commands to PRs targeting
inactive branches.
This was missing, leading to confusion, as one warning is clearly not
enough.
Fixes #941
By updating the staging timeout every time we run `_compute_state`
and still have a `pending` status, we'd actually bump the timeout *on
every successful CI* except for the last one, which was never the
intention and can add an hour or two to the mergebot-side timeout.
Instead, add an `updated_at` attribute to statuses (name taken from
the webhook payload even though we don't use that), read it when we
see `pending` required statuses, and update the staging timeout based
on that if necessary.
That way as long as we don't get *new* pending statuses, the timeout
doesn't get updated.
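A sketch of the intended logic; `timeout_limit` is the field discussed
here, the rest of the names and the timeout value are illustrative
assumptions:

```python
from datetime import datetime, timedelta

STAGING_TIMEOUT = timedelta(hours=2)  # assumed value, for illustration

# Derive the timeout from the status's own updated_at: re-reading the
# same pending status computes the same limit, so nothing moves unless
# a *new* pending status arrives.
def bump_timeout(staging, pending_statuses: list[dict]) -> None:
    for status in pending_statuses:
        updated_at: datetime = status['updated_at']
        limit = updated_at + STAGING_TIMEOUT
        if limit > staging.timeout_limit:
            staging.timeout_limit = limit
```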
Fixes #952
- Switch to just the `default` CI.
- Autouse fixture to init the master branch.
- Switch to `make_commits`.
- Merge `test_reopen_update` and `test_update_closed_revalidate` into
`test_update_closed`: the former did basically nothing useful and
the latter could easily be folded into `test_update_closed` by just
validating the new commit.
And unconditionally unstage when the HEAD of a PR is synchronised.
While a rebuild on a PR which was previously staged can be a false
positive (e.g. because it hit a non-deterministic test the second time
around), it can also be legitimate, e.g. an auto-rebase of an old PR
to check it out. In that case the PR should be unstaged.
Furthermore, as soon as the PR gets rebuilt it goes back into
`approved` (because the status goes to pending and currently there's
no great way to suppress that in the rebuild case without also fucking
it up for the sync case). This would cause a sync on the PR to be
missed (in that it would not unstage the PR), which is broken. Fix
that by not checking the state of the PR beforehand; that check seems
to be an unnecessary remnant of older code, and not really an
optimisation (or at least one likely not worth bothering with,
especially as we then proceed to perform a full PR validation anyway).
Fixes #950
The UX around the split of limit and forward port policy (and
especially moving "don't forward port" to the policy) was not really
considered and caused confusion for both me and devs: after having
disabled forward porting, updating the limit would not restore it, but
there would be no indication of such an issue.
This caused odoo/enterprise#68916 to not be forward ported at merge
(despite looking for all the world like it should be), and while
updating the limit post-merge did force a forward-port, that
inconsistency was just as jarring (also not helped by being unable to
create an fw batch via the backend UI because reasons, see
10ca096d86).
Fix this along most axes:
- Notify the user and reset the fw policy if the limit is updated
  while `fw=no` (see the sketch after this list).
- Trigger a forward port if the fw policy is updated (from `no`) on a
merged PR, currently only sources.
- Add check & warning to limit update process so it does *not* force a
port (though maybe it should under the assumption that we're
updating the limit anyway? We'll see).
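A sketch of the first item; the field and helper names here are
hypothetical:

```python
# On limit update, a disabled fw policy silently swallowing the change
# is the confusing case: reset it and tell the user.
def update_limit(batch, new_limit, post_feedback):
    if batch.fw_policy == 'no':
        batch.fw_policy = 'default'
        post_feedback(
            "Forward-porting was disabled; updating the limit re-enabled "
            "it, otherwise the new limit would have had no effect."
        )
    batch.limit_id = new_limit
```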
Fixes #953
- rather than enumerate states, forward-porting should just check if
the statuses are successful on a PR
- for the same consistency reasons explained in
f97502e503, `skipchecks` should force
the status of a PR to `success`: it is very odd that a PR would be
ready without being validated...
Apparently when I added these to trigger the corresponding cron, the
lack of a decorator caused them not to work at all, at least when
invoked from the interface. Consequently I notably wasn't able to
force a forward port by creating one such task when trying to work
around #953.
Today (or really a month ago) I learned: when giving git a symbolic
ref (e.g. a ref name), if it's ambiguous then
1. If `$GIT_DIR/$name` exists, that is what you mean (this is usually
useful only for `HEAD`, `FETCH_HEAD`, `ORIG_HEAD`, `MERGE_HEAD`,
`REBASE_HEAD`, `REVERT_HEAD`, `CHERRY_PICK_HEAD`, `BISECT_HEAD` and
`AUTO_MERGE`)
2. otherwise, `refs/$name` if it exists
3. otherwise, `refs/tags/$name` if it exists
4. otherwise, `refs/heads/$name` if it exists
5. otherwise, `refs/remotes/$name` if it exists
6. otherwise, `refs/remotes/$name/HEAD` if it exists
This means if a tag and a branch have the same name and only the name
is provided (not the full ref), git will select the tag, which gets
very confusing for the mergebot as it now tries to rebase onto the
tag (which, because that's not fun otherwise, was not even on the
branch of the same name).
Fix by providing full refs to `rev-parse` when trying to retrieve the
head of the target branches. And as a defense in depth opportunity,
also exclude tags when fetching refs by spec: apparently fetching a
specific commit does not trigger the retrieval of tags, but any sort
of spec will see the tags come along for the ride even if they are
not in any way under the fetched ref, e.g. `refs/heads/*` will very
much retrieve the tags despite them being located at `refs/tags/*`.
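A sketch of the two defenses, under the assumption that the mergebot
shells out to git roughly like this (helper names hypothetical):

```python
import subprocess

# 1. Resolve branch heads through the full ref so a same-named tag
#    can never shadow the branch.
def branch_head(repo: str, branch: str) -> str:
    return subprocess.check_output(
        ['git', '-C', repo, 'rev-parse', f'refs/heads/{branch}'],
        text=True,
    ).strip()

# 2. When fetching by refspec, opt out of tag auto-following, which
#    would otherwise drag refs/tags/* along with refs/heads/*.
def fetch_heads(repo: str, remote: str) -> None:
    subprocess.run(
        ['git', '-C', repo, 'fetch', '--no-tags', remote,
         'refs/heads/*:refs/remotes/origin/*'],
        check=True,
    )
```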
Fixes #922
This is an approximation under the assumption that stored computes
update the `write_date`, and that there's not much else that will be
computed on a batch.
Eventually it might be a good idea for this to be a proper field,
computed alongside the unblocking of the batch.
Fixes #932
Rather than only setting `staging_end` on status change, set it when
the staging gets deactivated. This way cancelling a staging (whether
explicitly or via a PR update) will also end it, as will a staging
timeout, etc., rather than keeping the counter running.
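A sketch of the idea in Odoo terms (the model name is an assumption):

```python
from odoo import fields, models

class Staging(models.Model):
    _inherit = 'runbot_merge.stagings'  # hypothetical model name

    def write(self, vals):
        # whatever deactivates the staging (cancel, timeout, merge)
        # also stamps its end time, so the counter stops running
        if vals.get('active') is False and 'staging_end' not in vals:
            vals['staging_end'] = fields.Datetime.now()
        return super().write(vals)
```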
Fixes #931
Reminding users that `r-` on a forward port only unreviews *that*
forward port is useful, as `r+;r-` is not a no-op (all preceding
siblings are still reviewed).
However the warning is useless if the siblings are not approved or are
already merged. So avoid sending it in that case.
Fixes #934
The FW reminder is useful to remind people of the outstanding forward
ports they need to process, as taking too long can be an issue.
They are, however, not useful if the developer has already done
everything: the PR is ready and unblocked, but the mergebot has fallen
behind and has a hard time catching up. In that case there is nothing
for the developer to do, so pinging them is not productive; it's only
frustrating.
Fixes #930
In some cases, feedback to the PR author that an r+ is redundant went
missing.
This turns out to be due to the convolution of the handling of
approval on forward-port, and the fact that the target PR is treated
exactly like its ancestors: if the PR is already approved, the
approval is not even attempted (and so there is no feedback if it's
incorrect).
Straighten up this bit and add a special case for the PR being
commented on: it should have the usual feedback if in error or already
commented on.
Furthermore, update `PullRequests._pr_acl` to kinda work out of the
box for forward-port: if the current PR is a forward port,
`is_reviewer` should check delegation on all ancestors; there doesn't
seem to be any reason to split "source_reviewer", "parent_reviewer",
and "is_reviewer".
Fixes #939
Because the first operation the notification task performs is
updating the commit, this has to be flushed in order to read back the
commit hash (probably fixed in a later version).
This means that if updating the flag triggers a serialization
failure (which happens and is normal), we transition to the
`exception` handler, which *tries to retrieve the sha again*, and we
get an "in failed transaction" error on top of the current
"serialization failure", leading to a giant traceback. Also an
unhelpful one, since a serialization failure is expected in this cron.
- Move the reads with no write dependency out of the `try` (there's no
  reason for them to fail), and notably memoize the `sha`.
- Split the handling of serialization failures out of the normal one,
  and just log an `info` (we could actually log nothing, probably), as
  sketched below.
- Also set the precommit data just before the commit, in case staging
tracking is ever a thing which could use it.
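A sketch of the resulting shape, with the task body abstracted into
hypothetical callables:

```python
import logging
from psycopg2 import errors

_logger = logging.getLogger(__name__)

def notify(cr, commit, update_flag, send_notification, set_precommit_data):
    sha = commit.sha  # memoized outside the try: no write dependency,
                      # and reading it in an except block would hit the
                      # "in failed transaction" error
    try:
        update_flag(commit)        # the write that can conflict
        send_notification(sha)
        set_precommit_data(sha)    # just before commit, in case staging
                                   # tracking ever wants it
        cr.commit()
    except errors.SerializationFailure:
        cr.rollback()
        _logger.info("serialization failure notifying %s, will retry", sha)
    except Exception:
        cr.rollback()
        _logger.exception("failed to notify %s", sha)
```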
Fixes #909
Detailed statuses are useful in the actual PR dashboard, as that
allows direct access to the builds; however in the PR itself, where
it's only a picture, it's useless, so fold that information. Also fold
it when a PR is staged.
And while at it add a note / sub-title that the PR is staged.
Fixes #919
The weekly maintenance would not prune refs. This is not an issue on
odoo/odoo because development branches are in a separate repository,
thus never fetched (we push to them but only using local commits and
remote refs).
However on repos like odoo/documentation the reference and development
branches are collocated; the lack of pruning thus keeps every
development branch alive locally, even years after the branch has been
deleted in the repository.
By pruning remote-tracking refs before GC-ing, we should have cleaner
local clones, and better packing.
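In practice this amounts to something like the following sketch (exact
flags assumed):

```python
import subprocess

def weekly_maintenance(repo: str) -> None:
    # drop remote-tracking refs whose upstream branch is gone,
    # then gc so the now-unreachable objects can be packed away
    subprocess.run(['git', '-C', repo, 'remote', 'prune', 'origin'], check=True)
    subprocess.run(['git', '-C', repo, 'gc', '--prune=now'], check=True)
```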
`test_inconsistent_target` was, appropriately, inconsistent, but would
only fail a very small fraction of the time: the issue is that a PR
would switch target between `other` and `master`, assuming neither
was an intrinsic blocker, *but* the branches are created
independently, just with the same content.
This means that if a second ticked over between the creation of the
`master` branch's commit and that of `other`, they would get different
commit hashes (because of the different timestamps), thus the PR would
get 2 commits (or complete nonsense) when targeted to `other`, and the
PR itself would be blocked for lack of a merge method.
The solution is to be slightly less lazy, and create `other` from
`master` instead of copy/pasting the `make_commits` directive. This
means the PR has the exact same number of commits whether targeted to
`master` or `other`, and we now test what we want to test 60 seconds
out of every minute.
A few crons (e.g. database maintenance) remain on timers, but most of
the work crons should be migrated to triggers.
The purpose of this change is mostly to reduce log spam, as crons get
triggered very frequently (most of them every minute) but with little
to do. Secondary purposes are getting experience with cron triggers,
and better sussing out dependencies between actions and crons /
queues.
Fixes #797
Since every test runs on a fresh database, on the first `run_crons`
every single cron in the db will run even though almost none of them
is relevant.
Aside from the slight inefficiency, this creates unnecessary extra
garbage in the test logs.
By setting the `nextcall` of all crons to infinity in the template we
avoid this issue: only triggered crons (or the crons whose nextcall we
set ourselves) will run during such calls.
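A sketch of the template tweak, with a far-future date standing in for
"infinity":

```python
# In the template database setup: park every cron far in the future so
# only explicit triggers (or nextcalls we set ourselves) run in tests.
env['ir.cron'].search([]).write({
    'nextcall': '2100-01-01 00:00:00',
})
```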
This requires adjusting the branch cleanup cron slightly: it didn't
correctly handle the initial run (`lastcall` being false).
With the trigger-ification pretty much complete, the only cron that's
still routinely triggered explicitly is the cross-PR check, and that
is the case in all modules, so there's no cause to keep an overridable
fixture.
The staging cron turns out to be pretty reasonable to trigger, as we
already have a handler on the transition of a batch to `not blocked`,
which is exactly when we want to create a staging (that and the
completion of the previous staging).
The batch transition is in a compute, which is not awesome, but on
the flip side we also cancel active stagings in that exact
scenario (if it applies), so that matches.
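A sketch of hooking the trigger into that compute (model, helper, and
cron names are assumptions):

```python
from odoo import models

class Batch(models.Model):
    _inherit = 'runbot_merge.batch'  # hypothetical model name

    def _compute_blocked(self):
        for batch in self:
            previously_blocked = batch.blocked
            batch.blocked = batch._blocking_reason()  # hypothetical helper
            if previously_blocked and not batch.blocked:
                # a batch just became stageable: arm the staging cron
                self.env.ref('runbot_merge.staging_cron')._trigger()
```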
The only real finesse is that one of the tests wants to observe the
instant between the end of a staging (and creation of splits) and the
start of the next one. Because the staging cron is triggered by the
failure of the previous staging, that transition is now "atomic", so
the test has to disable the staging cron, which means the trigger is
skipped entirely, and the staging cron must be triggered by hand.
The merge cron is the one in charge of checking staging state and
either integrating the staging into the reference branch (if
successful) or cancelling the staging (if failed).
The most obvious trigger for the merge cron is a change in staging
state from the states computation (transition from pending to either
success or failure). Explicitly cancelling / failing a staging marks
it as inactive, so the merge cron isn't actually needed there.
However another major trigger is *timeout*, which doesn't have a
trivial signal. Instead, it needs to be hooked on the `timeout_limit`,
and has to be re-triggered at every update to the `timeout_limit`,
which in normal operations mostly comes from "pending" statuses
bumping the timeout limit. In that case, `_trigger` at the
`timeout_limit`, as that's where / when we expect a status change.
The worst case scenario with this is parasitic wakeups of this cron,
but having half a dozen unnecessary wakeups in an hour is still
probably better than having a wakeup every minute.
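A sketch of the timeout hook (model and cron xmlids are assumptions):

```python
from odoo import fields, models

class Staging(models.Model):
    _inherit = 'runbot_merge.stagings'  # hypothetical model name

    def write(self, vals):
        res = super().write(vals)
        if 'timeout_limit' in vals:
            # re-arm the merge cron at the new limit: if nothing else
            # wakes it first, it fires right as the staging times out
            self.env.ref('runbot_merge.merge_cron')._trigger(
                at=fields.Datetime.to_datetime(vals['timeout_limit']))
        return res
```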
These are pretty simple to convert: an item is added to a work
queue (table), then a cron regularly scans through the table,
executing the items and deleting them.
That means the cron trigger can just be added on `create` and things
should work out fine.
There are just two wrinkles in the port_forward cron (see the sketch
after this list):
- It can be requeued in the future, so it needs a conditional
  trigger in `write`.
- It is disabled during freeze (maybe something to change); as a
  result triggers don't enqueue at all, so we need to trigger
  immediately after the freeze re-enables the cron.
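A sketch of the queue pattern including the requeue wrinkle (model,
field, and cron names are assumptions):

```python
from odoo import api, fields, models

class PortForwardQueue(models.Model):
    _name = 'forwardport.queue'  # hypothetical queue model

    retry_after = fields.Datetime()

    @api.model_create_multi
    def create(self, vals_list):
        records = super().create(vals_list)
        # enqueueing work arms the cron immediately
        self.env.ref('forwardport.port_forward')._trigger()
        return records

    def write(self, vals):
        if vals.get('retry_after'):
            # requeued in the future: re-arm the cron at that date
            self.env.ref('forwardport.port_forward')._trigger(
                at=fields.Datetime.to_datetime(vals['retry_after']))
        return super().write(vals)
```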
Mergebot / forwardport crons need to run in a specific order to flow
into one another correctly. The default ordering being unspecified, it
was not possible to use the normal cron runner (instead of the
external driver running crons in sequence one at a time). This can be
fixed by setting *sequences* on crons, as the cron
runner (`_process_jobs`) will use that order to acquire and run crons.
However, also override `_process_jobs`: the built-in cron runner
fetches a static list of ready crons, then runs that.
This is fine for the normal situation where the cron runner runs in a
loop anyway, but it's an issue for the tests, as we expect that cron A
can trigger cron B, and we want cron B to run *right now* even if it
hadn't been triggered before cron A ran.
We can replace `_process_job` with a cut-down version which does
that (cut down because we don't need most of the error handling /
resilience: there are no concurrent workers, there's no module being
installed, versions must match, ...). This allows e.g. the cron
propagating commit statuses to trigger the staging cron, and both will
run within the same `run_crons` session.
Something I didn't touch is that `_process_jobs` internally creates
completely new environments, so there is no way to pass context into
the cron jobs anymore (whereas it works for `method_direct_trigger`);
this means the context values have to be shunted elsewhere for that
purpose, which is gross. But even though I'm replacing
`_process_jobs`, this seems a bit too much of a change in cron
execution semantics, so I left it out.
While at it tho, silence the spammy `py.warnings` stuff I can't do
much about.
- add context, in case that allows tracking overlap between tests
eventually
- use `-m` to run odoo; this requires backporting `odoo/__main__.py`,
  but I'm not quite sure how I was running it previously, possibly
  via odoo-bin? It certainly doesn't seem to work out with a global
  `odoo` helper
Merge errors are logical failures, not technical ones: it doesn't
make sense to log them out, because there's nothing to be done
technically, and a PR having consistency issues or a conflict is
"normal". As such those messages are completely useless and just take
unnecessary space in the logs, making their use more difficult.
Instead of sending them to logging, log staging attempts to the PR
itself, and only do normal logging of the operation as an indicative
item. And remove a bunch of `expect_log_errors` which no longer
apply.
While at it, fix a missed issue in forward porting: if the
`root.head` doesn't exist in the repo, its `fetch` will immediately
fail before `cat-file` can even run, so the second one is redundant
and the first one needs to be handled properly. Do that. And leave
checking for *that* specific condition as a logged-out error, as it
should mean there's something very odd with the repository (how can a
pull request have a head we can't fetch?).
Apparently I'd already fixed that in
286c1fdaee but it has yet to be
deployed.
While at it, add a feedback message to clarify that, unlike `r+`, `r-`
on forward ports does *not* propagate.
Fixes #912
The PR dashboard picture provides a great overview of the batch state
both horizontally and vertically, *but* apparently people can't for
the life of them go check the actual dashboard when things don't line
up. So expand the "current batch" to a view more similar to the
dashboard *page*, which gives details of the sub-checks being
performed and whether or not they are fulfilled.
Fixes #908
The previous version worked but was extremely plodding and
procedural. Initially I wanted to compose the table in a single pass,
but that turns out not to really be possible, as the goal for #908 is
to have a "drawer" for extended information about the current batch:
this means different cells of the same row can have different heights,
so we can't one-pass the image either vertically (later cells of the
current column might be wider) or horizontally (later cells of the
current row might be taller).
However what can be done is to give the entire thing *structure*,
essentially defining a very cut-down and ad hoc layout system before
committing the layout to raster.
This also deduplicates formatting and labelling information which was
previously present in both the first step (computation) and the second
step (rasterisation).