runbot

mirror of https://github.com/odoo/runbot.git synced 2025-03-16 07:55:45 +07:00

Author	SHA1	Message	Date
Xavier Morel	b109225f44	[IMP] runbot_merge: quality of feedback on erroneous commands - When a redundant approval is sent to a PR, notify but don't ignore the entire command set, there's no actual risk. - Indicate that the entire comment was ignored when finding something which does not parse. Fixes #892, fixes #893	2024-06-21 15:38:54 +02:00
Xavier Morel	7cd9afe7f2	[IMP] runbot_merge: trigger commits cron The commit cron needs to be triggered any time we: - create a new commit - update a commit to set its `to_check` So do that in create and write as well as the SQL query in the webhook handler. This should mean we don't need the periodic cron anymore, but for safety's sake run it on 30mn for now. TBF even if we miss triggers, the next `status` webhook hitting will check all the relevant commits anyway...	2024-06-21 11:02:50 +02:00
Xavier Morel	92e8eecbb5	[FIX] runbot_merge: ability to create PRs via the UI This is useful to repro issues. `60c4b5141d` added `inverse=readonly` hooks to various newly computed fields to ensure they can not be written to, either overwriting the content (stored) or silently being dropped (non-stored). However because they're `inverse` hooks this had the effect of making them writeable from the backend UI since the ORM uses `inverse` as a signal to make the field writeable. This then caused the web client to send stuff for those fields, which are not necessarily even visible in the form, leading to write errors when trying to save a PR creation. By marking the fields as `readonly` explicitly we make sure that doesn't happen, and we can create PRs from the backend UI (kinda, I think the label is still an issue).	2024-06-21 10:42:37 +02:00
Xavier Morel	906505ed15	[IMP] runbot_merge: filter on the base attribute not computed Should not actually do anything relevant, but seems like a good idea.	2024-06-21 10:42:08 +02:00
Xavier Morel	3410f50248	[FIX] runbot_merge: `Commit.create` The method was not marked as a create, following which it did not allow creating commits via the UI (annoying for testing / reproducing issues involving statuses).	2024-06-21 10:41:01 +02:00
Xavier Morel	737cbd5de2	[IMP] *: merge fw overrides into their parent Not actually useful in any way, but it does remove a few lines, avoids a few dupe writes, and furthers the cause of #789	2024-06-21 10:40:06 +02:00
Xavier Morel	f303674434	[FIX] *: re-enable notification on status failure If a PR gets approved *then* fails CI, there should be a notification warning the author & reviewer since `48e08b657b`, it even has a test, which passes (in fact it has two, one of which is redundant, so merge `test_ci_failure_after_review` into the later `test_ci_approved`). However this is in runbot_merge, turns out in `fafa7ef437` some refactoring was done in order to override the notification and customise it for forward ports with a failed status... except that override never called its `super()`, so as soon as forwardport is installed the base notification stops working, and that's been that since October 2019 (had been added in March that year, ignoring deployment lag). This can be revealed by adding the corresponding check in the forwardport tests, revealing the failure. This was a pain to track down, thankfully it reproduced relatively easily locally. While this could be resolved in the override, might as well fold it into the base method in furtherance of #789: the mergebot is only used by odoo, and only with both modules combined, so splitting them is not useful. And furthermore things should work fine with the forwardport installed but unused. Fixes #894	2024-06-21 10:27:01 +02:00
Xavier Morel	4a521e1251	[IMP] runbot_merge: hide backend links from group_user The backend links in the PR dashboard were gated behind the `group_user` (internal user) group, however turns out while internal users have read access to PRs they don't have access to ancillary objects (e.g. batches, stagings, the link between stagings and batches), and I think the only way to fix the issue would be to move it to an optional inheritance (inheritance + group), because `groups` on view nodes only hides the content without removing it. I believe in more recent Odoo versions this actually works correctly, so that might actually be more of an incentive to upgrade...	2024-06-20 14:21:40 +02:00
Xavier Morel	20d259aa77	[IMP] runbot_merge: always display PR title Previous version would always hide the title if the PR was blocked (e.g. blocked or failed), turns out there are people who actually use the PR title on the main dashboard, so suppressing that is inconvenient for them. Try to show the PR title if available, and add the blocked message if present.	2024-06-20 13:49:17 +02:00
Xavier Morel	728524db12	[IMP] runbot_merge: send merge method warning faster, and on review - Instead of warning about the merge method on ready PRs, also warn on approved (but exclude staged just cuz), as that's really when the user wants to know that they forgot to set the merge method - The cron only triggers hourly, but if a user approves a PR and the merge method is not set yet, chances are good they'll need a reminder (if they `r+ rebase-merge` or w/e the cron will just ignore the PR and it's no skin off our back), so `_trigger` the cron for validation. - Also do the same when skipchecks is set as it's very similar. In reality we might want to hook off of the state transitioning to reviewed but I'm not sure there's good ways to do that (triggering a cron inside a compute doesn't seem like a good idea). Update a pair of tests which would approve a multi-commit PR without setting a merge method, just because the helper they use to build the PR happens to create multiple commits. Fix #891	2024-06-13 13:36:34 +02:00
Xavier Morel	9d9cae1d57	[FIX] runbot_merge: access to self in loop This is a low issue as the prs of a commit are only listed from the form so the compute is pretty much always called with a single record, but still an unforced error which can easily be fixed.	2024-06-13 09:35:29 +02:00
Xavier Morel	2662411b96	[FIX] runbot_merge: `_schedule_fp_followup` not handling multiple batches `_schedule_fp_followup` correctly iterates on `self`, however some of the per-iteration work did not handle that correctly, and would try to access fields on `self`. Thankfully in most cases it only works on one batch at a time anyway, however if multiple PRs share a HEAD (which is weird but...) then `_validate` is called on multiple PRs, which through the forwardport override leads to `_schedule_fp_followup` being called on multiple batches, and failing when trying to access the `fw_policy`. Fix by avoiding the misuse of `self` in the two locations where it's doing something other than accessing `env`.	2024-06-13 08:04:12 +02:00
Xavier Morel	7711d09854	[IMP] *: add fw=no, deprecate ignore Without fw-bot being its bearer, "ignore" is a lot less clear than it used to as it looks to be asking to ignore the PR entirely (as if it was targeted to an unmanaged branch). Deprecate this command, and tack on the shortcut to the fw subcommand. It is slightly sub-par as technically it does not quite fit with the other subcommands, and furthermore can't be disabled via fw=default... although maybe it could be? Maybe instead of setting the limit fw=no could set that value to the forwardport mode, and the fw_policy users could check that? It would require some more finessing tho: - `DEFAULT` would need to be accessible to the author as well as the reviewers so the author could toggle between `NO` and `DEFAULT`. - There should probably be a warning of some sort when setting a limit to an unportable PR. - The dashboards would need to take `NO` in account (though I guess that's just defaulting the limit to the target).	2024-06-12 16:08:25 +02:00
Xavier Morel	413027ad5b	[IMP] runbot_merge: formatting & language of PR attributes Replace the unclear "unchecked" and "unreviewed" by "missing statuses" and "missing r+", which are hopefully clearer as they better match other lingo. Also increase font for attributes, as size 10 was a bit small. And finally add staging state to caching key, to differentiate "ready" from "staged" pictures in gh's cache. "ready" should not be necessary as it ought to be implied by the label.	2024-06-12 15:51:17 +02:00
Xavier Morel	a2d7180216	[IMP] runbot_merge: move limit to fwport tab And filter it to only consider branches in the same project as the PR, and a lower sequence than its target. That way it's harder to fuck up when trying to set limits from the backend.	2024-06-12 15:34:39 +02:00
Xavier Morel	d010f0374a	[FIX] *: dashboard when PRs have different limits The code selecting the lower and upper bounds for the PR dashboard did not deal correctly with getting multiple limits in the same genealogy.	2024-06-12 15:09:47 +02:00
Xavier Morel	d2e730c39b	[IMP] runbot_merge: log ACL error in PR controller Currently this just silently returns a 404. Since repos are gated by default (only accessible to internal users) this can get very confusing when trying to setup a new repo or when forgetting this information when writing tests.	2024-06-12 15:09:42 +02:00
Xavier Morel	2ab06ca96b	[IMP] *: require explicitly specifying whether exceptions in logs are valid Seems like a good idea to better keep track of the log of an Odoo used for testing, and avoid silently ignoring logged errors. - intercept odoo's stderr via a pipe, that way we can still write it back out and pytest is able to read & buffer it, pytest's capfd would not work correctly: it breaks output capturing (and printing on failure); and because of the way it hooks in it's unable to capture from subprocesses inheriting the standard stream, cf pytest-dev/pytest#4428 - update the env fixture to check that the odoo log doesn't have any exception on failure - make that check conditional on the `expect_log_errors` marker, this way we can mark tests for which we expect errors to be logged, and assert that that does happen	2024-06-12 15:09:42 +02:00
Xavier Morel	60c4b5141d	[FIX] runbot_merge: leftover direct setting of PR state Setting the PR state directly really doesn't work as it doesn't correctly save (and can get overwritten by any dependency of which there are many). This caused setting odoo/odoo#165777 in error to fail, leading to it being re-staged (and failing) repeatedly, and the PR being spammed with comments. - create a more formal helper for preventing directly setting computed functions (without an actual inverse) - replace direct state setting by setting the corresponding dependency e.g. `error` for error and `skipchecks` to force a PR to ready - add a `skipchecks` inverse to the PR so it can also set itself as reviewed, and is convenient, might be worth also adding stuff to `Batch.write`	2024-06-11 15:41:20 +02:00
Xavier Morel	e320de0439	[FIX] runbot_merge: handle gh comments ending with newlines Regex `$` apparently does not quite strip that out.	2024-06-11 15:24:09 +02:00
Xavier Morel	187f7f6429	[CHG] runbot_merge: allow pr author to approve all fw - trigger FW section on all forward ports, not just attached ones - allow author of original PR to approve any fwport	2024-06-10 15:21:24 +02:00
Xavier Morel	e403593799	[FIX] runbot_merge: incorrect computation dependencies `Batch.staging_ids` is a computed field, it can't be used as a dependency for an other compute (at least not in 15.0).	2024-06-10 14:31:02 +02:00
Xavier Morel	14a2b0068d	[FIX] runbot_merge: type error in conflict handling	2024-06-10 14:29:55 +02:00
Xavier Morel	4af515b20d	[IMP] runbot_merge: stagings button wrapping Because one of the previous commits adds the duration of the staging to the staging dropdown toggles, it's now much longer, and by default the text does not wrap so it looks like shit and goes completely out the column "CSS is awesome" style. Update the style of the dropdown toggles specifically to allow text wrapping. Also align them left instead of centering, because the text makes a centered layout super ugly.	2024-06-07 17:07:10 +02:00
Xavier Morel	2fb85c515e	[ADD] runbot_merge: missing staging migration `44084e303c` changed the interpretation and schema of the `statuses_cache` field on stagings, but I forgot to add a migration, so it would just blow up on opening the home dashboard or the staging lists.	2024-06-07 17:05:58 +02:00
Xavier Morel	f4035932e3	[IMP] runbot_merge: add staging status to dashboard The dashboard can be a bit unclear as to the state of a PR when everything's gone well. Make it more clear / explicit that it's ready or staged. Fixes #888	2024-06-07 15:51:26 +02:00
Xavier Morel	fec3d39d19	[ADD] *: per-repository webhook secret Currently webhook secrets are configured *per project* which is an issue both because different repositories may have different administrators and thus creates safety concerns, and because multiple repositories can feed into different projects (e.g. on mergebot, odoo-dev/odoo is both an ancillary repository to the main RD project, and the main repository to the minor / legacy master-wowl project). This means it can be necessary to have multiple projects share the same secret as well, this then mandates the secret for more repositories per (1). This is a pain in the ass, so just detach secrets from projects and link them only to repositories, it's cleaner and easier to manage and set up progressively. This requires a lot of changes to the tests, as they all need to correctly configure the signaling. For `runbot_merge` there was some setup sharing already via the module-level `repo` fixtures, those were merged into a conftest-level fixture which could handle the signaling setup. A few tests which unnecessarily set up repositories ad-hoc were also moved to the fixture. But for most of the ad-hoc setup in `runbot_merge`, as well as `forwardport` where it's all ad-hoc, events sources setup was just appended as is. This should probably be cleaned up at one point, with the various requirements collected and organised into a small set of fixtures doing the job more uniformly. Fixes #887	2024-06-06 11:07:57 +02:00
Xavier Morel	c1e2e5a2e0	[REF] forwardport: update re_matches to not use a regex Using a regex as the pattern is quite frustrating due to all the escaping necessary, which in this refactoring I found out I'd missed, multiple times. Convert the pattern to something bespoke but not too complicated, we may want to add anchoring support and a bit more finesse in the future but for now straightforward "holes" seem to work well. I've added support for capturing and even named groups even if this is as yet unnecessary and unused. Fixes #861 [^1]: https://docs.pytest.org/en/stable/reference.html#pytest.hookspec.pytest_assertrepr_compare	2024-06-04 14:18:04 +02:00
Xavier Morel	98aaa9107f	[CHG] forwardport: notify the outstanding forwardports rather than source I have been convinced that this might be an improvement to the affairs of the people: originally the message was sent to the source PR so we wouldn't have to ping the author & reviewer and to limit the amount of spam, however: - we ended up adding pings anyway - it also pings the followers of the source PR - it increases the size of the original discussion (especially if it was originally long) - it adds steps to fixing the issue as you need to bounce from the source to the forward ports Note that this might still notify a lot of people as they might be made followers of the forward ports automatically, and it increases the messaging load of the forwardbot significantly. But we'll see how things go. Worst case scenario, we can revert it back. Fixes #836	2024-06-04 08:56:51 +02:00
Xavier Morel	9c51f87aed	[ADD] runbot_merge: support for non-webhook staging validation Add support for the ability to validate stagings over RPC rather than via webhook. This may later be expanded to PRs as well. The core motivation for this is to avoid bouncing through github which sometimes drops the ball on statuses, and it's frustrating to have a staging time out because GH fucked up. Implemented via RPC, requiring both the staging itself (by id) and the head commit being affected, as that is necessary to know what CIs are required for that head and correctly report cross branch on the various PRs. Fix #881 (kinda)	2024-06-04 08:56:51 +02:00
Xavier Morel	44084e303c	[REF] runbot_merge: compute staging state Rather than compute staging state directly from commit statuses, copy statuses into the staging's `statuses_cache` then compute state based on that. Also refactor `statuses` and `staging_end` to be computed based on the statuses cache instead of the commits (or `state` update). The goal is to allow non-webhook validation of stagings, for direct communications between the mergebot and the CI (mostly runbot).	2024-05-31 12:33:13 +02:00
Xavier Morel	68cfeddaed	[ADD] runbot_merge: display required statuses after merge Github makes it painfully difficult to access the statuses (especially their URL / related build) once a PR has been merged, as it's necessary to find the last non-staging commit mention / update in order to find its statuses checkbox thingie, open that, and access the statuses. The mergebot has all the links, so it can just display them in the merged mode as well rather than only display them in open mode. That way even on a merged PR the statuses are just two clicks away. Fixes #873	2024-05-30 15:28:25 +02:00
Xavier Morel	67f1c1e288	[IMP] runbot_merge: add staging duration Computed on the fly for now. Formatted nicely in the frontend, there does not seem to be any sort of duration widget in the backend so just display the integer number of seconds. Fixes #865	2024-05-30 15:11:38 +02:00
Xavier Morel	3f4519d605	[CHG] runbot_merge: add signoff & related to all commits if rebased. Untouched commits (straight merge) remain unaltered, but all rebased or squashed commits now get signoff and `Related` headers added on top of the already previously added `part-of`. Implement by generalising `_build_merge_message` to `_build_message` and having `add_self_references` delegate to it, removes some of the redundancy / differential handling. Update the `part_of` helper to also add the S-O-B header to the PR, although it currently does not reference the entire forward port chain. Fixes #876	2024-05-30 10:59:07 +02:00
Xavier Morel	3c3100adfe	[IMP] runbot_merge: cleanup PR backend Shove a bunch of stuff in notebook tabs, add a few affordances (e.g. github and frontend links, links from m2m), surface a few missing fields. Hopefully makes the backend form both easier to navigate and easier to administrate from.	2024-05-29 07:55:07 +02:00
Xavier Morel	232aa271b0	[ADD] runbot_merge: PR dashboard V2 Displays the entire batch set as a table, along both repository (linked PRs) and branch (forward ports). Should provide a much more complete overview. Adds a copy of the dashboard as a raster render, to link from the PR: as usual SVG is shit, content-based viewboxes are hell and having to duplicate the entire CSS because `<img/>`-linked CSS can't run is gross. And there's no payoff since the image is not interactible anyway. Performing manual ad-hoc table rendering via pillow is not significantly worse, it works fine and it's possible to do really good conditional request handling (hopefully) because I've basically got all the information I need right here. In fact it might make sense to upgrade the regular HTML page with similar conditional request handling, at least for the last-update bit if not the etag. Fixes #771,fixes #770	2024-05-29 07:55:07 +02:00
Xavier Morel	3191c44459	[ADD] runbot_merge: synthetic batches & stagings to freeze wizard Merged PRs should have a batch which should have a staging, this makes the treatment uniform across the board and avoids funky data which is hard to place or issues when reconstructing history. Also create synthetic batches & stagings for older freezes (and bumps)	2024-05-29 07:55:07 +02:00
Xavier Morel	bbce5f8f46	[IMP] *: don't remove PRs from batches on close Initially wanted to skip this only for FW PRs, but after some thinking I feel this info could still be valuable even for non-fw PRs which were never merged in the first place. Requires a few adjustments to not break everything: `batch.prs` excludes closed PRs by default as most processes only expect to be faced by a closed PR inside a batch, and we *especially* want to avoid that before the batch is merged (as we'd risk staging a closed PR). However since PRs don't get removed from batches anymore (and batches don't get deleted when they have no PRs) we now may have a bunch of batches whose PRs (usually a single one) are all closed, this has two major side-effects: - a new PR may get attached to an old batch full of closed PRs (as batches are filtered out on being merged), which is weird - the eventual list of batches gets polluted with a bunch of irrelevant batches which are hard to filter out The solution is to reintroduce an `active` field, as a stored compute field based on the state of batch PRs. This way if all PRs of a batch are closed it switches to inactive, and is automatically filtered out by search which solves both issues.	2024-05-29 07:55:07 +02:00
Xavier Morel	0e0348e4df	[IMP] runbot_merge: preserve batch ordering in stagings Batch ordering in stagings is important in order to correctly reconstitute the full project history. In the old mergebot, since batches are created on the fly during staging this information is reified by the batch ids. But since batch ids are now persistent and there is no relationship between the creation of a batch and its merging (especially not relative to other batches) it's an issue as reconstituting sub-staging git history would be impossible. Which is not the worst, but is not great. The solution is to reify the join table between stagings and batches in order for that to keep history (simply via the sequential PK), and in converting to the new system carefully generate the new links in an order matching the old batch ids.	2024-05-29 07:55:07 +02:00
Xavier Morel	e7e81bf375	[IMP] *: handle the addition of a new PR to a fw-ported batch Given a batch which has been merged, and been forward-ported, to multiple branches (because skipci was set or ci passed on the repos the batch covers). There might come the need to add a PR for one of the uncovered repos. This raises the question of what to do with it, since the forward-ports for the batch already exist it's not going to get forwardported normally, nor may we want to, possibly? Options are: - don't do anything, such additions don't get ported, this is incongruous and unexpected as by default PRs are forward-ported, and if the batch wasn't an intermediate (but e.g. a conflict) it probably would be ported forward - port on merge, this allows configuring the PR properly (as it might need its own limit) but it means further batches may get unexpectedly merged (or at least retied) without the additional PR even though we likely want it in - immediately port the additional PR on creation, this makes the limit harder or impossible to configure but it makes the batch *sequence* more consistent We ended up selecting the latter, it feels closer to the updates system, and it creates more consistent batches through the sequence. It's also technically easier to ad-hoc port a PR through a bunch of branches than it is to update the "normal" forward-port process to handle partial fixups.	2024-05-29 07:55:07 +02:00
Xavier Morel	1e9fa48652	[ADD] runbot_merge: migration of models refactoring This is definitely non-trivial, due to the structural changes and the amounts of stuff to move around (e.g. lift from PR to batch), as well as the reification of previously non-existent relations (batches, batch history, ...) which sometimes uncovers inconsistencies in the current state of the mergebot (some of which are the result of bugs, the bug got fixed but the nonsense it generated was left untouched).	2024-05-29 07:55:02 +02:00
Xavier Morel	94fe0329b4	[FIX] *: behaviour around branch deactivation & fw maintenance Test and refine the handling of batch forward ports around branch deactivation, especially with differential. Notably, fix an error in the conversion of the FW process to batches: individual PR limit was not correctly taken into account during forward port unless *all* PRs were done, even though that is a primary motivation for the change. Partial forward porting should now work correctly, and the detection and handling of differential next target should be better handled to boot. Significantly rework the interplay between batches and PRs being closed in order to maintain sequencing / consistency of forward port sequences: previously a batch would get deleted if all its PRs are closed, but that is an issue when it is part of a forward port sequence as we now lose information. Instead, detach the PRs from the batch as before but have the batch skip unlinking if it has historical value (parent or child batch). Currently the batch's state is a bit weird as it doesn't get merged, but... While at it, significantly simplify `_try_closing` as it turns out to have a ton of incidental / historical complexity from old attempts at fixing concurrency issues, which should not be necessary anymore and in fact actively interfere with the new and more compute-heavy state of things.	2024-05-24 09:08:56 +02:00
Xavier Morel	a4a067e7e9	[CHG] *: move forward-porting over to batches Thank god I have a bunch of tests because once again I forgot / missed a bunch of edge cases in doing the conversion, which the tests caught (sadly that means I almost certainly broke a few untested edge cases). Important notes: Handling of parent links ------------------------ Unlike PRs, batches don't lose their parent info ever, the link is permanent, which is convenient to trawl through a forward port (currently implemented very inefficiently, maybe we'll optimise that in the future). However this means the batch having a parent and the batch's PRs having parents are slightly different informations, one of the edge cases I missed is that of conflicting PRs, which are deparented and have to be merged by hand before being forward ported further, I had originally replaced the checks on a pr and its sibling having parents by just the batch. Batches & targets ----------------- Batches were originally concepted as being fixed to a target and PRs having that target, a PR being retargeted would move it from one batch to an other. As it turns out this does not work in the case where people retarget forward-port PRs, which I know they do because #551 (`2337bd8518`). I could not think of a good way to handle this issue as is, so scrapped the moving PRs thing, instead one of the coherence checks of a batch being ready is that all its PRs have the same target, and a batch only has a target if all its PRs have the same target. It's possible for somewhat odd effects to arise, notably if a PR is closed (removed from batch), the other PRs are retargeted, and the new PR is reopened, it will now be on a separate batch even if it also gets retargeted. This is weird. I don't quite know how I should handle it, maybe batches could merge if they have the same target and label? however batches don't currently have a label so... 
Improve limits -------------- Keep limits on the PRs rather than lift them on the batch: if we can add/remove PRs of batches, having different limits on different PRs of the same batch is reasonable. Also leave limit unset by default: previously, the limit was eagerly set to the tip (accessible) branch. That doesn't really seem necessary, so stop doing that. Also remove completely unnecessary `max` when trying to find a PR's next target: `root` is either `self` or `self.source_id`, so it should not be possible for that to have a later target. And for now ensure the limits are consistent per batch: a PR defaults to the limit of their batch-mate if they don't have one, and if a limit is set via command it's set on all PRs of a batch. This commit does not allow differential limits via commands, they are allowed via the backend but not really tested. The issue is mostly that it's not clear what the UX should look like to have clear and not super error prone interactions. So punt on it for now, and hopefully there's no hole I missed which will create inconsistent batches.	2024-05-24 09:08:56 +02:00
Xavier Morel	dae046708f	[IMP] runbot_merge: make batch blocked message more precise In case of PRs not being ready, don't just say the PRs are waiting for CI even though they might be unreviewed, and make a difference between waiting for CI (pending) and having failed CI.	2024-05-24 09:08:56 +02:00
Xavier Morel	f97502e503	[IMP] runbot_merge: make skipchecks impact PR state It's a bit weird and inconsistent to have a PR being staged while unreviewed or unapproved or w/e. If we compute the state based on skipchecks then everything is consistent. Also remove the implicit override of all statuses when explicitly marking the pr as `ready`, it risks creating difficult to understand states, and it's unnecessary since `skipchecks` gets set. Also as with setting skipchecks, sets the current user as reviewer on all PRs of the batch without a reviewer.	2024-05-24 09:08:56 +02:00
Xavier Morel	fa2bba3cb9	[CHG] runbot_merge: don't reset cancel_staging on r- Also send skipchecks removal to the PR being r-'d, as sending it to a random PR of the batch doesn't really make sense?	2024-05-24 09:08:56 +02:00
Xavier Morel	c66451a8c7	[IMP] runbot_merge: cleanup/modernize test_multirepo.py - remove the `legal/cla` and `ci/runbot` context names, which I use a lot for historical reasons but fundamentally they're not useful to the tests, the `default` context is generally simpler. - remove `make_branch` helper as we don't actually use branch protection and at the end of the day it doesn't do much else - convert a few explicit PR lookups to the project-wide `to_pr` helper	2024-05-23 07:58:58 +02:00
Xavier Morel	a6a37f8896	[FIX] runbot_merge: handling of staging cancellation Move staging cancellation to the batch, remove its (complicated) handling from the PRs. This loses some precision in the cancellation message, but that could likely be recovered (in part) by adding more precise checks & diagnostic extractions in the compute.	2024-05-23 07:58:58 +02:00
Xavier Morel	ad1d590d9c	[IMP] runbot_merge: fix dual merge of split prioritised PRs Because `alone` (formerly p != 2) is selected before split PRs, if a prioritised PR gets split (or a split PR gets prioritised) it will be staged once as prioritised, and again because split. Improve the selection of ready batches to exclude split batches upstream, such that they don't have to be rechecked over and over, and their priorities don't cause us issues.	2024-05-23 07:58:58 +02:00
Xavier Morel	83511f45e2	[CHG] runbot_merge: move priority field from PR to batch Simplifies the `ready_prs` query a bit and allows it to be converted to an ORM search, by moving the priority check outside. This also allows the caller to not need to post-process the records list anywhere near the previous state of affairs. `ready_prs` now returns either the "alone" batches, or the non-alone batches, rather than mixing both into a single sequence. This requires correctly applying the search filters to not retrieve priority of batches in error or targeting other branches.	2024-05-23 07:58:58 +02:00
Xavier Morel	ef6a002ea7	[CHG] runbot_merge: move staging readiness to batch Staging readiness is a batch-level concern, and many of the markers are already there though a few need to be aggregated from the PRs. As such, staging has no reason to be performed in terms of PRs anymore, it should be performed via batches directly. There is a bit of a mess in order not to completely fuck up when retargeting PRs (implicitly via freeze wizard, or explicitly) as for now we're moving PRs between batches in order to keep the batches mostly target-bound. Some of the side-effects in managing the coherence of the targeting and moving PRs between batches is... not great. This might need to be revisited and cleaned up with those scenarios better considered.	2024-05-23 07:58:58 +02:00
Xavier Morel	9ddf017768	[CHG] *: move fw_policy from PR to batch	2024-05-23 07:58:58 +02:00
Xavier Morel	21b5dd439b	[CHG] runbot_merge: move merge_date to batch, remove active - `merge_date` should be common to an entire batch, so move it there - remove `Batch.active` which should probably have been removed when batches were made persistent (can eventually re-add as a proxy for `merge_date` being set maybe, but for now removing it seems a better way to catch mistakes) - update various sites to use `Batch.merge_date` instead of `Batch.active`	2024-05-23 07:58:58 +02:00
Xavier Morel	e910b8e857	[IMP] runbot_merge: move cross-pr properties to batch	2024-05-23 07:58:58 +02:00
Xavier Morel	473f89f87d	[CHG] *: persistent batches This probably has latent bugs, and is only the start of the road to v2 (#789): PR batches are now created up-front (alongside the PR), with PRs attached and detached as needed, hopefully such that things are not broken (tests pass but...), this required a fair number of adjustments to code not taking batches into account, or creating batches on the fly. `PullRequests.blocked` has also been updated to rely on the batch to get its batch-mates, such that it can now be a stored field with the right dependencies. The next step is to better leverage this change: - move cross-PR state up to the batch (e.g. skipchecks, priority, ...) - add fw info to the batch, perform forward-ports batchwise in order to avoid redundant batch-selection work, and allow altering batches during fw (e.g. adding or removing PRs) - use batches to select stagings - maybe expose staging history of a batch?	2024-05-23 07:58:58 +02:00
Xavier Morel	c140701975	[ADD] runbot_merge: support staging ready PRs over splits Not sure it's going to be useful but it's hard to know if we can't test it. The intent is mostly the ability to prioritize throughput (or attempt to) during high-load events, if we can favour staging N new batches over a split's N/2 we might be able to merge more crap. But maybe not, we'll see, either way now it's here and seems to more or less work. Fixes #798	2024-05-23 07:58:58 +02:00
Xavier Morel	9f54e6f209	[ADD] runbot_merge: option to disable staging without cron Because the mergebot crons are on such a tight scheduling, and just them finding out they have nothing to do can take a while, disabling them can be a chore. Disabling staging via the project is much less likely to cause issues as the projects don't normally (or ever?) get exclusively locked, so they can generally be written to at any moment. Furthermore, if we ever get in a situation where we have multiple active projects (not really the case currently, we have multiple projects but only one is really active) it's less disruptive to disable stagings on a single specific project. Fixes #860	2024-05-23 07:58:58 +02:00
Xavier Morel	d4fa1fd353	[CHG] *: rewrite commands set, rework status management This commit revisits the commands set in order to make it more regular, and limit inconsistent command-sets, although it includes pseudo-command aliases for common tasks now removed from the core set. Hard Errors =========== The previous iteration of the commands set would ignore any non-command term in a command line. This has been changed to a hard error (ignoring the entire thing) if any command is unknown or invalid. This fixes inconsistent / unexpected interpretations where a user sends a command, then writes a novel on the same line some words of which happen to also be commands, leading to merge states they did not expect. They should now be told to fuck off. Priority Restructuring ---------------------- The numerical priority system was pretty messy in that it confused "staging priority" (in ways which were not entirely straightforward) with overrides to other concerns. This has now been split along all the axes, with separate command subsets for: - staging prioritisation, now separated between `default`, `priority`, and `alone`, - `default` means PRs are picked by an unspecified order when creating a staging, if nothing better is available - `priority` means PRs are picked first when staging, however if `priority` PRs don't fill the staging the rest will be filled with `default`, this mode did not previously exist - `alone` means the PRs are picked first, before splits, and only `alone` PRs can be part of the staging (which usually matches the modename) - `skipchecks` overrides both statuses and approval checks, for the batch, something previously implied in `p=0`, but now independent. Setting `skipchecks` basically makes the entire batch `ready`. For consistency this also sets the reviewer implicitly: since skipchecks overrides both statuses and approval, whoever enables this mode is essentially the reviewer. 
- `cancel` cancels any ongoing staging when the marked PR becomes ready again, previously this was also implied (in a more restricted form) by setting `p=0` FWBot removal ============= While the "forwardport bot" still exists as an API level (to segregate access rights between tokens) it has been removed as an interaction point, as part of the modules merge plan. As a result, fwbot stops responding ---------------------- Feedback messages are now always sent by the mergebot, the forward-porting bot should not send any message or notification anymore. commands moved to the merge bot ------------------------------- - `ignore`/`up to` simply changes bot - `close` as well - `skipci` is now a choice / flag of an `fw` command, which denotes the forward-port policy, - `fw=default` is the old `ci` and resets the policy to default, that is wait for the PR to be merged to create forward ports, and for the required statuses on each forward port to be received before creating the next - `fw=skipci` is the old `skipci`, it waits for the merge of the base PR but then creates all the forward ports immediately (unless it gets a conflict) - `fw=skipmerge` immediately creates all the forward ports, without even waiting for the PR to be merged This is a completely new mode, and may be rather broken as until now the 'bot has always assumed the source PR had been merged. approval rework --------------- Because of the previous section, there is no distinguishing feature between `mergebot r+` = "merge this PR" and `forwardbot r+` = "merge this PR and all its parent with different access rights". 
As a result, the two have been merged under a single `mergebot r+` with heuristics attempting to provide the best experience: - if approving a non-forward port, the behavior does not change - else, with review rights on the source, all ancestors are approved - else, as author of the original, approves all ancestors which descend from a merged PR - else, approves all ancestors up to and including the oldest ancestor to which we have review rights Most notably, the source's author is not delegated on the source or any of its descendants anymore. This might need to be revisited if it proves too restrictive. For the very specialized need of approving a forward-port and none of its ancestors, `review=` can now take a comma (`,`) separated list of pull request numbers (github numbers, not mergebot ids). Computed State ============== The `state` field of pull requests is now computed. Hopefully this makes the status more consistent and predictable in the long run, and importantly makes status management more reliable (because reference datum get updated naturally flowing to the state). For now however it makes things more complicated as some of the states have to be separately signaled or updated: - `closed` and `error` are now separate flags - `merge_date` is pulled down from forwardport and becomes the transition signal for ready -> merged - `reviewed_by` becomes the transition signal for approval (might be a good idea to rename it...) - `status` is computed from the head's statuses and overrides, and that becomes the validation state Ideally, batch-level flags like `skipchecks` should be on, well, the batch, and `state` should have a dependency on the batch. However currently the batch is not a durable / permanent member of the system, so it's a PR-level flag and a messy pile. One notable change is that forcing the state to `ready` now does that but also sets the reviewer, `skipchecks`, and overrides to ensure the API-mediated readying does not get rolled back by e.g. 
the runbot sending a status. This is useful for a few types of automated / programmatic PRs e.g. translation exports, where we set the state programmatically to limit noise. recursive dependency hack ------------------------- Given a sequence of PRs with an override of the source, if one of the PRs is updated its descendants should not have the override anymore. However if the updated PR gets overridden, its descendants should have that override. This requires some unholy manipulations via an override of `modified`, as the ORM supports recursive fields but not recursive dependencies (on a different field). unconditional followup scheduling --------------------------------- Previously scheduling forward-port followup was contingent on the FW policy, but it's not actually correct if the new PR is immediately validated (which can happen now that the field is computed, if there are no required statuses or all of the required statuses are overridden by an ancestor) as nothing will trigger the state change and thus scheduling of the fp followup. The followup function checks all the properties of the batch to port, so this should not result in incorrect ports. Although it's a bit more expensive, and will lead to more spam. Previously this would not happen because on creation of a PR the validation task (commit -> PR) would still have to execute. Misc Changes ============ - If a PR is marked as overriding / canceling stagings, it now does so on retry not just when setting initially. This was not handled at all previously, so a PR in P0 going into error due to e.g. a non-deterministic bug would be retried and still p=0, but a current staging would not get cancelled. Same when a PR in p=0 goes into error because something was failed, then is updated with a fix. - Add tracking to a bunch of relevant PR fields. Post-mortem analysis currently generally requires going through the text logs to see what happened, which is annoying. 
There is a nondeterminism / inconsistency in the tracking which sometimes leads the admin user to trigger tracking before the bot does, leading to the staging tracking being attributed to them during tests, shoved under the carpet by ignoring the user to whom that tracking is attributed. When multiple users update tracked fields in the same transaction all the changes are attributed to the first one having triggered tracking (?), I couldn't find why the admin sometimes takes over. - added and leveraged support for enum-backed selection fields - moved various fields from forwardport to runbot_merge - fix a migration which had never worked and which never ran (because I forgot to bump the version on the module) - remove some unnecessary intermediate de/serialisation fixes #673, fixes #309, fixes #792, fixes #846 (probably)	2024-05-23 07:58:46 +02:00
Xavier Morel	955a61a1e8	[CHG] runbot_merge, forwardbot: merge commands parser - move all commands parsing to runbot_merge as part of the long-term unification effort (#789) - set up an actual parser-ish structure to parse the commands to something approaching a sum type (fixes #507) - this is mostly prep for reworking the commands set (#673), although strict command parsing has been implemented (cf update to `test_unknown_commands`)	2024-05-16 10:37:50 +02:00
Xavier Morel	f4889ec8cf	[ADD] runbot_merge: ad-hoc ACL tracking to res.partner Sadly m2ms don't support tracking, so add a bunch of ad-hoc tracking to the override rights in order to know who, what, when at least. Do the same for the review rights although maybe tracking works for those.	2024-05-16 09:32:03 +02:00
Xavier Morel	a8e4d6dfee	[IMP] runbot_merge: don't select content when locking rows It might not be a huge amount of extra work since we're never actually retrieving the rows, but it still seems completely unnecessary. Sadly we can't do something cleaner like an aggregation, because aggregating requires moving the locking query to a subquery, and experimentally that seems slower than just ignoring / discarding the result set.	2024-05-16 09:32:03 +02:00
Xavier-Do	2a18cd7f3d	[FIX] *: remove invalid escape sequences	2024-04-30 16:05:21 +02:00
Xavier Morel	9f22305903	[IMP] runbot_merge: view warnings around ACLs Eventually we might want to add a proper "sensitive" flag on overrides and compute the flag based on that. For now just check for `ci/security`.	2024-03-19 12:54:20 +01:00
Xavier Morel	327500bc83	[FIX] runbot_merge: don't notify on closing unknown PRs If an untracked PR is closed, especially on an inactive or untracked branch, the closer (or author) almost certainly don't care to receive 3 different notifications on the subject. The fix requires a schema change in order to track that we're fetching the PR due to a `closed` event, as in other cases we may still want to notify the user that we received the request (and it just happened to resolve to a closed PR). Fixes #857	2024-03-12 12:17:30 +01:00
Xavier Morel	721b769039	[IMP] runbot_merge: handling of signatures - correctly handle projects without a secret set, we don't want the requests to blow up by trying to `strip()` a `False` or `None`, that is dumb, who would do that? - provide better reporting on signature mismatch: which repo we tried to access, and the full list of headers - log when there was no signature matching, either because there was no signature in the request and no secret on the project, or because the request is signed but no secret is configured on the repo	2024-02-26 10:11:53 +01:00
Xavier Morel	bcf6074153	[FIX] runbot_merge: maintenance gc command `gc --prune` can not take a separate parameter, it has to be part of the same arg (the `=` is not optional), otherwise the `gc` call blows up. So use the positional form of the git command to generate the correct invocation, Python-level `foo=bar` generates a split-style option in two args which does not please git.	2024-02-26 09:58:22 +01:00
Xavier Morel	de32b54090	[FIX] runbot_merge: error in maintenance, and tracking Before this, we would check if a repository had a name and run maintenance on it, leading to repeated (but unnoticed until now because I didn't monitor it) tracebacks as the maintenance cron would fail to find the local repo then run maintenance on nowhere anyway. Also augment the repo-finding process to try and get better information about what it's doing when it fails, rather than failing completely silently.	2024-02-23 13:58:31 +01:00
Xavier Morel	5d615bd733	[IMP] runbot_merge: logging around webhook body & signature The signature validation code seems correct, but there are validation failures in production, increase logging around webhook requests to try and diagnose things better: - dump the entire body to the github_requests logfile - add the received & computed signatures to the log error	2024-02-12 10:19:53 +01:00
Xavier Morel	65c303a750	[FIX] runbot_merge: bot info fetch `r0` is still used afterwards as the response object, so don't overwrite it when parsing the JSON body.	2024-02-12 10:18:59 +01:00
Xavier Morel	3a4fa494f8	[FIX] runbot_merge: incorrectly named endpoint Method has the same name as its preceding sibling, so it overwrites it and one of the endpoints is not accessible.	2024-02-12 10:18:25 +01:00
Xavier Morel	95393afde8	[FIX] runbot_merge: extraction of authorship info during rebase Turns out I've always been mistaken about the handling of quotes inside shell parameters, apparently they are always consumed by the shell unless nested so --foo="bar" reaches the underlying program as --foo=bar. This means when using subprocess (without shell=True), adding the quotes leads to mishandling of the parameters (as the subprocess now has quotes it's not equipped to deal with). This exact error is made in the `--pretty` parameter of git show, locally this results in the author name and the committer email being terminated by double quotes although somehow other layers seem to exclude those from the end result (I assume `commit-tree` strips the quotes from the envvars under the assumption that users can mistakenly quote them or something?). Anyway while it does not seem harmful (so far), better safe than sorry.	2024-01-22 15:36:37 +01:00
Xavier Morel	64d80c276b	[FIX] *: tests not working with github actual Add intermediate forks to a pair of tests, because github now (?) requires being able to write on a branch to create a PR from it, so the non-collaborator reviewers were not able to create a PR from a branch created by user.	2024-01-16 15:07:25 +01:00
Xavier Morel	4b9fb513eb	[IMP] *: make to_pr more resilient to webhook delays Github delivery delays keep getting worse. Depending on what comes before `to_pr`, this leads it to fail more often as it runs before the PR it's looking for was signaled to the mergebot. In order to mitigate this issue, add a wait loop in `to_pr`, waiting up to 4 seconds for the PR it's looking for before aborting. Also replace manual lookups by `to_pr` in every method of `TestPRUpdate` while at it since it hit a few of the issues. And remove the xfail test case since it seems unlikely github will change tack (maybe? could be worth testing to be sure).	2024-01-16 15:03:45 +01:00
Xavier Morel	cea1b62ac2	[FIX] runbot_merge: commit messages should be trimmed indeed Reverts commit `85a7890023` which untrimmed the commits: while it's probably true that git and github's APIs differ in their treatment of whitespace (in that git pretty much always terminates the commit message with a newline while github does not, as far as I understand, though I didn't really validate it) the issue was that github also trims on output when fetching over the API, something the fake did not do. So rather than update the test data I should have fixed the fake, but I failed to realise that at the time. I only realised when I decided to re-run against github actual (something I rarely do anymore as it's painfully slow) and it went on to choke on every message I'd updated.	2024-01-16 10:51:37 +01:00
Xavier Morel	45f0c8cc81	[FIX] runbot_merge: rebase logging The logging line was copied over from the github-api version, but it was not correctly fixed up to match, leading to a lot of spam on stderr when debug is enabled (aka spams journalctl on the production server). Splat the logging call out of `rebase` and into the various callers, so they have access to the pr object to log it.	2024-01-16 09:53:57 +01:00
Xavier Morel	1cb31cf2c2	[FIX] runbot_merge: 1.9 version & migration Forgot to bump the version when creating the migration. Also convert the migration to a single sql query, although the migration will never run because I ran the query manually to fix things up after finding out the data was "dirty" since the new code (assuming only modern statuses) was merged without running the migration. Thankfully it looks like the impact was not too severe (because the legacy statuses should only be present on very old commits / PRs), I don't remember when I deployed the update but apparently just a pair of PRs got affected, because their `previous_failure` was the old style and thus broke the "new failure" check.	2024-01-16 09:44:13 +01:00
Xavier Morel	994cea467c	[FIX] runbot_merge: typo in freeze wizard Forgot to deref the id of the staging we're trying to lock, so the specific case where we start a freeze with a bump PR and an outstanding staging in master would instantly blow up.	2024-01-16 07:54:43 +01:00
Xavier Morel	b21fbaf9cc	[IMP] runbot_merge: prevent merging empty commits The low-level APIs used by the staging process don't do any merge check, so because of the way Git works it's possible for them to merge commits with content as empty commits, e.g. if something was merged then backported and the backport was merged on top. This should trigger a merge failure as we don't really want to merge newly empty. This is a feature which some high level commands of git support, kind-of, e.g. by default `git rebase --interactive` will ask about newly empty commits. Take care to allow merging already-empty commits, as these do have a use for signaling, freezes, .... Fixes #809	2023-11-30 12:45:39 +01:00
Xavier Morel	2cd3fb8999	[IMP] runbot_merge: make uniquifier commit optional Prepares the possibility of either more direct communication with the CI platform(s) or just assuming CI has gotten reliable enough and colleagues intelligent enough that this is not an issue anymore because they've stopped pushing empty branches (which we know is not the case). Fixes #806	2023-11-30 12:45:39 +01:00
Xavier Morel	a15086a8a9	[FIX] runbot_merge: "not something we can merge" freeze error During the 17.0 freezeathon, the freeze wizard blew up with MergeError: merge-tree: {oid} - not something we can merge Turns out when freezes were moved to local (`4d2c0f86e1`) I forgot to fetch the heads of the release and bump PRs into the local repo, so rebasing them atop their branch would fail because the local repository would just not find the object being rebased. I had missed that case in testing as well, but in fairness even if I had tried testing it I'd likely have missed it: implementation limitations (shortcuts) of dummy central mean it currently ignores what objects the client requests and bundles everything it can find associated with the repository (meaning it sends the entire network). This is not usually an issue because the test repos are pretty small, but it means the client can have objects they should not because they never requested them and might not even be supposed to be aware of their existence. Anyway solve by doing the obvious: fetch the heads of the release and bump PRs at the same time we update the branch being forked off. Also update the freeze tests to trigger the issue (by creating the release / bump PRs in different repos) and running the tests against github actual to make sure we can actually see them fail (correctly, with the merge error we expect, not via errors in the test), and we do fix them. Fixes #821	2023-11-30 12:45:39 +01:00
Xavier Morel	f44b0c018e	[IMP] forwardport: allow updating the fw limit after merging the source Currently, once a source PR has been merged it's not possible to set or update a limit, which can be inconvenient (e.g. people might have forgotten to set it, or realise after the fact that setting one is not useful, or might realise after the fact that they should unset it). This PR relaxes that constraint (which is not just a relaxation as it requires a bunch of additional work and validation), it should now be possible to: - update the limit to any target, as long as that target is at or above the current forwardport tip's - with the exception of a tip being forward ported, as that can't be cancelled - resume a forward port stopped by a previously set limit (just increase it to whatever) - set a limit from any forward-port PR - set a limit from a source, post-merge The process should also provide pretty chatty feedback: - feedback on the source and closest root - feedback about adjustments if any (e.g. setting the limit to B but there's already a forward port to C, the 'bot should set the limit to C and tell you that it happened and why) Fixes #506	2023-10-06 13:19:01 +02:00
Xavier Morel	76f4ed3bf6	[ADD] runbot_merge: delete scratch branches when a branch is disabled If a branch `foo` is disabled, then `tmp.foo` and `staging.foo` become unnecessary (with #247 fixed the tmp refs are not used for creating stagings anymore, but for now they're still used for the "safety dance" of merging a successful staging into the corresponding mainline). Fixes #605	2023-08-31 09:07:01 +02:00
Xavier Morel	65ed7c51bc	[IMP] *: note to merge using mergebot in conflict message The message has a lot of info, but left the merging bit unwritten. Correct this issue. Fixes #765	2023-08-30 12:10:46 +02:00
Xavier Morel	69f5cac2d7	[FIX] runbot_merge: support non-ascii secrets & sha256 signatures Per the Github webhook documentation: 1. sha1 signatures are deprecated, github recommends sha256 (though that's unlikely to be a concern anyway), and dummy-central supports both so it should be no issue. > If possible, we recommend that you use the x-hub-signature-256 > header for improved security. 2. Non-ascii secrets are supported and should be utf8-encoded to compute signatures... that's not actually documented as github docs only mention payload encoding but it seems to make sense anyway. Also improve the warning message by replacing the signature (which is useless) by the delivery id (which could allow introspecting the hook or something).	2023-08-30 11:43:13 +02:00
Xavier Morel	302fd42cae	[ADD] forwardport: message on parent of detached PR Currently a user is not notified that the parent of a detached PR needs to be independently approved and may miss that information. Add a notification to that PR as well. Fixes #788	2023-08-29 15:59:05 +02:00
Xavier Morel	73e4ac6066	[REM] runbot_merge: check_visibility Its sole use was removed with the switch to local staging, but I missed removing it. Closes #625 as there is no need to update it to v2 smart protocol.	2023-08-29 13:26:12 +02:00
Xavier Morel	b0b609ebe7	[CHG] runbot_merge: perform stagings in a local clone of the repo The github API has gotten a lot more constraining (with rate restrictions being newly enforced or added somewhat out of nowhere), and as importantly a lot less reliable. So move the staging process off of github and locally, similar to the forward porting process (whose repo cache is being reused for this). Fixes #247	2023-08-25 15:33:25 +02:00
Xavier Morel	f0344fd34a	[ADD] runbot_merge: link back from commit to PR	2023-08-25 15:31:06 +02:00
Xavier Morel	4d2c0f86e1	[CHG] runbot_merge: convert freeze wizard to local repo Probably less necessary than for the regular staging stuff, but might as well while at it. Requires updating one of the tests to generate a non-ff push, as O_CREAT doesn't exist at the git level, and the client (and it is client-side) only protects against force pushes. So there is no way to trigger an issue with just the creation of the new branch, it needs to exist and point to a non-ancestor commit. Also remove a sleep in the ref update loop as there are no ref updates anymore, until the very final sync via git. NB: maybe it'd be possible to push both bump and release PRs together for each repo, but getting which update failed in case of failure seems difficult.	2023-08-25 15:06:04 +02:00
Xavier Morel	85a7890023	[CHG] runbot_merge: switch staging from github API to local It has been a consideration for a while, but the pain of subtly interacting with git via the ignominious CLI kept it back. Then ~~the fire nation attacked~~ github got more and more tight-fisted (and in some ways less reliable) with their API. Staging pretty much just interacts with the git database, so it's both a facultative github operator (it can just interact with git directly) and a big consumer of API requests (because the git database endpoints are very low level so it takes quite a bit of work to do anything especially when high-level operations like rebase have to be replicated by hand). Furthermore, an issue has also been noticed which can be attributed to using the github API (and that API's reliability getting worse): in some cases github will fail to propagate a ref update / reset, so when staging 2 PRs it's possible that the second one is merged on top of the temporary branch of the first one, yielding a kinda broken commit (in that it's a merge commit with a broken error message) instead of the rebase / squash commit we expected. As it turns out it's a very old issue but only happened very early so was misattributed and not (sufficiently) guarded against: - 41bd82244bb976bbd4d4be5e7bd792417c7dae6b (October 8th 2018) was spotted but thought to be a mergebot issue (might have been one of the opportunities where ref-checks were added though I can't find any reference to the commit in the runbot repo). - 2be25052e147b151d1d8a5bc73cceb351586ce03 (October 15th, 2019) was missed (or ignored). - 5a9fe7a7d05a9df7186072a7bffd60c6b428fd0e (July 31st, 2023) was spotted, but happened at a moment where everything kinda broke because of github rate-limiting ref updates, so the forensics were difficult and it was attributed to rate limiting issues. 
- f10d03bf0f2e8f88f62a5d8356b84f714196130f (August 24th, 2023) broke the camel's back (and the head block): the logs were not too interspersed with other garbage and pretty clear that github ack'd a ref update, returned the correct oid when checking the ref, then returned the wrong oid when fetching it later on. No Working Copy =============== The working copy turns out to not be necessary, the plumbing commands we need work just fine on a bare repository. Working without a WC means we had to reimplement the high level operations (rebase) by hand much as we'd done previously, but we needed to do that anyway as git doesn't seem to provide any way to retrieve the mapping when rebasing/cherrypicking, and cherrypicking by commit doesn't work well as it can't really find the merge base it needs. Forward-porting can almost certainly be implemented similarly (with some overhead), issue #803 has been opened to keep track of the idea. No TMP ====== The `tmp.` branches are no more, the process of creating stagings is based entirely around oids, if staging something fails we can just abandon the oids (they'll be collected by the weekly GC), we only need to update the staging branches at the very end of the process. This simplifies things a fair bit. For now we have stopped checking for visibility / backoff as we're pushing via git, hopefully it is a more reliable reference than the API. Commit Message Formatting ========================= There's some unfortunate churn in the test, as the handling of trailing newlines differs between github's APIs and git itself. Fixes #247 PS: It might be a good idea to use pygit2 instead of the CLI eventually, the library is typed which is nice, and it avoids shelling out although that's really unlikely to be a major cost.	2023-08-25 15:06:04 +02:00
Xavier Morel	2fbbe3fcdb	[ADD] runbot_merge: github identity for the mergebot Necessary to create commits as the mergebot without going through the github API. Copy of the improved version from forwardport. Not an override, to avoid unnecessarily triggering one or the other which is confusing and weird.	2023-08-25 15:04:48 +02:00
Xavier Morel	86a1b5523e	[MOV] runbot_merge: all the staging creation code to a separate module Move almost all the staging code to free functions, in a separate module, and extensively typed. The only bits which didn't move are: - the entry point (the cron hook), because it has to be a model method in order to be called - the `_build_merge_message` method, because it needs to be overridable There's also a bit of an import mess, because the cron & `_build_merge_message` need to call into the new module, but the new module wants the types they belong to, so it's a bit circular.	2023-08-25 15:04:48 +02:00
Xavier Morel	9de18de454	[CHG] *: move repo cache from forwardbot to mergebot If the stagings are going to be created locally (via a git working copy rather than the github API), the mergebot part needs to have access to the cache, so move the cache over. Also move the maintenance cron. In an extremely minor way, this prefigures the (hopeful) eventual merging of the ~~planes~~ modules.	2023-08-25 15:04:48 +02:00
Xavier Morel	7bca6f0bd7	[ADD] runbot_merge: allow resolving commits by sha `_rec_name = 'sha'` means name_search and cross-model searches will work much better. Relates to #802	2023-08-25 11:01:46 +02:00
Xavier Morel	0826b3484b	[ADD] runbot_merge: view improvements - add formatting for a bunch of backend objects - add cross-links in order to use toplevel navigation between objects e.g. project -> branch -> staging -> PR with breadcrumbs instead of shitty dialog boxes Relates to #802	2023-08-25 11:01:38 +02:00
Xavier Morel	9b5bb338b4	[REM] runbot_merge: status compatibility functions When I updated the status storage (including `previous_failure`) for some reason I didn't just migrate from the old to the new format, and added bridge functions instead. This is not really necessary (or useful), so convert all the legacy data and remove the conversion helpers. Relates to #802	2023-08-24 10:47:16 +02:00
Xavier Morel	90961b99c9	[ADD] *: changelog entries I forgot Can't hurt to have them.	2023-08-14 09:28:19 +02:00
Xavier Morel	7348e4d7a4	[IMP] runbot_merge: ensure at least 1s between mutating GH calls Mostly a temporary safety feature after the events of 07-31: it's still not clear whether that was a one-off issue or a change in policy (I was not able to reproduce locally even doing several set_refs a second) and the gh support is not super talkative, but it probably doesn't hurt to commit the workaround until #247 gets implemented. On 2023-07-31, around 08:30 UTC, `set_ref` started failing, a lot (although oddly enough not continuously), with the unhelpful message that > 422: Reference cannot be updated This basically broke all stagings, until a workaround was implemented by adding a 1s sleep before `set_ref` to ensure no more than 1 `set_ref` per second, which kinda sorta has been the github recommendation forever but had never been an issue before. Contributing to this suspicion is that in late 2022, the documentation of error 422 on `PATCH git/refs/{ref}` was updated to: > Validation failed, or the endpoint has been spammed. Still would be nice if GH was clear about it and sent a 429 instead. Technically the recommendation is: > If you're making a large number of POST, PATCH, PUT, or DELETE > requests for a single user or client ID, wait at least one second > between each request. So... actually implement that. On a per-worker basis as for the most part these are serial processes (e.g. crons), we can still get above the rate limit due to concurrent crons but it should be less likely. Also take `Retry-After` into account, can't hurt, though we're supposed to retry just the request rather than abort the entire thing. Maybe a future update can improve this handling. Would also be nice to take `X-RateLimit` into account, although that's supposed to apply to all requests so we'd need a second separate timestamp to track it. Technically that's probably also the case for `Retry-After`.
And fixing #247 should cut down drastically on the API calls traffic as staging is a very API-intensive process, especially with the sanity checks we had to add, these days we might be at 4 calls per commit per PR, and up to 80 PRs/staging (5 repositories and 16 batches per staging), with 13 live branches (though realistically only 6-7 have significant traffic, and only 1~2 get close to filling their staging slots).	2023-08-11 12:32:21 +02:00
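The per-worker throttle described in the message above could look roughly like the following. This is a minimal sketch, not the code from the commit: `MutationThrottle`, `wait`, `note_response` and the injectable `clock` / `sleep` parameters are hypothetical names introduced for illustration, and only the delta-seconds form of `Retry-After` is handled.

```python
import time
from typing import Callable, Mapping

# HTTP methods the quoted recommendation treats as mutating.
MUTATING = frozenset({"POST", "PATCH", "PUT", "DELETE"})


class MutationThrottle:
    """Hypothetical per-worker throttle: keeps at least `min_interval`
    seconds between the starts of two mutating calls."""

    def __init__(
        self,
        min_interval: float = 1.0,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        # Earliest time at which the next mutating call may start.
        self._not_before = 0.0

    def wait(self, method: str) -> float:
        """Sleep if needed before issuing `method`; return the delay applied."""
        if method.upper() not in MUTATING:
            return 0.0  # reads are not throttled in this sketch
        delay = max(0.0, self._not_before - self._clock())
        if delay:
            self._sleep(delay)
        self._not_before = self._clock() + self.min_interval
        return delay

    def note_response(self, headers: Mapping[str, str]) -> None:
        """Push the next allowed time back when the response carries a
        Retry-After given in seconds (the HTTP-date form is ignored here)."""
        value = headers.get("Retry-After")
        if value is not None and value.strip().isdigit():
            self._not_before = max(
                self._not_before, self._clock() + int(value)
            )
```

A caller would invoke `wait(method)` right before each request and `note_response(response_headers)` right after it. Because the state lives in one object per worker, concurrent crons can still exceed the limit, as the message notes.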
Xavier Morel	85a74a9e32	[ADD] runbot_merge: staging query endpoints `/runbot_merge/stagings` ======================== This endpoint is a reverse lookup from any number of commits to a (number of) staging(s): - it takes a list of commit hashes as either the `commits` or the `heads` keyword parameter - it then returns the stagings which have all these commits as respectively commits or heads, if providing all commits for a project the result should always be unique (if any) - `commits` are the merged commits, aka the stuff which ends up in the actual branches - `heads` are the staging heads, aka the commits at the tip of the `staging.$name` branches, those may be the same as the corresponding commit, or might be deduplicator commits which get discarded on success `/runbot_merge/stagings/:id` ============================ Returns a list of all PRs in the staging, grouped by batch (aka PRs which have the same label and must be merged together). For each PR, the `repository` name, `number`, and `name` in the form `$repository#$number` get returned. `/runbot_merge/stagings/:id1/:id2` ================================== Returns a list of all the successfully merged stagings between `id1` and `id2`, from oldest to most recent. Individual records have the form: - `staging` is the id of the staging - `prs` is the contents of the previous endpoint (a list of PRs grouped by batch) `id1` must be lower than `id2`. By default, this endpoint is inclusive on both ends, the `include_from` and / or `include_to` parameters can be passed with the `False` value to exclude the corresponding bound from the result. Related to #768	2023-08-11 11:13:34 +02:00
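The bound handling of `/runbot_merge/stagings/:id1/:id2` can be modelled in a few lines. This is only a sketch of the semantics described in the message above, not the endpoint's implementation: `stagings_between` is a hypothetical helper, its input is assumed to be the ids of successfully merged stagings, and ids are assumed to increase with age so that sorting them yields oldest to most recent.

```python
from typing import Iterable, List


def stagings_between(
    merged_ids: Iterable[int],
    id1: int,
    id2: int,
    include_from: bool = True,
    include_to: bool = True,
) -> List[int]:
    """Return the ids in [id1, id2], inclusive on both ends by default;
    either bound is excluded when its flag is False."""
    if id1 >= id2:
        raise ValueError("id1 must be lower than id2")
    lower = id1 if include_from else id1 + 1
    upper = id2 if include_to else id2 - 1
    # Assumption: a higher id means a more recent staging.
    return sorted(i for i in merged_ids if lower <= i <= upper)
```

For example, with merged stagings 3, 5, 7 and 9, the range 3 to 9 with `include_from=False` keeps 5, 7 and 9.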
Xavier Morel	4eefc980bb	[IMP] runbot_merge: logger messages	2023-08-10 16:14:33 +02:00

521 Commits