runbot

mirror of https://github.com/odoo/runbot.git synced 2025-03-17 16:35:47 +07:00

Author	SHA1	Message	Date
Xavier Morel	aa1df22657	[MERGE] bot from 16.0 to 17.0 Broken (can't run odoo at all): - In Odoo 17.0, the `pre_init_hook` takes an env, not a cursor, update `_check_citext`. - Odoo 17.0 rejects `@attrs` and doesn't say where they are or how to update them, fun, hunt down `attrs={'invisible': ...` and try to fix them. - Odoo 17.0 warns on non-multi creates, update them, most were very reasonable, one very wasn't. Test failures: - Odoo 17.0 deprecates `name_get` and doesn't use it as a source anymore, replace overrides by overrides to `_compute_display_name`. - Multiple tracking changes: - `_track_set_author` takes a `Partner` not an id. - `_message_compute_author` still requires overriding in order to handle record creation, which in standard doesn't support author overriding. - `mail.tracking.value.field_type` has been removed, the field type now needs to be retrieved from the `field_id`. - Some tracking ordering have changed and require adjusting a few tests. Also added a few flushes before SQL queries which are not (obviously at least) at the start of a cron or controller, no test failure observed but better safe than sorry (probably).	2024-08-12 13:13:03 +02:00
Xavier Morel	b1d3278de1	[CHG] forwardport: perform forward porting without working copies The goal is to reduce maintenance and odd disk interactions & concurrency issues, by not creating concurrent clones, not having to push forks back in the repository, etc... it also removes the need to cleanup "scratch" working copies though that looks not to have been an issue in a while. The work is done on isolated objects without using or mutating refs, so even concurrent work should not be a problem. This turns out to not be any more verbose (less so if anything) than using `cherry-pick`, as that is not really designed for scripted / non-interactive use, or for squashing commits thereafter. Working directly with trees and commits is quite a bit cleaner even without a ton of helpers. Much of the credit goes to Julia Evans for [their investigation of 3-way merges as the underpinnings of cherry-picking][3-way merge], this would have been a lot more difficult if I'd had to rediscover the merge-base trick independently. A few things have been changed by this: - The old trace/stderr from cherrypick has disappeared as it's generated by cherrypick, but for a non-interactive use it's kinda useless anyway so I probably should have looked into removing it earlier (I think the main use was investigation of the inflateinit issue). - Error on emptied commits has to be hand-rolled as `merge-tree` couldn't care less, this is not hard but is a bit annoying. - `merge-tree`'s conflict information only references raw commits, which makes sense, but requires updating a bunch of tests. Then again so does the fact that it usually doesn't send anything to stderr, so that's usually disappearing. Conveniently `merge-tree` merges the conflict marker directly in the files / tree so we don't have to mess about moving them back out of the repository and into the working copy as I assume cherry-pick does, which means we don't have to try and commit them back in ether. That is a huge part of the gain over faffing about with the working copy. Fixes #847 [3-way merge]: https://jvns.ca/blog/2023/11/10/how-cherry-pick-and-revert-work/	2024-07-08 14:37:14 +02:00
Xavier Morel	3191c44459	[ADD] runbot_merge: synthetic batches & stagings to freeze wizard Merged PRs should have a batch which should have a staging, this makes the treatment uniform across the board and avoids funky data which is hard to place or issues when reconstructing history. Also create synthetic batches & stagings for older freezes (and bumps)	2024-05-29 07:55:07 +02:00
Xavier Morel	ef6a002ea7	[CHG] runbot_merge: move staging readiness to batch Staging readiness is a batch-level concerns, and many of the markers are already there though a few need to be aggregated from the PRs. As such, staging has no reason to be performed in terms of PRs anymore, it should be performed via batches directly. There is a bit of a mess in order not to completely fuck up when retargeting PRs (implicitly via freeze wizard, or explicitely) as for now we're moving PRs between batches in order to keep the batches mostly target-bound. Some of the side-effects in managing the coherence of the targeting and moving PRs between batches is... not great. This might need to be revisited and cleaned up with those scenarios better considered.	2024-05-23 07:58:58 +02:00
Xavier Morel	21b5dd439b	[CHG] runbot_merge: move merge_date to batch, remove active - `merge_date` should be common to an entire batch, so move it there - remove `Batch.active` which should probably have been removed when batches were made persistent (can eventually re-add as a proxy for `merge_date` being set maybe, but for now removing it seems a better way to catch mistakes) - update various sites to use `Batch.merge_date` instead of `Batch.active`	2024-05-23 07:58:58 +02:00
Xavier Morel	d4fa1fd353	[CHG] : rewrite commands set, rework status management This commit revisits the commands set in order to make it more regular, and limit inconsistent command-sets, although it includes pseudo-command aliases for common tasks now removed from the core set. Hard Errors =========== The previous iteration of the commands set would ignore any non-command term in a command line. This has been changed to hard error (and ignoring the entire thing) if any command is unknown or invalid. This fixes inconsistent / unexpected interpretations where a user sends a command, then writes a novel on the same line some words of which happen to also* be commands, leading to merge states they did not expect. They should now be told to fuck off. Priority Restructuring ---------------------- The numerical priority system was pretty messy in that it confused "staging priority" (in ways which were not entirely straightforward) with overrides to other concerns. This has now being split along all the axis, with separate command subsets for: - staging prioritisation, now separated between `default`, `priority`, and `alone`, - `default` means PRs are picked by an unspecified order when creating a staging, if nothing better is available - `priority` means PRs are picked first when staging, however if `priority` PRs don't fill the staging the rest will be filled with `default`, this mode did not previously exist - `alone` means the PRs are picked first, before splits, and only `alone` PRs can be part of the staging (which usually matches the modename) - `skipchecks` overrides both statuses and approval checks, for the batch, something previously implied in `p=0`, but now independent. Setting `skipchecks` basically makes the entire batch `ready`. For consistency this also sets the reviewer implicitly: since skipchecks overrides both statuses and approval, whoever enables this mode is essentially the reviewer. - `cancel` cancels any ongoing staging when the marked PR becomes ready again, previously this was also implied (in a more restricted form) by setting `p=0` FWBot removal ============= While the "forwardport bot" still exists as an API level (to segregate access rights between tokens) it has been removed as an interaction point, as part of the modules merge plan. As a result, fwbot stops responding ---------------------- Feedback messages are now always sent by the mergebot, the forward-porting bot should not send any message or notification anymore. commands moved to the merge bot ------------------------------- - `ignore`/`up to` simply changes bot - `close` as well - `skipci` is now a choice / flag of an `fw` command, which denotes the forward-port policy, - `fw=default` is the old `ci` and resets the policy to default, that is wait for the PR to be merged to create forward ports, and for the required statuses on each forward port to be received before creating the next - `fw=skipci` is the old `skipci`, it waits for the merge of the base PR but then creates all the forward ports immediately (unless it gets a conflict) - `fw=skipmerge` immediately creates all the forward ports, without even waiting for the PR to be merged This is a completely new mode, and may be rather broken as until now the 'bot has always assumed the source PR had been merged. approval rework --------------- Because of the previous section, there is no distinguishing feature between `mergebot r+` = "merge this PR" and `forwardbot r+` = "merge this PR and all its parent with different access rights". As a result, the two have been merged under a single `mergebot r+` with heuristics attempting to provide the best experience: - if approving a non-forward port, the behavior does not change - else, with review rights on the source, all ancestors are approved - else, as author of the original, approves all ancestors which descend from a merged PR - else, approves all ancestors up to and including the oldest ancestor to which we have review rights Most notably, the source's author is not delegated on the source or any of its descendants anymore. This might need to be revisited if it provides too restrictive. For the very specialized need of approving a forward-port and none of its ancestors, `review=` can now take a comma (`,`) separated list of pull request numbers (github numbers, not mergebot ids). Computed State ============== The `state` field of pull requests is now computed. Hopefully this makes the status more consistent and predictable in the long run, and importantly makes status management more reliable (because reference datum get updated naturally flowing to the state). For now however it makes things more complicated as some of the states have to be separately signaled or updated: - `closed` and `error` are now separate flags - `merge_date` is pulled down from forwardport and becomes the transition signal for ready -> merged - `reviewed_by` becomes the transition signal for approval (might be a good idea to rename it...) - `status` is computed from the head's statuses and overrides, and that becomes the validation state Ideally, batch-level flags like `skipchecks` should be on, well, the batch, and `state` should have a dependency on the batch. However currently the batch is not a durable / permanent member of the system, so it's a PR-level flag and a messy pile. On notable change is that forcing the state to `ready` now does that but also sets the reviewer, `skipchecks`, and overrides to ensure the API-mediated readying does not get rolled back by e.g. the runbot sending a status. This is useful for a few types of automated / programmatic PRs e.g. translation exports, where we set the state programmatically to limit noise. recursive dependency hack ------------------------- Given a sequence of PRs with an override of the source, if one of the PRs is updated its descendants should not have the override anymore. However if the updated PR gets overridden, its descendants should have that override. This requires some unholy manipulations via an override of `modified`, as the ORM supports recursive fields but not recursive dependencies (on a different field). unconditional followup scheduling --------------------------------- Previously scheduling forward-port followup was contigent on the FW policy, but it's not actually correct if the new PR is immediately validated (which can happen now that the field is computed, if there are no required statuses or all of the required statuses are overridden by an ancestor) as nothing will trigger the state change and thus scheduling of the fp followup. The followup function checks all the properties of the batch to port, so this should not result on incorrect ports. Although it's a bit more expensive, and will lead to more spam. Previously this would not happen because on creation of a PR the validation task (commit -> PR) would still have to execute. Misc Changes ============ - If a PR is marked as overriding / canceling stagings, it now does so on retry not just when setting initially. This was not handled at all previously, so a PR in P0 going into error due to e.g. a non-deterministic bug would be retried and still p=0, but a current staging would not get cancelled. Same when a PR in p=0 goes into error because something was failed, then is updated with a fix. - Add tracking to a bunch of relevant PR fields. Post-mortem analysis currently generally requires going through the text logs to see what happened, which is annoying. There is a nondeterminism / inconsistency in the tracking which sometimes leads the admin user to trigger tracking before the bot does, leading to the staging tracking being attributed to them during tests, shove under the carpet by ignoring the user to whom that tracking is attributed. When multiple users update tracked fields in the same transaction all the changes are attributed to the first one having triggered tracking (?), I couldn't find why the admin sometimes takes over. - added and leveraged support for enum-backed selection fields - moved variuous fields from forwardport to runbot_merge - fix a migration which had never worked and which never run (because I forgot to bump the version on the module) - remove some unnecessary intermediate de/serialisation fixes #673, fixes #309, fixes #792, fixes #846 (probably)	2024-05-23 07:58:46 +02:00
Xavier Morel	a8e4d6dfee	[IMP] runbot_merge: don't select content when locking rows It might not be a huge amount of extra work since we're never actually retrieving the rows, but it still seems completely unnecessary. Sadly we can't do something cleaner like an aggregation, because aggregating requires moving the locking query to a subquery, and experimentally that seems slower than just ignoring / discarding the result set.	2024-05-16 09:32:03 +02:00
Xavier Morel	45f0c8cc81	[FIX] runbot_merge: rebase logging The logging line was copied over from the github-api version, but it was not correctly fixed up to match, leading to a lot of spam on stderr when debug is enabled (aka spams journalctl on the production server). Splat the logging call out of `rebase` and into the various callers, so they have access to the pr object to log it.	2024-01-16 09:53:57 +01:00
Xavier Morel	994cea467c	[FIX] runbot_merge: typo in freeze wizard Forgot to deref the id of the staging we're trying to lock, so the specific case where we start a freeze with a bump PR and an outstanding staging in master would instantly blow up.	2024-01-16 07:54:43 +01:00
Xavier Morel	a15086a8a9	[FIX] runbot_merge: "not something we can merge" freeze error During the 17.0 freezeathon, the freeze wizard blew up with MergeError: merge-tree: {oid} - not something we can merge Turns out when freezes were moved to local (`4d2c0f86e1`) I forgot to fetch the heads of the release and bump PRs into the local repo, so rebasing them atop their branch would fail because the local repository would just not find the object being rebased. I had missed that case in testing as well, but in fairness even if I had tried testing it I'd likely have missed it: implementation limitations (shortcuts) of dummy central mean it currently ignores what objects the client requests and bundles everything it can find associated with the repository (meaning it sends the entire network). This is not usually an issue because the test repos are pretty small, but it means the client can have objects they should not because they never requested them and might not even be supposed to be aware of their existence. Anyway solve by doing the obvious: fetch the heads of the release and bump PRs at the same time we update the branch being forked off. Also update the freeze tests to trigger the issue (by creating the release / bump PRs in different repos) and running the tests against github actual to make sure we can actually see them fail (correctly, the merge error we expect) not via errors in the test), and we do fix them. Fixes #821	2023-11-30 12:45:39 +01:00
Xavier Morel	4d2c0f86e1	[CHG] runbot_merge: convert freeze wizard to local repo Probably less necessary than for the regular staging stuff, but might as well while at it. Requires updating one of the test to generate a non-ff push, as O_CREAT doesn't exist at the git level, and the client (and it is client-side) only protects against force pushes. So there is no way to trigger an issue with just the creation of the new branch, it needs to exist and point to a non-ancestor commit. Also remove a sleep in the ref update loop as there are no ref updates anymore, until the very final sync via git. NB: maybe it'd be possible to push both bump and release PRs together for each repo, but getting which update failed in case of failure seems difficult.	2023-08-25 15:06:04 +02:00
Xavier Morel	9260384284	[FIX] runbot_merge: concurrency error in freeze wizard (hopefully) During the 16.3 freeze an issue was noticed with the concurrency safety of the freeze wizard (because it blew up, which caused a few issues): it is possible for the cancelling of an active staging to the master branch to fail, which causes the mergebot side of the freeze to fail, but the github state is completed, which puts the entire thing in a less than ideal state. Especially with the additional issue that the branch inserter has its own concurrency issue (which maybe I should fix): if there are branches being forward-ported across the new branch, it's unable to see them, and thus can not create the now-missing PRs. Try to make the freeze wizard more resilient: 1. Take a lock on the master staging (if any) early on, this means if we can acquire it we should be able to cancel it, and it won't suffer a concurrency error. 2. Add the `process_updated_commits` cron to the set of locked crons, trying to read the log timeline it looks like the issue was commits being impacted on that staging while the action had started: REPEATABLE READ meant the freeze's transaction was unable to see the update from the commit statuses, therefore creating a diverging update when it cancelled the staging, which postgres then reported as a serialization error. I'd like to relax the locking of the cron (to just FOR SHARE), but I think it would work, per postgres: > SELECT FOR UPDATE, and SELECT FOR SHARE commands behave the same as > SELECT in terms of searching for target rows: they will only find > target rows that were committed as of the transaction start > time. However, such a target row might have already been updated (or > deleted or locked) by another concurrent transaction by the time it > is found. In this case, the repeatable read transaction will wait > for the first updating transaction to commit or roll back (if it is > still in progress). If the first updater rolls back, then its > effects are negated and the repeatable read transaction can proceed > with updating the originally found row. But if the first updater > commits (and actually updated or deleted the row, not just locked > it) then the repeatable read transaction will be rolled back with > the message This means it would be possible to lock the cron, and then get a transaction error because the cron modified one of the records we're going to hit while it was running: as far as the above is concerned the cron's worker had "just locked" the row so it's fine to continue. However this makes it more and more likely an error will be hit when trying to freeze (to no issue, but still). We'll have to see how that ends up. Fixes #766 maybe	2023-06-21 14:26:19 +02:00
Xavier Morel	907c6072d1	[FIX] runbot_merge: cancel master staging on freeze If there are bump PRs anyway: the bump commits will cause the forward-port of the staging to fail, so might as well clearly notify everybody of the issue if there is a pending staging, and not waste too much time waiting for a staging which can not succeed. We could also cancel stagings when there's no bump PR, but it's not clear that there's any reason to do so: if we didn't touch any master branch, there's no reason for the staging to fail, or to otherwise cancel it. And obviously we can't have staged anything on the new branch so there's nothing to cancel. Part-Of: #718	2023-01-25 12:25:45 +01:00
Xavier Morel	8d7d6302d3	[FIX] runbot_merge: make freeze wizard labels lookup not shit I DECLARE BANKRUPTCY!!! The previous implementation of labels lookup was really not intuitive (it was just a char field, and matched labels by equality including the owner tag), and was also full of broken edge cases (e.g. traceback if a label matched multiple PRs in the same repo because people reuse branch names). Tried messing about with contextual `display_name` and `name_search` on PRs but the client goes wonky in that case, and there is no clean autocomplete for non-relational fields. So created a view which reifies labels, and that can be used as the basis for our search. It doesn't have to be maintained by hand, can be searched somewhat flexibly, we can add new view fields in the future if desirable, and it seems to work fine providing a nice understandable UX, with the reliability of using a normal Odoo model the normal way. Also fixed the handling of bump PRs, clearly clearing the entire field before trying to update existing records (even with a link_to inbetween) is not the web client's fancy, re-selecting the current label would just empty the thing entirely. So use a two-step process slightly closer to the release PRs instead: - first update or delete the existing bump PRs - then add the new ones The second part is because bump PRs are somewhat less critical than release, so it can be a bit more DWIM compared to the more deliberate process of release PRs where first the list of repositories involved has to be set up just so, then the PRs can be filled in each of them. Fixes #697	2023-01-25 12:25:45 +01:00
Xavier Morel	ae53c87fc9	[FIX] runbot_merge: allow adding and removing release PR lines In order to support partial freezing, we need the ability to remove some of the release lines for the repos we don't want to freeze (e.g. because they don't use per-version branches). This subsequently means we need the ability to create new lines if we fucked up and removed one we should not have. Alternatively the freeze meat-bot could cancel the entire thing and redo the wizard but that seems harsh and mean, so don't do that. Fixes `0f3647b7c7` which specifically mentioned partial freeze then proceeded to make them entirely impossible anyway. Part of #718	2023-01-25 12:25:45 +01:00
Xavier Morel	fb60c38731	[IMP] runbot_merge: add color key to freeze wizard Was difficult to understand what the colors meant on the required PRs. Part of #718	2023-01-25 12:25:45 +01:00
Xavier Morel	0f3647b7c7	[FIX] : freeze wizard take 3 Fixes to the new bits which didn't really work: - Fix borked view layout - Add some help to the label fields - Improve the resolution of label -> pr, and fix - Also make the feature actually work for bump PRs - Also make pr -> label work more reliably, now allows setting one PR and getting the other PRs of the same batch (with the same label) even without setting the label by hand An autocomplete for the label has been considered but there is no autocomplete field for char/selection fields, and it seems way too much work for the utility: - either create a brand new widget for 15.0 which will have to be entirely rewritten in 16 - or create a transient model composed entirely of fake records to provide an m2o to records which don't actually exist as label bearers, which is also a lot of unnecessary work NOTE: we want to support partial freezing (aka not freeze all the branches because some of them have different release models than others), so some project repos not* having a release PR is fine and normal, such a validation should not be added. Fixes #664	2022-12-08 10:46:22 +01:00
Xavier Morel	57162547e0	[FIX] runbot_merge: Odoo 15.0 + Py3.10 compat Turns out I was running "15.0" except just on the runbot, enterprise and community were still the 14.0 repos, so some of the changes were missing. While at it, bundle fixes for 3.10, as that's what Jammy needs, and the mergebot/15.0 will be running on that.	2022-11-17 10:30:04 +01:00
Xavier Morel	b86092de83	[IMP] : freeze wizard v3, freezer and wizarder Stop staging* release PRs: they are normally fairly simple and should not fail their staging outside of unreliable tests (or possibly a few edge cases e.g. forgot one version change thing), however staging them creates the possibility of a "version hole" on the release branch which is undesirable. Instead, immediately and unconditionally push the release commits onto the newly created branches, if there are things which don't work they can be fixed afterwards (and the process refined, maybe). Also add the same feature for bump PRs, with the difference that the bump PRs are not created / requested by default (they have to be opted in individually). For convenience, add a feature which automatically finds the PRs via inputting the label (not really tested yet). Closes #603	2022-08-05 15:35:51 +02:00
Xavier Morel	de70bd6f83	[IMP] runbot_merge: show PR titles in freeze wizard Currently limited to release/freeze PRs: it can be difficult to be sure the right PR was selected then, and a mistake there seems more impactful than in the PRs being waited for? Note: adds a test to make sure I don't break the check that all release PRs must have the same label (be linked). This was already safe, and in a way this PR adds convenience but not really safety, but better sure than sorry.	2022-02-08 12:28:10 +01:00
Xavier Morel	e2887a7473	[IMP] runbot_merge: allow only freezing a subset of a project - add flag to not select repos for freezing - allow removing more repositories from the wizard - when performing the freeze, only create branches for the selected repos	2022-02-07 15:15:13 +01:00
Xavier Morel	1add3d4854	[FIX] runbot_merge: freeze being triggered upon reopening the wizard The freeze wizard was implemented using a single action to open and validate the dialog. This was a mistake, as it means if there are no errors left (e.g. all the PRs being waited for are now validated) trying to view the freeze wizard will immediately validate it and commit the freeze, which is unexpected, surprising, and unsafe e.g. - open wizard - add freeze prs - add a required pr or two - close and go do something else - be told that more PRs need to be waited for - reopen wizard - oops freeze is done So split the "open action" part of `action_freeze` into opening the action and performing the freeze. The "freeze" / "view freeze" button on the project only activates the latter, and the actual freeze operation is only triggered from the wizard's "Freeze" button. Part of #559.	2022-02-07 13:23:41 +01:00
Xavier Morel	4da0f5df69	[ADD] runbot_merge: ~~tree~~ freeze wizard Provides a less manual interface for creating the freeze: * takes the name of the branch to create * takes any number of PRs which must be part of the freeze * takes PRs representing the HEADs of the new branches Then essentially takes care of the test. Implementation of the actual wizard is not trivial but fairly straightforward and linear, biggest issue is not being able to `project_id.branch_ids[1]` to get the new branch, not sure why but it seems to ignore the ordering, clearing the cache doens't fix it. When creating the branches, add a sleep after each one for secondary rate limiting purposes. Same when deleting branches. Also the forwardbot has been updated to disable the forwardport cron while a freeze is ongoing, this simplifies the freezing process. Note: after recommendation of @aab-odoo, tried using `_applyChanges` in `_checkState` but it simply did not work: the two relational fields got completely frozen and were impossible to update, which is less than ideal. Oh well, hopefully it works well enough like this for now.	2021-11-17 10:40:12 +01:00

23 Commits