diff options
author | Jonathan Gray <jsg@cvs.openbsd.org> | 2023-01-28 08:15:04 +0000 |
---|---|---|
committer | Jonathan Gray <jsg@cvs.openbsd.org> | 2023-01-28 08:15:04 +0000 |
commit | 1c5c7896c1d54abd25c0f33ca996165b359eecb3 (patch) | |
tree | bc5a10bbe89ecd4008a5bed7a029babc992f66b5 /lib/mesa/docs/relnotes | |
parent | c55dff45ea3a2c70e45601240b248017acc48198 (diff) |
Import Mesa 22.3.4
Diffstat (limited to 'lib/mesa/docs/relnotes')
-rw-r--r-- | lib/mesa/docs/relnotes/21.3.9.rst | 119 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.0.2.rst | 293 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.0.3.rst | 201 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.0.4.rst | 152 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.0.5.rst | 139 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.1.6.rst | 61 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.1.7.rst | 196 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.2.0.rst | 6219 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.2.1.rst | 227 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.2.2.rst | 145 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.3.0.rst | 5723 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.3.1.rst | 207 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.3.2.rst | 115 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.3.3.rst | 181 | ||||
-rw-r--r-- | lib/mesa/docs/relnotes/22.3.4.rst | 212 |
15 files changed, 14190 insertions, 0 deletions
diff --git a/lib/mesa/docs/relnotes/21.3.9.rst b/lib/mesa/docs/relnotes/21.3.9.rst new file mode 100644 index 000000000..d3aa2937a --- /dev/null +++ b/lib/mesa/docs/relnotes/21.3.9.rst @@ -0,0 +1,119 @@ +Mesa 21.3.9 Release Notes / 2022-06-08 +====================================== + +Mesa 21.3.9 is a bug fix release which fixes bugs found since the 21.3.8 release. + +Mesa 21.3.9 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 21.3.9 implements the Vulkan 1.2 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + 91fe6f20339884e37a7c47bfba42fb1cd397512c9ed5ad423de369f047cd8b5c mesa-21.3.9.tar.xz + + +New features +------------ + +- None + + +Bug fixes +--------- + +- [regression][bisected] MSVC: Build failure in libmesa_util when targeting x86 32-bit +- A crash in radeonsi driver + + +Changes +------- + +Alyssa Rosenzweig (2): + +- panfrost: Emulate GL_CLAMP on Bifrost +- pan/bi: Handle non-2D arrays + +Dave Airlie (2): + +- crocus: don't map scanout buffers as write-back +- intel/perf: use a function to do common allocations + +Emma Anholt (2): + +- vulkan: Make sure we've loaded our connectors when querying plane props. +- intel/perf: Move some static blocks of C code out of the python script. + +Eric Engestrom (3): + +- .pick_status.json: Update to 05d687723530ed3c5c9f7d0addb3b047138613a3 +- .pick_status.json: Update to fc381fa1e341bc9cf71f53a915dc47926a3f621f +- Revert "glx: Fix build errors with --enable-mangling (v2)" + +Erik Faye-Lund (1): + +- Revert "ci: downgrade sphinx to v3.x" + +Georg Lehmann (1): + +- radv: Don't hash ycbcr sampler base object. + +Ian Romanick (1): + +- intel/fs: Force destination types on DP4A instructions + +Icecream95 (2): + +- panfrost: Don't initialise the trampolines array +- panfrost: Optimise recalculation of max sampler view + +Jason Ekstrand (1): + +- anv: Allow MSAA resolve with different numbers of planes + +Jonathan Gray (1): + +- util: fix msvc build + +Lionel Landwerlin (4): + +- anv: fix variable shadowing +- anv: zero-out anv_batch_bo +- anv: emit timestamp & availability using the same part of CS +- anv: flush tile cache with query copy command + +Matt Turner (8): + +- intel/perf: Don't print leading space from desc_units() +- intel/perf: Deduplicate perf counters +- intel/perf: Use a function to initialize perf counters +- intel/perf: Use slimmer intel_perf_query_counter_data struct +- intel/perf: Store indices to strings rather than pointers +- intel/perf: Mark intel_perf_counter_* enums as PACKED +- intel/perf: Fix mistake in description string +- intel/perf: Destination array calculation into function + +Mike Blumenkrantz (9): + +- llvmpipe: fix occlusion queries with early depth test +- anv: fix xfb usage with rasterizer discard +- anv: fix CmdSetColorWriteEnableEXT for maximum rts +- anv: fix some dynamic rasterization discard cases in pipeline construction +- lavapipe: always clone shader nir for shader states +- gallivm: fix oob image detection for cube/1dArray/2dArray/cubeArray +- zink: flag sample locations for re-set on batch flush +- zink: force-add usage when adding last-ref tracking +- zink: only update usage on buffer rebind if rebinds occurred + +Pierre-Eric Pelloux-Prayer (1): + +- radeonsi: don't clear framebuffer.state before dcc decomp diff --git a/lib/mesa/docs/relnotes/22.0.2.rst b/lib/mesa/docs/relnotes/22.0.2.rst new file mode 100644 index 000000000..d909fb10e --- /dev/null +++ b/lib/mesa/docs/relnotes/22.0.2.rst @@ -0,0 +1,293 @@ +Mesa 22.0.2 Release Notes / 2022-04-21 +====================================== + +Mesa 22.0.2 is a bug fix release which fixes bugs found since the 22.0.1 release. + +Mesa 22.0.2 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.0.2 implements the Vulkan 1.2 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + df4fa560dcce6680133067cd15b0505fc424ca703244ce9ab247c74d2fab6885 mesa-22.0.2.tar.xz + + +New features +------------ + +- None + + +Bug fixes +--------- + +- Vulkan::Calling vkWaitForFences Timeout +- Intel (CHT) - Uplink text rendering bugged out in Mesa 22.0 +- gen9atom gpu hang on dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1 +- bad memory managment on panfrost RK3399 - cannot alocate more ram - fury unleashed +- [Intel, Iris] Broken rendering in Ryujinx on Tigerlake +- intel: integer_mad_hi / integer_mad_sat / integer_mul_hi produce invalid results +- Textures colors distortion in "Black Geyser: Couriers of Darkness" with radeonsi +- ShaderStorageBlocksWriteAccess not set for spir-v shaders? +- radeonsi dEQP-GLES3.functional.buffer.map.write.explicit_flush.* flake crashes +- radv: nir validation error with invalid array access +- Intel Iris Xe Geometry Flickering/Assets Disappearing +- Rendering artifacts when playing Outer Wilds [Reproducible with latest ANV driver built from main] +- Vulkan wsi leaks vk_sync object on every wsi_AcquireNextImageKHR call +- [bisected] panfrost(RK3399/T860): Emulationstation: broken, black or missing menus with v22.0.0 +- Plasma/KDE settings menus disappear on daily build +- Square Artifacts Dragons Dogma +- r300: Anmesia the dark descent corruption +- Error compiling with LLVM-git/15 + + +Changes +------- + +Alexey Bozhenko (1): + +- spirv: fix OpBranchConditional when both branches are the same + +Alyssa Rosenzweig (1): + +- panfrost: Process scissor state earlier + +Benjamin Cheng (1): + +- vulkan/queue: Destroy wait temps if they are skipped + +Boris Brezillon (1): + +- lavapipe: Don't use VK_OUTARRAY_MAKE()/vk_outarray_append() + +Caio Oliveira (2): + +- intel/fs: Initialize the sample mask in flags register when using demote +- intel/fs: Fix IsHelperInvocation for the case no discard/demote are used + +Cristian Ciocaltea (1): + +- ci: Avoid altering EXTRA_CARGO_ARGS environment variable + +Daniel Schürmann (1): + +- aco/optimizer: fix call to can_use_opsel() in apply_insert() + +Danylo Piliaiev (3): + +- turnip: Fix the lack of WFM before indirect draws +- turnip: enable has_ccu_flush_bug workaround for a660 +- turnip: Correctly store separate stencil in gmem store + +Dave Airlie (6): + +- radv: use flush vgt streamout like PAL does. +- gallivm/sample: detect if rho is inf or nan and flush to zero. +- llvmpipe: fix nr_sampler_view in key creation. +- draw: handle tess eval shader when getting num outputs +- u_blitter/stencil: take dstbox x/y into accounts for dst fb width +- util/stencil: fix stencil fallback blit shader texture types. + +Dylan Baker (20): + +- docs: add sah256 sum for mesa 22.0.1 +- .pick_status.json: Update to 5a459b8f6b0b59814865bc5869a868269f8f7ebf +- pick/core: have update_commits return the commit list +- bin/auto-pick: Add a script that automates applying commits +- .pick_status.json: Mark 3ef093f697ad9027ba514c7a4a6a10b7bd95bd47 as denominated +- core: don't pass the ui back into commit.apply +- .pick_status.json: Update to 33c800bf913771ddadb5bd632df46582f4d1c9f3 +- .pick_status.json: Update to 837f781c9a4f909ac1ee0b3b8331af87301b5fc3 +- .pick_status.json: Update to 7478b00c7cc7f7ec8cbe224ce600362f7fdb9411 +- .pick_status.json: Update to c7eaf0306876df53d3608ae3bf67afc272dcdf20 +- .pick_status.json: Mark 94706601fa2f52605d6e488f30fad9a0e2440612 as denominated +- .pick_status.json: Mark 174d086e8c818db4522731e845bd259e2baa8830 as denominated +- .pick_status.json: Mark 8b71118aa066ab125b7440912ed1e14da79c9451 as backported +- .pick_status.json: Mark 4e30da7874e6586c7ac94cd34560c782d0f9678d as backported +- .pick_status.json: Mark 7da211e24fe14d8b107461bdfcb3b8d67212f88a as backported +- .pick_status.json: Mark 8294d454245d4318befac7c8915e55db3d47cefd as backported +- .pick_status.json: Update to e509598470c000498c3e1328e012142d9047e292 +- .pick_status.json: Mark 2b688942c1b1f5ca4f3e4d33231579149aeb8fdb as denominated +- .pick_status.json: Mark dd783d7144932d79ef761810d46e45c3df7148cf as denominated +- .pick_status.json: Update to 9f44a264623461c98368185b023d99446676e039 + +Emma Anholt (13): + +- r600: Update the PS state before checking for cb_misc update. +- r600: Fix ordering of SSBO loads versus texturing. +- r600: Add a workaround and explanation for shadowcubearray TG4. +- vulkan: Make sure we've loaded our connectors when querying plane props. +- st/glsl-to-tgsi: Fix handling of csel(bool, vec, vec). +- r600: Disable SB in the presence of indirection on temp arrays. +- r600: Disable SB when INTERP_SAMPLE is used. +- r600: Implement memoryBarrier() in the non-SFN path. +- nir_to_tgsi: Fix emitting the sample number for non-array MSAA image access. +- nir_to_tgsi: Fix the address reg mapping for images and SSBOs to match G-T-T. +- turnip: Stop allocating unused pvtmem space in the pipeline CS. +- nir: Add lowering for fround_even on r300. +- nouveau/nir: Fix the inverted sense of usesSampleMaskIn. + +Erik Faye-Lund (3): + +- vbo/dlist: do not try to pad an empty draw +- d3d12: fix return-code without dxcompiler.dll +- vulkan: explicitly cast object-type enum + +Francisco Jerez (1): + +- intel/perf: Fix OA report accumulation on Gfx12+. + +Georg Lehmann (7): + +- nir/fold_16bit_sampler_conversions: Don't fold dest upcasts. +- radv: Add a vkCmdBuildAccelerationStructuresIndirectKHR stub. +- radv: Add more RT pipeline stubs. +- radv: Don't hash ycbcr sampler base object. +- nir/fold_16bit_sampler_conversions: Fix dest type mismatches. +- nir/fold_16bit_sampler_conversions: Fix src type mismatches. +- radv: Enable global bo list if 1.2 features are used. + +Ian Romanick (3): + +- intel/compiler: Fix sample_d messages on DG2 +- intel/fs: Force destination types on DP4A instructions +- nir: intel/compiler: Lower TXD on array surfaces on DG2+ + +Icecream95 (1): + +- panfrost: Stop overallocating compressed textures + +Jason Ekstrand (7): + +- v3dv/queue: Rework multisync_free +- nir: Dont set coord_components on txs +- lavapipe: Use VK_OUTARRAY for GetPhysicalDeviceQueueFamilyProperties[2] +- anv: Disallow blending on swizzled formats +- nir/lower_int64: Fix [iu]mul_high handling +- iris: Don't leak scratch BOs +- nir/opcodes: fisfinite32 should return bool32 + +Jordan Justen (1): + +- intel/dev: Add device info for RPL-P + +Kai Wasserbäch (2): + +- fix(FTBFS): clover: work around removal of PointerType::getElementType() +- fix(clover): FTBFS: Added missing include for ConstantInt for LLVM 15 + +Kenneth Graunke (2): + +- intel/compiler: Call inst->resize_sources before setting the sources +- iris: Add pre-draw flushing for stream output targets + +Lionel Landwerlin (9): + +- anv: fix variable shadowing +- anv: zero-out anv_batch_bo +- anv: implement EXT_tooling_info +- iris: don't synchronize BO for batch decoding +- anv: disable preemption on 3DPRIMITIVE on gfx12 +- anv: fix dynamic sample locations on Gen7/7.5 +- blorp: disable depth bounds +- intel: fix URB programming for GT1s +- anv: allow getting the address of the beginning of the batch + +Marcin Ślusarz (3): + +- anv: fix push constant lowering for task/mesh +- anv: invalidate all metadata in anv_nir_lower_ubo_loads +- intel/compiler: invalidate all metadata in brw_nir_lower_intersection_shader + +Michael Olbrich (1): + +- crocus: export GEM handle with RDWR access rights + +Mike Blumenkrantz (29): + +- draw: fix gs vertex stream counting +- radv: fix CmdSetColorWriteEnableEXT(attachmentCount==MAX_RTS) +- draw: fix nonzero stream primitives generated queries +- lavapipe: enforce monotonic timeline incrementing +- zink: force push descriptors cache update if hashing detects changes +- llvmpipe: handle sampling from 2d views of 3d images +- zink: unset resource layout+access when doing storage setup +- zink: adds refs to user index buffers when tc is not active +- zink: handle 0 ubos and 0 ssbos in pipeline layout +- zink: create pipeline layout if only bindless descriptor set is used +- zink: fix xfb analysis variable finding for arrays +- zink: track internal conditional render state +- zink: flag sample locations for re-set on batch flush +- lavapipe: fix typo in set_event execution +- mesa/st: add special casing for pointsize constant updating during validate +- intel/isl: fix 2d view of 3d textures +- zink: fix max geometry input component advertising +- nir/lower_tex: avoid adding invalid LOD to RECT textures +- zink: use the calculated last struct member idx for ssbo size in ntv +- zink: avoid creating ssbo variable types with multiple runtime arrays +- nir/lower_tex: fix rect queries with lower_rect set +- mesa/st: set normalized coords for RECT samplers if rects are unsupported +- zink: reject resource creation if format features don't match attachment +- zink: unconditionally set line width on rasterizer state change +- zink: support restart with PIPE_PRIM_LINES_ADJACENCY if ext is available +- zink: handle device-local unsynchronized maps +- util/draw: fix map size of indirect buffer in util_draw_indirect_read +- util/draw: handle draw_count=0 when reading indirect parameters +- util/draw: fix indirect draw count readback + +Pavel Ondračka (1): + +- r300: respect output_semantic_index when writing colors + +Pierre-Eric Pelloux-Prayer (6): + +- glsl/nir/linker: update shader_storage_blocks_write_access for SPIR-V +- gallium/u_threaded: late alloc cpu_storage +- gallium/tc: warn if an app is incompatible with cpu_storage +- gallium/tc: zero alloc transfers +- glsl/nir/linker: fix shader_storage_blocks_write_access +- drirc: enable radeonsi_zerovram for Black Geyser + +Qiang Yu (1): + +- nir/linker: disable varying from uniform lowering by default + +Rohan Garg (1): + +- iris: set a default EDSC flag + +Samuel Pitoiset (8): + +- radv,drirc: move RADV workarounds to 00-radv-defaults.conf +- radv: disable DCC for Fable Anniversary, Dragons Dogma, GTA IV and more +- radv: enable radv_disable_aniso_single_level for DXVK/vkd3d +- radv: fix cleaning the image view for CmdCopyImageToBuffer() +- radv: save/restore the stencil write mask during internal driver operations +- radv: suspend/resume queries during internal driver operations +- radv: save/restore the stencil reference during internal driver operations +- radv: fix initializing pipeline_key::topology for GFX9 and older + +Sviatoslav Peleshko (1): + +- mesa: flush bitmap caches when changing scissors or window rects state + +Timothy Arceri (1): + +- glsl: fix needs_lowering() call in varying packing pass + +Timur Kristóf (4): + +- aco: Fix VOP2 instruction format in visit_tex. +- nir: Handle out of bounds access in nir_vectorize_tess_levels. +- radv: Fix gs_vgpr_comp_cnt for NGG VS without passthrough mode. +- radv: Use correct buffer offset for conditional rendering. diff --git a/lib/mesa/docs/relnotes/22.0.3.rst b/lib/mesa/docs/relnotes/22.0.3.rst new file mode 100644 index 000000000..568aa4327 --- /dev/null +++ b/lib/mesa/docs/relnotes/22.0.3.rst @@ -0,0 +1,201 @@ +Mesa 22.0.3 Release Notes / 2022-05-04 +====================================== + +Mesa 22.0.3 is a bug fix release which fixes bugs found since the 22.0.2 release. + +Mesa 22.0.3 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.0.3 implements the Vulkan 1.2 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + 9f2b30f5276a9abaf71aafc6979685e2636189de1a87aea2c9e69744a6d0ebb9 mesa-22.0.3.tar.xz + + +New features +------------ + +- None + + +Bug fixes +--------- + +- freedreno: fragile location packing can break cull/clip distance +- turnip: dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.generated_args.* failures +- piglit ext_image_dma_buf_import.ext_image_dma_buf_import* regression +- piglit ext_image_dma_buf_import.ext_image_dma_buf_import* regression +- Grid Autosport via proton/vulkan: glitches on the grass on amdgpu, fine on amdgpu-pro +- [Bisected] r600: Big performance regression on some games +- Huge amount of anon_inode:sync_file file descriptor created +- anv regressions from commit 57445adc891 "anv: Re-enable CCS_E on TGL+" +- NIR validation failed after nir_opt_if +- intel: integer_mad_hi / integer_mad_sat / integer_mul_hi produce invalid results +- Aperture Desk Job rendering artifacts Intel TGL +- intel: integer_sub_sat produces stack corruption / OpISubBorrow compilation issue +- r300/rv530: "Black rendering" of Baldur's Gate Dark Alliance in Xbox emulator CXBX-R under wine with Gallium Nine and RADEON_DEBUG=use_tgsi +- radv/aco: xfb bug +- Occasional flicker corruption in Rage 2, e.g. after loading, with ACO on RX 5700 XT +- RADV: Quake 2 RTX v1.6.0 Corrupted Graphics +- '../src/amd/common/ac_surface.c:1575: void ac_copy_dcc_equation(const struct radeon_info \*, ADDR2_COMPUTE_DCCINFO_OUTPUT \*, struct gfx9_meta_equation \*): assertion "dcc->equation.gfx9.num_bits <= ARRAY_SIZE(equation->u.gfx9.bit)" failed' + + +Changes +------- + +Chia-I Wu (1): + +- util: Keep quiet NaNs quiet when converting to half float. + +Dave Airlie (2): + +- llvmpipe: add user memory resources to the debug global list. +- vulkan/wsi: keep allocate queue families in image, just don't fill them + +Dylan Baker (21): + +- docs: Add sha256sums for 22.0.2 +- .pick_status.json: Update to 5e84335fb578f03938468bb8ac0029f68349d330 +- .pick_status.json: Update to a6a4bf0f1eae36cb68d5c67653ac013fe0fbde8a +- .pick_status.json: Update to ef9d97ec1f9d9834db3066b5a7ec67ce6d4ecd86 +- .pick_status.json: Update to f17d3a125126dd7e0f1c2072545976cec1eeb1c8 +- .pick_status.json: Update to 1dc697942af7254798f83583f5788e1abf136d27 +- .pick_status.json: Mark d5870c45ae82c091e72f1258920aa54974e19a36 as denominated +- .pick_status.json: Mark f348103fce4484b891a58bf3147b71e3a3558135 as denominated +- .pick_status.json: Mark a4f502de3228ec37dfcaa38225077ec3709d74ea as denominated +- .pick_status.json: Mark 1d250b7b959d2a206280e4f83db05780a83e0927 as denominated +- .pick_status.json: Mark 4eca6e3e5d2b9426d6357e9af1514a0c942638f0 as denominated +- .pick_status.json: Mark 3dcb80da9d6d34e567680ef7fecd24aab4537e8a as denominated +- .pick_status.json: Mark dd078d13cb6b445ad02087a3e80433ef053b7490 as denominated +- .pick_status.json: Mark bbdf22ce136a7f02a8bd6c7e646d12a7e10db18d as denominated +- .pick_status.json: Mark 8806f444a5c005d20cd4ec4ed42e508ffd9cb1da as denominated +- .pick_status.json: Mark 373c8001d6615d70d112c7526ea4f4953257b6b9 as denominated +- .pick_status.json: Mark a056cbc691843861da417b5bcea9f871fdceeb19 as denominated +- .pick_status.json: Mark fc5edf9b68159288baa22c055e471de958812312 as denominated +- .pick_status.json: Mark e509598470c000498c3e1328e012142d9047e292 as denominated +- .pick_status.json: Update to 53a94fbdd57acbbe2bc745901cbfeca968e25273 +- .pick_status.json: Update to 2505afc430c6b151e7e4243503953ba0b0a2c842 + +Erik Faye-Lund (3): + +- meson: remove unused defines +- gallium/xlib: fix stale comment +- mesa: add missing error-path + +Filip Gawin (1): + +- r300: Print warning when stubbing derivatives + +Gert Wollny (1): + +- r600: Allow eight bit, three channel formats for vertex buffers + +Icecream95 (1): + +- panfrost: Fix pack_32_2x16 implementation + +Iván Briano (1): + +- vulkan/wsi/wayland: Fix double free on error condition + +Jason Ekstrand (4): + +- util/set: Respect found in search_or_add_pre_hashed +- nir: Lower all bit sizes of usub_borrow +- vulkan: Set signals[i].stageMask = ALL_COMMANDS for QueueSubmit2 wrapping +- spirv: Handle Op*MulExtended for non-32-bit types + +Jordan Justen (3): + +- vulkan/wsi/x11: Use atomic_int with sent_image_count +- iris: Fix assertion meant to only target the clear-color stride +- iris: Fix assertion meant to only target the clear-color stride + +Juan A. Suarez Romero (1): + +- mesa: unref syncobj after wait_sync + +Konstantin Seurer (2): + +- radv: Fix barriers with cp dma +- radv: Fix lowering ignore_ray_intersection + +Lionel Landwerlin (6): + +- nir/divergence: handle load_global_block_intel +- intel: fixup number of threads per EU on XeHP +- anv: fix acceleration structure descriptor template writes +- anv: skip acceleration structure in binding table emission +- anv: fix clflush usage on utrace copy batch +- anv: fix invalid utrace memcpy l3 config on gfx < 11 + +Marek Olšák (3): + +- nir: fix an uninitialized variable valgrind warning in nir_group_loads +- ac/surface: fix an addrlib race condition on gfx9 +- winsys/amdgpu: fix a mutex deadlock when we fail to create pipe_screen + +Mike Blumenkrantz (10): + +- wgl: support GL 4.6 +- zink: fix tcs control barriers for use without vk memory model +- zink: fix semantics mask for compute control barriers +- zink: add synchronization for buffer clears +- mesa/st: clamp GL_RENDERBUFFER to GL_TEXTURE_2D for sparse queries +- glsl/nir: set new_style_shadow for sparse tex ops as necessary +- zink: fix group memory barrier emission +- vulkan: bump layer api versions to current vk header version +- llvmpipe: disable compute statistics queries if queries are disabled +- llvmpipe: disable statistic queries during draw if queries are disabled + +Nanley Chery (2): + +- Revert "anv: Disable CCS_E for some 8/16bpp copies on TGL+" +- iris: Return a 64B stride for clear color plane + +Paulo Zanoni (1): + +- iris: fix race condition during busy tracking + +Pavel Ondračka (1): + +- r300: set PIPE_BIND_CONSTANT_BUFFER for const_uploader + +Pierre-Eric Pelloux-Prayer (1): + +- ac/surface: adjust gfx9.pitch[*] based on surf->blk_w + +Samuel Pitoiset (6): + +- radv: only apply enable_mrt_output_nan_fixup for 32-bit float MRTs +- aco: fix load_barycentric_at_{sample,offset} on GFX6-7 +- nir: fix marking XFB varyings as always active IO +- nir: mark XFB varyings as unmoveable to prevent them to be remapped +- radv: fix handling divisor == 0 with dynamic vertex input state +- radv: enable radv_disable_sinking_load_input_fs for Grid Autosport + +Sviatoslav Peleshko (1): + +- anv: workaround apps that assume full subgroups without specifying it + +Tapani Pälli (1): + +- isl: disable mcs (and mcs+ccs) for color msaa on DG2 + +Timothy Arceri (1): + +- nir: fix sorting before assigning varying driver locations + +Yevhenii Kolesnikov (1): + +- nir: Remove single-source phis before opt_if_loop_last_continue diff --git a/lib/mesa/docs/relnotes/22.0.4.rst b/lib/mesa/docs/relnotes/22.0.4.rst new file mode 100644 index 000000000..683d72ed8 --- /dev/null +++ b/lib/mesa/docs/relnotes/22.0.4.rst @@ -0,0 +1,152 @@ +Mesa 22.0.4 Release Notes / 2022-05-19 +====================================== + +Mesa 22.0.4 is a bug fix release which fixes bugs found since the 22.0.3 release. + +Mesa 22.0.4 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.0.4 implements the Vulkan 1.2 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + c7971f58fa826e474617cda53250c6600fce60994b9fac5b641522e21f471ed4 mesa-22.0.4.tar.xz + + +New features +------------ + +- None + + +Bug fixes +--------- + +- turnip: gfxbench mh3.1 offscreen crash with ANGLE +- [radv] DCC causes artifacts in Senran Kagura Shinovi Versus +- Radeon RX 5700XT crash when using prusa-slicer +- clover: Assertion \`NumContainedTys && "Attempting to get element type of opaque pointer"' failed + + +Changes +------- + +Alyssa Rosenzweig (1): + +- nir: Don't set writes_memory for reading XFB + +Chia-I Wu (1): + +- anv: advertise rectangularLines only for Gen10+ + +Danylo Piliaiev (3): + +- tu: Fix indices of drm_msm_gem_submit_cmd when filling them +- tu: Do not flush ccu in clear/blits during renderpass +- pps: Open writable renderer node in DrmDevice::create + +Dylan Baker (17): + +- docs: Add sh256sum for mesa 20.0.3 +- .pick_status.json: Update to 7f91e8fad94dd34f83c6a124dbbe5d210be7715f +- .pick_status.json: Update to 14b1ed1ce105d42652f70e2fd13c90fc4f2e7ffc +- .pick_status.json: Mark 69e6417e19793043e4bb3cd500cfcf377bce4c03 as denominated +- .pick_status.json: Mark c025cb9ee9d79ebfb66a577556e04deecfe012ed as denominated +- .pick_status.json: Mark 6317f88b044501354a052064478d5b43dfe41809 as denominated +- .pick_status.json: Mark 5ff3fa5912778adb8117fa26bfe4786b583e741b as denominated +- .pick_status.json: Mark 9a412c10b7a96adf71c9a2ca44a0abca75de1c49 as denominated +- .pick_status.json: Mark 0e49ef5c9f2ec34567613226ad498edca28bce88 as denominated +- .pick_status.json: Mark ae369e9f6d4f2c826f1f2a748c32a14f9d5f1f54 as denominated +- .pick_status.json: Update to 14b1ed1ce105d42652f70e2fd13c90fc4f2e7ffc +- .pick_status.json: Mark 8c1d9c7b744b2e5b40fd42cfd51256b16deea6a8 as denominated +- .pick_status.json: Mark f1d1371e512e32d03f7f54d873020e589ee67d47 as denominated +- .pick_status.json: Update to 57293dee2b11ba7e52052edc4d0437f08db19144 +- .pick_status.json: Update to 5c90eb1c53f46e86717c6bf4d5253dd23c4dac1f +- .pick_status.json: Mark 5a3aee78cbb70918b413cdd40dffcae7c9e97d8c as denominated +- ci: Some panfrost tests are unexpectedly passing. + +Georg Lehmann (1): + +- nir/opt_algebraic: Fix mask in shift by constant combining. + +Icecream95 (6): + +- nir/lower_tex: Copy more fields in lower_tex_to_txd and friends +- pan/mdg: Keep min_bound at 16 when alignment requires it +- pan/mdg: Use MAX2 to set min_alignment +- pan/mdg: Fix mask usage when filling before a spill +- pan/mdg: Return the instruction from mir_insert_instruction_*_scheduled +- pan/mdg: Fix multiple spilt writes in the same bundle + +Jason Ekstrand (4): + +- vulkan/wsi: Set the right stage flags for semaphore waits +- gallium/u_threaded_context: Use PIPE_MAX_SHADER_SAMPLER_VIEWS for sampler_buffers +- nir/cf: Return a cursor from nir_cf_extract as well +- nir: Preserve metadata if remove_dead_derefs makes no progress + +Lionel Landwerlin (4): + +- nir/cf: return cursor after insertion of cf_list +- nir/lower_shader_calls: don't use nop instructions as cursors +- nir/lower_shader_calls: don't insert code after break/continue +- nir/lower_shader_calls: put inserted instructions into a dummy block + +Marcin Ślusarz (2): + +- anv: update task/mesh distribution with the recommended values +- anv: disable streamout before emitting mesh shading state + +Marek Olšák (2): + +- ac/llvm: set the correct cache policy for sparse buffer loads +- radeonsi: fix a crash when failing to create a context + +Mike Blumenkrantz (14): + +- util/blitter: fix sampler restore with 0 saved samplers +- st/draw_feedback: set constant buffer stride +- llvmpipe: always set ssbo data pointers for draw +- gallivm: fix oob txf swizzling +- zink: fix up sparse texture sampling for shadow samplers +- zink: clamp out min_lod operands for explicit lod ops +- zink: fix sparse binding for arrayed textures +- zink: fix sparse texture depth calcs for arrayed textures +- zink: remove refs from shader images +- zink: reuse local res pointer in set_shader_images +- zink: simplify dumb update flagging in set_shader_images +- zink: read shader image r/w usage from incoming data struct +- zink: copy incoming shader image struct after doing updates +- zink: stop leaking shader image surfaces + +Pavel Ondračka (1): + +- r300: guard for unsigned underflow when unrolling loops + +Pierre-Eric Pelloux-Prayer (2): + +- radeonsi: don't use wave32 for GE on gfx10 if culling is used +- amd: fix ac_build_mbcnt_add in wave32 mode + +Rhys Perry (1): + +- aco/ra: fix vgpr_limit + +Samuel Pitoiset (3): + +- radv: re-emit dynamic line stipple state if the primitive topology changed +- radv: disable instance packing to fix pipeline query stats +- radv: disable DCC for Senra Kagura Shinovi Versus + +Thomas Debesse (1): + +- gallium/clover: pass -no-opaque-pointers to Clang diff --git a/lib/mesa/docs/relnotes/22.0.5.rst b/lib/mesa/docs/relnotes/22.0.5.rst new file mode 100644 index 000000000..bb76924cf --- /dev/null +++ b/lib/mesa/docs/relnotes/22.0.5.rst @@ -0,0 +1,139 @@ +Mesa 22.0.5 Release Notes / 2022-06-01 +====================================== + +Mesa 22.0.5 is a bug fix release which fixes bugs found since the 22.0.4 release. + +Mesa 22.0.5 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.0.5 implements the Vulkan 1.2 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + 5ee2dc06eff19e19b2867f12eb0db0905c9691c07974f6253f2f1443df4c7a35 mesa-22.0.5.tar.xz + + +New features +------------ + +- None + + +Bug fixes +--------- + +- nir_lower_phis_to_regs_block: Assertion \`src->src.is_ssa' failed. +- Build failure on sparc +- Signal 6 (Aborted) when opening GTK3 applications +- radv: Conditional rendering on compute queue +- anv: line failure +- panfrost midgard - major issues with MelonDS emulator - not able to trace properly too +- Nheko misrendering on RK3399 (Mali T860) + + +Changes +------- + +Daniel Schürmann (1): + +- aco: fix spilling of phis without temp operands + +Dave Airlie (2): + +- meson: add build-id to pipe libraries +- lavapipe: fix depth bias offset flag enables. + +Dylan Baker (6): + +- docs: add sha256sum to 22.0.4 notes +- .pick_status.json: Update to 95d4e5435bf63239105a50370ebbbbdeb7fed2f8 +- .pick_status.json: Update to ee2278de654853f1d6107e6ef6fc8537a668866e +- .pick_status.json: Update to 9773ed1dc74b98019640d909cdc2b5d6cffae85a +- .pick_status.json: Update to 5067a26f4432ae5e9690e70ef2498ea24798593e +- .pick_status.json: Update to 1c63a0beafefccec599ddbe355ec5a8b61df6438 + +Erik Faye-Lund (4): + +- editorconfig: remove scons-ism +- editorconfig: remove html-rule +- editorconfig: remove m4-rule +- editorconfig: remove pl-rule + +Filip Gawin (1): + +- r300: keep negation if w is an inline constant + +Hans-Kristian Arntzen (1): + +- radv: Fix RTPSO hashing of pGroups. + +Icecream95 (5): + +- panfrost: Copy blend constant into variant even when reusing it +- nir: Add store_combined_output_pan BASE back +- pan/bi: Read base for combined stores +- pan/mdg: Read base for combined stores +- panfrost: Only write depth / stencil once if MRT is used + +Jason Ekstrand (6): + +- intel/fs: Copy color_outputs_valid into wm_prog_data +- anv: Drop alpha_to_coverage from the NULL FS optimization +- anv: Handle the null FS optimization after compiling shaders +- anv: Don't disable the fragment shader if XFB is enabled +- nir: Handle register sources in lower_phis_to_regs_block +- glsl/nir: Stop leaking varyings_info + +Matt Turner (1): + +- mesa: Drop dead #include "sparc/sparc.h" + +Michael Skorokhodov (1): + +- anv: Update line range + +Mike Blumenkrantz (13): + +- spirv: fix barrier scope assert +- zink: only reject non-blittable resource creation if it has no other binds +- zink: add extra validation for resource creation +- zink: remove input attachment usage when pruning surface usage +- zink: add extended usage and retry if resource creation fails +- zink: fix surface/bufferview cache comparisons +- zink: force render target usage for sampler-only resources +- zink: fix non-dynamic vertex stride update flagging +- turnip: fix assert for max xfb outputs +- zink: require draw params ext/cap in ntv if drawid is used +- zink: emit spirv cap for subgroup vote ops +- zink: drop wideLines requirement +- zink: drop largePoints requirement + +Pierre-Eric Pelloux-Prayer (2): + +- glx: set max values for pbuffer width / height +- radeonsi: don't use sel->nir in si_check_blend_dst_sampler_noop + +Qiang Yu (1): + +- radeonsi: lower nir_intrinsic_sparse_residency_code_and + +Rhys Perry (1): + +- aco: fix p_constaddr with a non-zero offset + +Samuel Pitoiset (1): + +- radv: fix writing buffer markers with non-zero memory offset + +Timur Kristóf (1): + +- radv: Implement conditional rendering for async compute queue. diff --git a/lib/mesa/docs/relnotes/22.1.6.rst b/lib/mesa/docs/relnotes/22.1.6.rst index e69de29bb..c5cc21404 100644 --- a/lib/mesa/docs/relnotes/22.1.6.rst +++ b/lib/mesa/docs/relnotes/22.1.6.rst @@ -0,0 +1,61 @@ +Mesa 22.1.6 Release Notes / 2022-09-22 +====================================== + +Mesa 22.1.6 is a bug fix release which fixes bugs found since the 22.1.5 release. + +Mesa 22.1.6 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.1.6 implements the Vulkan 1.2 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + TBD. + + +New features +------------ + +- None + + +Bug fixes +--------- + +- anv: sampleLocationsEnable ignored in VkPipelineSampleLocationsStateCreateInfoEXT + + +Changes +------- + +Dave Airlie (2): + +- llvmpipe: make last_fence a screen/rast object not a context one. +- llvmpipe: keep context list and use to track resource usage. + +Dylan Baker (3): + +- docs: Add sha256 sum for 22.1.5 +- docs: Add sha256 sum for 22.1.4 +- docs: add release notes for 22.1.6 + +Eric Engestrom (1): + +- Revert "pan/bi: Require ATEST coverage mask input in R60" + +Mark Janes (1): + +- intel/dev: drop warning for unhandled hwconfig keys + +Mykhailo Skorokhodov (1): + +- anv: Use sampleLocationsEnable for sample locations diff --git a/lib/mesa/docs/relnotes/22.1.7.rst b/lib/mesa/docs/relnotes/22.1.7.rst index e69de29bb..e940146b6 100644 --- a/lib/mesa/docs/relnotes/22.1.7.rst +++ b/lib/mesa/docs/relnotes/22.1.7.rst @@ -0,0 +1,196 @@ +Mesa 22.1.7 Release Notes / 2022-09-22 +====================================== + +Mesa 22.1.7 is a bug fix release which fixes bugs found since the 22.1.6 release. + +Mesa 22.1.7 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.1.7 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + TBD. + + +New features +------------ + +- None + + +Bug fixes +--------- + +- [r300] - osu! - Shader optimizations cause black and glitchy rendering +- [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available +- anv: invalid returns from vkCreateDescriptorPool +- turnip: depth/stencil regressions +- freedreno / mesa 22.1.3 crashes Firefox Nightly +- Trine 4 causing GPU HANG: ecode 12:1:849f7804, in vkr-ring-7 +- [RadeonSI] Teamfortress 2 crashing frequently with mesa 22.1.1 +- Unity game segfaults often +- Alien Isolation segfaulting in radeonsi_dri.so starting with Mesa 22.1.0 + + +Changes +------- + +Alyssa Rosenzweig (2): + +- pan/decode: Fix overrun decoding planes +- agx: Fix packing of samplers in texture instrs + +Axel Davy (6): + +- frontend/nine: Skip invalid swvp calls +- frontend/nine: Fix buffer tracking out of bounds +- frontend/nine: Fix ATOC handling +- frontend/nine: Fix cso restore bug +- frontend/nine: Fix shader multi-use crash +- frontend/nine: Fix ff position_t fallback when w = 0 + +Charmaine Lee (1): + +- mesa/st: fix reference to nir->info after nir_to_tgsi + +Connor Abbott (2): + +- tu: Treat CP_WAIT_FOR_ME as a cache invalidate +- tu: Fix sysmem depth attachment clear flushing + +Dave Airlie (1): + +- draw: don't touch info values that aren't valid. + +Dmitry Osipenko (1): + +- virgl: Fix unmapping of blob resources + +Dylan Baker (6): + +- .pick_status.json: Update to 56f331951984e698e4b947d2ad6e34a8444bd62c +- .pick_status.json: Update to a3bf0da1cbd4b10043c80bf44609a3024b5fcc36 +- .pick_status.json: Update to 24b9ad7cd5ebc7cfa5d03cf0f243ea4841c971b9 +- .pick_status.json: Update to 74fc367127ccf945f4c649dd6ddff955c802e36e +- .pick_status.json: Mark 11ab6087797f805cf158048915c67945613c9a72 as denominated +- docs: add release notes for 22.1.7 + +Emma Anholt (1): + +- zink: Make sure that we keep the existing ici pNext chain on inserts. + +Eric Engestrom (4): + +- docs/features.txt: mark VK_ANDROID_native_buffer as supported on turnip +- docs/features.txt: mark VK_ANDROID_native_buffer as supported on v3dv +- bin/gen_release_notes.py: bump advertised vulkan version to 1.3 +- vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties + +Erik Faye-Lund (4): + +- gallium/hud: do not use texture-rect for font +- docs: fixup link to virgl docs +- zink: add have_D32_SFLOAT_S8_UINT boolean +- zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking + +Filip Gawin (1): + +- r300: don't read from output transform_r300_vertex_SEQ/SNE + +Georg Lehmann (1): + +- aco/ir: Fix swapped nle. + +Gert Wollny (1): + +- virgl: honor host caps in shader cash sha evaluation + +Jesse Natalie (2): + +- egl/wgl: Delete unused variables/code +- egl/wgl: Fix some awkward sizeof formatting + +Lionel Landwerlin (2): + +- anv: fixup PIPE_CONTROL restriction on gfx8 +- anv: don't return incorrect error code for vkCreateDescriptorPool + +Liviu Prodea (1): + +- meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover + +Marcin Ślusarz (3): + +- intel/compiler: emit URB fences for TASK/MESH +- intel/compiler: there are 4 types of fences on gfx >= 12.5 +- intel/compiler: insert URB fence before task/mesh termination + +Marek Olšák (5): + +- util/u_threaded: fix a CPU storage crash due to an invalid codepath taken +- glthread: don't ignore glPushAttrib/glPopAttrib when tracking GL_CULL_FACE +- radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 +- glthread: unbind framebuffers in glDeleteFramebuffers +- glthread: call _mesa_glthread_DeleteBuffers unconditionally + +Mike Blumenkrantz (15): + +- lavapipe: fix stencil clears +- util/blitter: respect PIPE_TEXTURE_RECT +- d3dadapter: fix sw pipe loading +- nine: set view_mask=0 +- zink: fix gfx program cache pruning with generated tcs +- zink: don't flag lazy push constant set dirty on batch change +- zink: handle mutable swapchain images with dmabuf +- zink: init cache_put program fence on program creation +- zink: drop mode_changed check from linewidth/depthbias draw updates +- nine: check return on resource_get_handle +- zink: demote dmabuf tiling to linear if modifiers aren't supported +- zink: use modifier_aspect to check for modifier plane in zink_resource_get_param +- zink: don't add modifiers if EXT_image_drm_format_modifier isn't present +- zink: add all format modifiers when adding for dmabuf export +- nir/validate: clamp unsized tex dests to 32bit + +Mykhailo Skorokhodov (2): + +- iris: Move Wa_1806527549 and enable by default +- anv: Move Wa_1806527549 and enable by default + +Nanley Chery (2): + +- anv: Make the D16 reg mode single-sampled +- iris: Make the D16 reg mode single-sampled + +Pavel Ondračka (1): + +- r300: fix variables detection for paired ALU and TEX instructions in different branches + +Pierre-Eric Pelloux-Prayer (1): + +- amdgpu/bo: update uses_secure_bos when importing buffers + +Qiang Yu (1): + +- nir/lower_gs_intrinsics: fix primitive count for points + +Rhys Perry (1): + +- aco: fix LdsBranchVmemWARHazard with 2+ branch chains + +Rob Clark (1): + +- freedreno/drm: Fix potential bo cache vs export crash + +sjfricke (1): + +- isl: fix bug where sb.MOCS is not being set diff --git a/lib/mesa/docs/relnotes/22.2.0.rst b/lib/mesa/docs/relnotes/22.2.0.rst new file mode 100644 index 000000000..fd9ee516b --- /dev/null +++ b/lib/mesa/docs/relnotes/22.2.0.rst @@ -0,0 +1,6219 @@ +Mesa 22.2.0 Release Notes / 2022-09-21 +====================================== + +Mesa 22.2.0 is a new development release. People who are concerned +with stability and reliability should stick with a previous release or +wait for Mesa 22.2.1. + +Mesa 22.2.0 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.2.0 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + b1f9c8fd08f2cae3adf83355bef4d2398e8025f44947332880f2d0066bdafa8c mesa-22.2.0.tar.xz + + + +New features +------------ + +- WGL_ARB_create_context_robustness + +- d3d12 ARB_robust_buffer_access_behavior + +- VK_EXT_robustness2 for lavapipe + +- VK_EXT_image_2d_view_of_3d on RADV + +- zink and d3d12 GL_EXT_memory_object_win32 and GL_EXT_semaphore_win32 support + +- vertexAttributeInstanceRateZeroDivisor support for lavapipe + +- panfrost Valhall support (conformant OpenGL ES 3.1 on Mali-G57) + +- VK_EXT_primitives_generated_query on RADV + +- VK_EXT_non_seamless_cube_map on RADV, ANV, lavapipe + +- VK_EXT_border_color_swizzle on lavapipe, ANV, turnip, RADV + +- VK_EXT_shader_module_identifier on RADV + +- variablePointers on lavapipe + +- VK_EXT_multisampled_render_to_single_sampled on lavapipe + +- VK_EXT_shader_subgroup_vote/ballot on lavapipe + +- EGL_KHR_context_flush_control for all drivers + +- GLX_ARB_context_flush_control for all drivers + +- VK_EXT_attachment_feedback_loop_layout on RADV + + +Bug fixes +--------- + +- turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to "movmsk" reordered with "kill" +- [radeonsi] Graphical corruption in Valve Source-based games +- Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO +- nouveau: corruption on NVAC after switching to the NIR backend +- zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip +- Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built +- Abort due to "Unknown intrinsic: intrinsic store_deref" (firefox, radeonsi) +- turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635 +- regression from !17949 +- r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled +- r600/sfn: register allocation fails for a number of piglits and deqp tests +- DG2 corruption in gnome desktop from F36 +- radeonsi: error allocating sparse buffers.. bug? +- [BISECTED/radeonsi] Flickering triangles with wayfire cube +- ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion \`fence->signalled' failed for iris when disabled UTIL_FUTEX_SUPPORTED +- turnip: gfxbench aztec ruins crashes on zink +- glsl: textureGatherOffset param broken +- v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c +- radeonsi: driver crashes with kwin-effects-cube +- [Turnip] Final Fantasy X: Incorrect hair rendering +- turnip: gfxbench car chase crash on zink +- [r300] - osu! - Shader optimizations cause black and glitchy rendering +- [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available +- anv: invalid returns from vkCreateDescriptorPool +- [Bisected][RadeonSI] Blender crashes on startup +- turnip: depth/stencil regressions +- llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages +- Poor performance/utilization with Splitgate +- freedreno / mesa 22.1.3 crashes Firefox Nightly +- [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3 +- dzn: Build failure when it's the only driver built - 22.2-devel regression +- d3d12: Unresolved external symbols from Version.lib +- turnip: Geometry shader never invoked if there is no EmitVertex and gl_Position is not written +- Trine 4 causing GPU HANG: ecode 12:1:849f7804, in vkr-ring-7 +- turnip: GPU fault on clear +- intel_nullhw.c:179:28: error: passing 'const void \*const' to parameter of type 'void \*' discards qualifiers [-Werror,-Wincompatible-pointer-types-discards-qualifiers] +- macOS undefined symbol _glapi_tls_Context build error +- Build on macOS regressed +- Regression: virgl: The switch to NTT breaks "Total War: Warhammer" +- macOS's LIST_ENTRY macro conflicts with ours +- src/util/disk_cache_os.c:234:21: error: too many arguments provided to function-like macro invocation +- [RadeonSI] Teamfortress 2 crashing frequently with mesa 22.1.1 +- Unity game segfaults often +- Alien Isolation segfaulting in radeonsi_dri.so starting with Mesa 22.1.0 +- turnip: dEQP-VK.subgroups.ballot_broadcast.framebuffer.* has failures on older deqp-vk +- !17062 broke XTS xts5/Xlib9/XDrawImageString(16) with Xephyr using glamor on llvmpipe +- DG2 full BAR broken by low BAR? +- crocus: SSBOs regressed +- Homerun Clash game triggers assert in compiler +- radv: line rasterization mode affects multisampling +- turnip: caselist fail +- vtn: mediump_16bit_alu and compatibility +- radv: dynamic vertex input failure +- navi21\\radv\\amdgpu Halo Infinite screen corruption and then hang +- ci/freedreno: a618_vk 1/7 timeouts +- Windows OpenGL32.dll without shared-glapi currently broken +- anv: KHR-GL46.tessellation_shader.single.xfb_captures_data_from_correct_stage fails on TGL +- perfetto: crash on context teardown / driver unload +- panfrost shader classification is wrong in some cases +- kopper: the latest awfulness exposed by piglit glx-multi-window-single-context +- android: r600 building error after commit 79ca456b +- Error compiling gallium-nine on i686 using musl libc +- [anv][regression][bisected][bsw,gen9atom] dEQP-VK.memory.mapping.dedicated_alloc failing on bsw and gen9atom +- anv: GTF-GL46.gtf32.GL3Tests.packed_pixels.packed_pixels_pbo failure +- r600: Tesseract does not render properly with the new NIR backend +- radv: some zink cts fails +- anv: ICL hiz issue +- [RADV, bisected] addrlib changes cause Navi CTS regressions +- r300: Segfault in shaders/closed/steam/kerbal-space-program/1017.shader_test +- Get rid of debug_assert() +- nine: docs? +- d3d12: Gallium d3d12 backend performance is slow +- [RADV][bisected] after commit 124848bf9efa1cc5d8a3cdfc654e0c03d57540e3 the games which use vulkan API not starting +- freedreno: skqp instability from GPU hangs +- ci/a618: intermittent network failure +- turnip: EXT_primitives_generated_query broken +- Godot crashes with RX5500m +- intel: Some Alan Wake shader fails validation due to bad register usage in EOT message +- r600/sfn: Add support for fmulz +- radv: failure in image.sample_texture.*_compressed_format* +- radeonsi: Hang on RX 5500 XT on latest mesa git +- blob_align assert triggers +- radv: Enabling support for instance extensions inside vkCreateInstance does not work +- zink: rendering artifacts in Portal 2 +- a630: AmnesiaTDD trace gpu faults +- freedreno build issue - FAILED: src/freedreno/decode/crashdec.txt +- segfaults under X11 since the Ubuntu update to 22 +- anv: renderdoc explosion on icl +- turnip: EXT_depth_clip_enable broken +- lava: test result parsing fail +- turnip: LRZ fast clear +- [ANV] GTA IV flickering +- [TGL] 25-100% perf impact on SynMark2 with 2GB dynamic pool size +- [Crocus] - Minecraft with Forge and JourneyMap - Graphical glitches +- intel - Minecraft char bar is broken after Mesa upgrade +- [wine] RAGE: texture problems +- "Unsupported modifier, resource creation failed." on stderr +- [RADV/ACO] Star Citizen Flickeriing Lights +- anv: query regressions +- ir3: Infinite loop in scheduler when splitting +- ci/lava: "Listened to connection" spam +- ci: docs build fail +- turnip: 3d copy/clear/... ops should not be counted by occlusion query +- anv: sampleLocationsEnable ignored in VkPipelineSampleLocationsStateCreateInfoEXT +- turnip: wrong border color for Z24S8 +- aco_tests fail with llvm-15 +- Clang 14 issues compiling src/util/u_cpu_detect.c +- eglExportDMABUFImageQueryMESA crashes on amdgpu gallium/drivers/radeonsi +- crocus: GL_SELECT regressed on hsw +- Random texture flickering in Minecraft +- panfrost: Broken interaction between CRC and tile size +- panfrost: allow_forward_to_be_killed is too conservative +- APU downgraded to OpenGL 3.0 since UBO size change in radeonsi +- [anv, bisected, regression] Pipeline statistics query segfaults +- radv/NIR: Suboptimal codegen for AccessChain on buffer device addresses +- Regression - vah264dec + glimagesink causes green taint on entire video with mesa>=22.0.0 +- va: AMD: Wrong buffer size of P010 Luma plane +- 625b352f14b broke glcts on Navi1x +- Use marketing_name for the Vulkan device string in RADV +- Gallium driver crashes due to fd double close in iris driver +- radv: Khronos ray tracing sample issues +- r300: copy propagate does not correctly account for control flow +- r300: Amnesia the Dark Descent slowdown since NIR loop unrolling +- r300: some unused uniforms are not removed with the NIR varying linker (compared to old glsl varying linker) +- Mesa currently FTBFS on x32 architecture +- Runescape randomly crashes. malloc_consolidate(): unaligned fastbin chunk +- tu: viewport assert +- d3d12: point coord regression due to NIR IO changes +- nir_lower_phis_to_regs_block: Assertion \`src->src.is_ssa' failed. +- Redream emulator showing artifacts +- Build failure on sparc +- r600 video playback broken +- r300: hundreds of tests fail since "clear color buffers using color from a constant buffer" +- turnip: shader viewport handling +- Signal 6 (Aborted) when opening GTK3 applications +- radv: Conditional rendering on compute queue +- anv: line failure +- 32 bit build failure +- d3d12 build regression: d3d12video.h: error: expected ‘;’ at end of member declaration +- panfrost midgard - major issues with MelonDS emulator - not able to trace properly too +- Intel/anv: The cloak is flickering in Batman:Arkham Knight +- regression: "NIR based varying linker" breaks some xwayland/glamor applications with etnaviv +- pvr: build failure on archs with unsigned long != uint64_t +- turnip: dEQP-GLES31.functional.copy_image.non_compressed.viewclass_32_bits.* fails with angle +- Nheko misrendering on RK3399 (Mali T860) +- intel_pps_driver.cc:308:74: error: too many arguments to function ‘void intel_perf_query_result_accumulate_fields(intel_perf_query_result*, const intel_perf_query_info*, const void*, const void*, bool) +- radv: no nir validation in radv_link_shaders +- r300: deqp regressions from "glsl: enable the use of the nir based varying linker" +- [crocus] glmark2 -b jellyfish rendering issues +- radeonsi-stoney: Failing jobs during a CI bug +- iris: skqp gles_atlastext flake +- turnip: use-after-free in shader compilation +- anv: EXT_primitives_generated_query fails +- r300: vs predicate logic fails in vs-loop-complex-unroll-with-else-break.shader_test +- [Intel/Vulkan] White blinking artifacts in Wolfenstein Youngblood +- radv: Indirect unaligned dispatch +- [radv] DCC causes artifacts in Senran Kagura Shinovi Versus +- Radeon RX 5700XT crash when using prusa-slicer +- glcts proxy_textures_invalid_samples tests assert on iris +- turnip: gfxbench mh3.1 offscreen crash with ANGLE +- dEQP-GLES31.functional.shaders.builtin_functions.integer.[i|u]mulextended.* fail +- Radv leaks a descriptor set layout +- radeonsi: KHR_debug thread safety +- [radeon, rv370] Running piglit shaders@glsl-vs-raytrace-bug26691 test causes hard lockup & reboot +- clover: Assertion \`NumContainedTys && "Attempting to get element type of opaque pointer"' failed +- wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes causes memory leak in Apex Legends +- v3d: port v3dv workaround for GFXH-1918 +- freedreno: fragile location packing can break cull/clip distance +- turnip: dEQP-VK.spirv_assembly.instruction.compute.float_controls.fp16.generated_args.* failures +- Panfrost: wrong textures and lighting in "Thief" games in wine (not happening when using Software render) +- [Bisected] r600: Big performance regression on some games +- turnip: report corresponding drmFormatModifierPlaneCount for planar format +- piglit ext_image_dma_buf_import.ext_image_dma_buf_import* regression +- Huge amount of anon_inode:sync_file file descriptor created +- enable warnings for GNU extensions we don't want +- Grid Autosport via proton/vulkan: glitches on the grass on amdgpu, fine on amdgpu-pro +- NIR validation failed after nir_opt_if +- Etnaviv: The triangle is lost during linking together triangle strips with degenerate triangles +- intel: integer_mad_hi / integer_mad_sat / integer_mul_hi produce invalid results +- Aperture Desk Job rendering artifacts Intel TGL +- intel: integer_sub_sat produces stack corruption / OpISubBorrow compilation issue +- r300/rv530: "Black rendering" of Baldur's Gate Dark Alliance in Xbox emulator CXBX-R under wine with Gallium Nine and RADEON_DEBUG=use_tgsi +- radv/aco: xfb bug +- '../src/amd/common/ac_surface.c:1575: void ac_copy_dcc_equation(const struct radeon_info \*, ADDR2_COMPUTE_DCCINFO_OUTPUT \*, struct gfx9_meta_equation \*): assertion "dcc->equation.gfx9.num_bits <= ARRAY_SIZE(equation->u.gfx9.bit)" failed' +- Occasional flicker corruption in Rage 2, e.g. after loading, with ACO on RX 5700 XT +- Vulkan::Calling vkWaitForFences Timeout +- turnip: bad gmem rendering with PUBG mobile on angle +- Intel (CHT) - Uplink text rendering bugged out in Mesa 22.0 +- gen9atom gpu hang on dEQP-VK.spirv_assembly.instruction.graphics.float16.arithmetic_1 +- bad memory managment on panfrost RK3399 - cannot alocate more ram - fury unleashed +- radv,aco: Regression with global atomics with negative offsets + + +Changes +------- + +Adam Jackson (32): + +- docs: Note EGL enum allocation for EGL_EXT_present_opaque +- docs: Update supported drivers for 22.x +- glx/kopper: Wire up a way for SwapBuffers to fail +- wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes +- kopper: Grow a swap interval API +- glx/kopper: Enable GLX_EXT_swap_control etc. +- egl/kopper: Hook up eglSwapInterval +- wsi/x11: Hook up KHR_incremental_present +- dri: Inline dri2InvalidateDrawable into its one caller +- dri: Move driCopySubBufferExtension into drisw +- dri: move driQueryRendererIntegerCommon near its one caller +- dri: Move driC{reate,oncat}Configs near their only caller +- dri: Move dri{Get,Index}ConfigAttrib near their vtable +- dri: Remove what's left of utils.[ch] +- glx/dri2: Require DRI2 >= 1.3 for working buffer invalidation +- dri2: Require a loader with working buffer invalidation +- zink: Print the VkResult if vkCreateInstance fails +- glx/dri: Fix DRI drawable release at MakeCurrent time +- mesa: Stub out GL_APPLE_object_purgeable +- glx: Remove the remains of GLX_MESA_multithread_makecurrent +- gallium/dri: Remove the memory of GLX_MESA_multithread_makecurrent +- glx: Remove some can't-happen checks from protocolDestroyDrawable +- glx: Remove unused __glXSetupForCommand from __glXIsDirect +- glx: Remove excess screen parameter from CreateContext +- glx/dri2: Stop tracking the protocol version in the display state +- glx: Fix return values from __glXGetDrawableAttribute +- glx: Fix drawable refcounting for naked Windows +- kopper: Implement {EGL,GLX}_EXT_buffer_age +- wsi/x11: Synchronously check for error from xcb_present_pixmap +- mesa: Remove Purgeable fields from texture/buffer/renderbuffer objects +- llvmpipe: Indentation fix +- gallium/dri2: Enable {GLX_ARB,EGL_KHR}_context_flush_control + +Alejandro Piñeiro (22): + +- v3dv/pipeline: fix small comment typo +- v3dv/pipeline_cache: adds check to skip searching for a entry +- v3dv/pipeline_cache: add on disk cache hit stats +- v3dv/pipeline: include pipeline layout on the pipeline sha1 +- v3dv/bo: unmap any bo that goes to the bo cache +- v3dv/format: no need for GetPhysicalDeviceFormatProperties +- v3dv/format: Add support for VK_KHR_format_feature_flags2 +- nir: get res binding using component 0, instead of asssumig an uint +- v3dv/pipeline: call nir_lower_explicit_io after first nir optimization loop +- v3dv/pipeline: expand nir_optimize, drop st_nir_opts +- v3dv/device: add comment on depthClamp feature +- broadcom: add no_merge_jobs debug option +- broadcom: document which debug options are specific for a given driver +- broadcom/compiler: wrap nir_convert_to_lcssa with NIR_PASS_V +- broadcom/compiler: use NIR_PASS for nir_lower_vars_to_ssa at v3d_optimize_nir +- nir: call nir_metadata_preserve at nir_remove_unused_io_vars +- v3dv/pipeline: use nir_shader_instructions_pass when lowering layout +- broadcom/compiler: removed unused function +- broadcom/compiler: make several passes to return a progress +- v3d/v3dv: use NIR_PASS(_ +- vc4/v3d: stop adding NORAST when SHADERDB debug option is used +- vc4/v3d: restore calling debug_get_option_vc4/v3d_debug + +Alessandro Astone (5): + +- kopper: Only include XCB headers on supported platforms +- Android.mk: Cleanup mesa3d-lib +- Android.mk: Make mesa3d-lib work with absolute path meson outputs +- Android.mk: Generate the dummy source in local-generated-sources-dir +- Android.mk: Intermediate output paths may already be absolute + +Alexander Kanavin (1): + +- swrast_kms: use swkmsDRI2Extension instead of driDRI2Extension + +Alexey Bozhenko (2): + +- intel/compiler: fix singleton pointer coverity warning +- spirv: fix OpBranchConditional when both branches are the same + +Alyssa Rosenzweig (454): + +- panfrost: Add 48-bit and 96-bit tiling routines +- panfrost: Remove bpp check for tiling +- panfrost: Only handle 2D in should_linear_convert +- panfrost: Implement tiled 3D resource access +- panfrost: Only tile for 2D dimensions +- panfrost: Allow tiling all texture targets +- panfrost: Assert alignment in tiling routine +- panfrost: Document arguments to tiling routines +- panfrost: Unit test u-interleaved tiling routines +- docs/panfrost: Document u-interleaved tiling +- panfrost: Rework dirty tracking for Valhall +- panfrost: Dirty track batch masks +- panfrost: Rework PAN_GPU_ID mechanism +- docs/panfrost: Document drm-shim usage +- meson: Ignore unused variables in release builds +- pan/bi: Simplify bi_propagate_pass_flags +- panvk: Respect line width +- panvk: Consider primitive restart in index buffer walks +- pan/bi: Implement fquantize2f16 +- panvk: Handle empty shaders gracefully +- panvk: Lower shared memory +- util: Add a generic worklist implementation +- nir: Use u_worklist to back nir_block_worklist +- pan/bi: Implement pack_uvec[24]_to_uint +- panfrost: Add texture features enum to v9.xml +- pan/blit: Compile blit shaders without IDVS +- pan/blit: Generalize texture alignment +- pan/blit: Prepare for Valhall port +- pan/blit: Support v9 data structures +- panfrost: Don't allocate storage for PSIZ on Valhall +- panfrost: Compile libpanfrost for v9 +- panfrost: Add helper to emit UBOs +- panfrost: Port uniform/UBO logic to Valhall +- panfrost: Don't fix up alpha test on Bifrost +- panfrost: Use common state emit for compute jobs +- panfrost: Adapt compute job emit for Valhall +- panfrost: Specialize rasterizer state for Valhall +- panfrost: Simplify attribute format expression +- panfrost: Specialize vertex elements for Valhall +- panfrost: Specialize ZSA state for Valhall +- panfrost: Specialize shader descriptors for Valhall +- panfrost: Use dirty flags to emit v9 descriptors +- panfrost: Generate Valhall Malloc IDVS jobs +- panfrost: Compile for v9 +- panfrost: Kick off v9 support in the driver +- panfrost: Advertise all textures in drm-shim +- panfrost: Split pan_layout.c from pan_texture.c +- panfrost: Always use 64-bit SD with strides +- panfrost: Add afbc_superblock_{size, width, height} helpers +- panfrost: Use panfrost_afbc_superblock_width +- panfrost: Don't pretend to support multiplane AFBC +- panfrost: Extract panfrost_afbc_is_wide helper +- panfrost: Use pan_block_size in layout calculation +- panfrost: Unify panfrost_block_size paths +- panfrost: Unify paths through image_layout_init +- panfrost: Remove unused dev argument +- panfrost: Unit test block size queries +- panfrost: Simplify how image_layout_init is called +- panfrost: Add a test for pan_image_layout_init +- asahi: Workaround broken GLSL compiler +- agx: Implement some shader-db stats +- agx: Note that RA proceeds in dominance-order +- agx: Track write registers more accurately +- agx: Match order for designated initializers +- agx: Use correct types for some IR enums +- agx: Wrap compiler header in extern "C" +- agx: Add unit test infrastructure +- agx: Use pseudo ops for mov/not/and/xor/or +- agx: Implement simple copyprop +- agx: Dynamically allocate agx_instr->src +- agx: Add p_split pseudoinstruction +- agx: Add phi pseudo instruction +- agx: Add a hash table for vector extracts +- agx: Add helper to emit combines +- agx: Add helper to emit splits +- agx: Use a transfer graph for parallel copies +- agx: Lower p_split after RA +- agx: Optimize p_split(kill) specially +- agx: Emit splits for intrinsics +- agx: Use extract helper for tex internally +- agx: Remove else optimization +- agx: Use a dynarray for predecessors +- agx: Add predecessor index helper +- agx: Mark the logical ends of blocks +- agx: Add agx_after_block_logical helper +- agx: Introduce worklist infrastructure +- agx: Adapt liveness analysis for SSA +- agx: Make DCE dumber +- agx: Translate phi nodes +- agx: Copy prop into phi nodes +- agx: Don't lower phis in NIR +- agx: Lower phi nodes to parallel copies +- agx: Split up RA from post-RA lowering +- agx: Always use hash table for extracts +- agx: Unit test parallel copy lowering +- agx: Remove identity moves +- agx: Add validation pass +- agx: Update RA comment +- agx: Remove has_liveness +- agx: Remove nir_register support +- agx: Lower mediump flat shading +- panvk: Ignore point size for !points +- panvk: Use correct point size limits +- panfrost: Only flip point sprites for GL +- panvk: Streamline no shader RSD case +- panvk: Gate rasterization on !discard, not fs req +- panvk: Emit fragment RSDs even with no shader +- panfrost: Remove ancient unused code +- panfrost: Call tgsi_to_nir earlier +- panfrost: Don't subclass pipe_compute_state +- panfrost: Use s->info.stage instead of tgsi stage +- panfrost: Simplify shader key architecture +- panfrost: Refactor variant selection code +- panfrost: Refactor variant rebind code +- panfrost: Lower point sprites on Bifrost +- panfrost: Lower user clip planes +- panfrost: Flip point coords in hardware +- pan/va: Generalize message? check for asm +- pan/va: Add missing TABLE (SFU) instructions +- pan/va: Mark NOT instructions as deprecated +- pan/va: Add VAR_TEX_BUF_GRADIENT instruction +- pan/va: Add non-IDVS VAR_TEX instructions +- panvk: Lower global vars to local +- panvk: Allocate stack for each thread +- pan/bi: Handle texture offset + index +- panvk: Lower image deref atomics +- panvk: Only write depth when depth test enabled +- pan/decode: Print row strides, not line strides +- panvk: Remove unused layout structs +- panvk: Report row_stride in GetImageSubresourceLayout +- panfrost: Add helpers to work with legacy strides +- panfrost: Use row stride for explicit layouts +- panfrost: Rename away from bytes_per_pixel +- panfrost: Use row stride to calculate slice size +- panfrost: Adapt get_param for row strides +- panfrost: Use row_stride even for linear resources +- lima,panfrost: Use row stride for tiling routines +- panfrost: Remove line_stride +- panfrost: Unify row stride and AFBC row stride +- panfrost: Unit test stride calculations +- panfrost: Unit test "from legacy" helper +- pan/bi: Rename bi_block->name to bi_block->index +- pan/bi: Count blocks +- pan/bi: Add u_worklist wrapper macros +- pan/bi: Use bi_worklist in analyze_helper_requirements +- pan/bi: Use bi_worklist for liveness +- pan/bi: Use bi_worklist for post-RA liveness +- pan/bi: Use worklist for scoreboard analysis +- pan/bi: Use a dynarray for predecessors +- nir: Don't set writes_memory for reading XFB +- panfrost: Relax image check +- pan/va: Serialize memory stores +- pan/bi: Align spilled registers on Valhall +- pan/bi: Add option to test spilling +- pan/bi: Assert that blend shaders may not spill +- panfrost: Correct XML for TLS +- panfrost: Use emit_tls +- panfrost: Use packed TLS on Valhall +- panfrost/ci: Smoke test spilling +- panfrost: Don't use VARYING_SLOT_TEX0 internally +- panvk: Don't use VARYING_SLOT_TEX0 internally +- pan/bi: Rework varying linking on Valhall +- panfrost: Increase the maximum tiler heap size +- mesa/st: Fix building tests on macOS +- panvk: Stub pipeline cache using the common code +- panvk: Conform viewport code to Vulkan spec +- panvk: Call nir_opt_trivial_continues +- pan/va: Add some whitespace to Valhall disassembly +- pan/va: Add whitespace after disassembled branches +- agx: Restore Valve copyright header +- pan/bi: Don't analyze td on Valhall +- pan/bi: Emit LEA_TEX on Valhall +- pan/bi: Support image loads on Valhall +- pan/bi: Preserve AXCHG.i32 destination +- pan/bi: Handle shared atomic exchange on Valhall +- pan/bi: Handle shared/scratch on Valhall +- pan/bi: Support atomics on Valhall +- pan/bi: Ensure the end NOP isn't eliminated +- asahi: Fix hangs waiting on the notification queue +- pan/bi: Update bi_count_write_registers for Valhall +- pan/va: Mark more source sizes +- pan/va: Lower split 64-bit sources +- pan/va: Use 64-bit lowering for texturing +- pan/bi: +JUMP can't read same-cycle temp +- pan/bi: Lower phis to scalar +- pan/bi: Use value-based interference with LCRA +- pan/bi: Add source/destination counts +- pan/bi: Add collect and split instructions +- pan/bi: Add bi_before_block cursor +- pan/bi: Lower split/collect before RA +- pan/bi: Fix mov and pack_32_2x16 +- pan/bi: Expand MAX_DESTS to 4 +- pan/bi: Create COLLECT during isel +- pan/bi: Lift split/collect cache from AGX +- pan/bi: Simplfy BLEND emit +- pan/bi: Emit collect and split +- pan/va: Use split for 64-bit lowering +- pan/va: Don't use bi_word in FAU unit test +- pan/bi: Scalarize bi_lower_swizzle +- pan/bi: Scalarize bi_opt_cse +- pan/bi: Scalarize modifier propagation +- pan/bi: Scalarize copyprop +- pan/bi: Remove bi_word and bi_word_node +- pan/bi: Simplify register precolouring in the IR +- pan/bi: Remove liveness metadata tracking +- pan/bi: Don't propagate discard +- pan/bi: Optimize split of collect +- pan/bi: See through splits for var_tex fusion +- pan/bi: Validate preload constraints are satisfied +- pan/bi: Validate vector widths +- pan/va: Handle 2-src blend in lower_split_src +- pan/va: Remove DISCARD.f32 destination +- pan/va: Use ^ instead of ` to indicate last-use +- asahi: Run lower_blend for all fragment shaders +- asahi: Use z24_in_z32f lowering +- asahi: Validate depth/stencil formats +- asahi: Don't store to unbound render targets +- asahi: Handle uncompressed Z32F depth buffers +- asahi: Pass depth/stencil clear values to firmware +- asahi: Flush on partial clears +- asahi: Handle 3rd deflake address +- asahi: Account for every byte of the cmdbuf +- asahi: Identify more depth pipeline fields +- asahi: Dump "unknown 4a" structure +- asahi: Add XML used for visibility tests +- asahi: Relax Draw "command" check +- asahi: Note unknown field +- asahi: Identify depth bias array +- asahi: Identify "set depth bias index" field +- asahi: Identify depth bias enable bit +- asahi: Add a depth bias structure +- asahi: Implement polygon offset +- asahi: Identify partial render pipeline +- asahi: Sync some names with Project Zero +- asahi: Add segment list pretty printer +- asahi: Split unknown field in segment list header +- asahi: Validate segment list length +- asahi: Compute segment list length +- asahi: Align segment list unknowns with Metal +- asahi: Decode resource allocation responses +- asahi: Decode resource allocation requests +- asahi: Identify suballocation mechanism +- asahi: Identify suballocated size +- asahi: Drop some unknowns +- asahi: Sync some more unknown fields +- asahi: Improve render target dumping +- asahi: Fix pipeline decoding +- asahi: Handle RECT textures as 2D +- asahi: Allocate a larger segment list +- asahi: Guard against encoder overflows +- asahi: Add wrap library +- docs/asahi: Add initial driver docs +- panfrost: Simplify the kmsro create path +- panfrost: Don't allocate SHARED as scanout +- pan/bi: Recoalesce tied operands after spilling +- pan/bi: Schedule for pressure pre-RA +- pan/bi: Allow CSEing LEA_BUF_IMM +- panfrost: Fix crash with GALLIUM_HUD +- panfrost: Extract afbc_stride_blocks helper +- panfrost: Extract pan_afbc_row_stride helper +- panfrost: Fix is_wide return type +- panfrost: Add pan_afbc_tile_size helper +- panfrost: Support tiled AFBC in stride helpers +- panfrost: Align layouts to tiles of superblocks +- panfrost: Add AFBC stride unit tests +- panfrost: Add 3D texture layout unit test +- panfrost: Calculate header_size based on row_stride +- panfrost: Add a tiled 16x16 layout unit test +- panfrost: Add 1x1 layout unit tests +- panfrost: Fix AFBC flags on v6 +- panfrost: Put comment in correct #ifdef +- panfrost: Handle AFBC Tiled +- panfrost: Add helper checking tiled AFBC support +- panfrost: Gate tiled AFBC on GPUs that support it +- panfrost: Advertise 16x16 tiled AFBC +- panfrost: Use tiled AFBC textures +- pan/bi: Fuse result types +- pan/bi: Test CMP result_type optimization +- panfrost: Allow FPK on Valhall +- panfrost: Set overdraw_alpha[01] flags +- panfrost: Set allow_rotating_primitives +- pan/bi: Don't allow spilling coverage mask writes +- pan/bi: Add VAR_TEX fusing unit test +- pan/va: Fill in missing src_flat16 enum +- pan/bi: Make LD_VAR w=format instead of w=vecsize +- pan/bi: Model Valhall source formats +- pan/bi: Interpolate varyings at 16-bit +- pan/bi: Extend bi_scoreboard_state for finer tracking +- pan/bi: Add slot to bi_instr +- pan/bi: Preserve flow control for non-psiz variant +- pan/bi: Add ASSERT_SHADER_EQUAL macro +- pan/bi: Add a trivial ctx->inputs for unit tests +- pan/bi: Extract bit_block helper +- pan/bi: Export bi_block_add_successor +- pan/bi: Export helper termination analysis +- pan/bi: Print flow control on instructions +- pan/va: Add pass to insert flow control +- pan/va: Do scoreboard analysis +- pan/va: Terminate helper threads +- pan/va: Add flow control merging pass +- pan/va: Assign slots roundrobin +- pan/va: Respect assigned slots +- pan/va: Unit test flow control insertion +- pan/va: Unit test flow control merging +- pan/bi: Use flow control lowering on Valhall +- pan/mdg: Scalarize with 64-bit sources +- pan/mdg: Remove uppercase write masks +- pan/mdg: Use "<<" instead of "lsl" +- pan/mdg: Only print masked components of swizzle +- pan/mdg: Only print 2 sources for ALU +- pan/mdg: Only print 1 source for moves +- pan/mdg: Distinguish SSA vs reg when printing IR +- pan/mdg: Print outmods when printing IR +- panfrost: Fix XML for AFBC header on v9 +- panfrost: Pass arch instead of dev into afbc_format +- panfrost: Add pan_afbc_compression_mode on Valhall +- panfrost: Support rendering to AFBC on Valhall +- panfrost: Handle texturing from AFBC on Valhall +- panfrost: Allow 2D AFBC on Valhall +- panvk: Use vk_image_subresource_*_count for clears +- pan/bi: Respect swizzles in nir_op_pack_64_2x32_split +- pan/bi: Allow vec6 for collects +- pan/bi: Align 64-bit register sources +- pan/bi: Rework Valhall register alignment +- pan/bi: Fix clper_xor on Mali-G31 +- panfrost/ci: Mark draw_buffers_indexed.* as flakes +- panfrost: Use C11 static_assert for enums +- nir: Add transform feedback system values +- nir: Export nir_io_add_intrinsic_xfb_info +- pan/bi: Add transform feedback lowering pass +- pan/bi: Handle transform feedback intrinsics +- panfrost: Don't allow vertex shaders to have side effects +- panfrost: Wire up transfrom feedback sysvals +- panfrost: Create transform feedback shaders +- panfrost: Launch transform feedback shaders +- pan/va: Do not insert NOPs into empty shaders +- panfrost: Extract panfrost_get_index_buffer helper +- panfrost: Don't calculate min/max indices on v9 +- pan/bi: Extract MUX to CSEL optimization +- pan/va: Implement more lanes +- pan/va: Lower MUX to CSEL where possible +- pan/va: Add MUX lowering tests +- pan/bi: Implement b2i with MUX +- pan/va: Record which instructions are signed +- pan/va: Try negating small constants when lowering +- pan/va: Try widening small constants +- pan/va: Unit test constant lowering pass +- panfrost: Enable Mali-G57 +- docs: Document Mali-G57 conformance +- panfrost: Fix inverted indirect dispatch condition +- pan/bi: Implement load_push_constant +- pan/bi: Don't reorder push with no_ubo_to_push +- panfrost: Use push constants for indirect dispatch +- panfrost: Use push constants for indirect draws +- panvk: Use push constants for clear +- panvk: Use push constants for copy shaders +- panfrost: Deduplicate indirect dispatch structs +- panfrost: Remove redundant first_tag access +- panfrost: Remove is_64b assignments +- panfrost: Add panfrost_query_l2_slices helper +- pan/perf: Fix performance counters on G57 +- mesa,gallium: Make point coord origin a CAP +- asahi,agx: Don't set PIPE_CAP_POINT_COORD_ORIGIN_UPPER_LEFT +- panvk: Drop support for Midgard +- panfrost: Allow pixels using discard to be killed +- panfrost: Inline pan_fbd_has_zs_crc_ext +- panfrost: Disable CRC at <16x16 tile sizes +- panvk: Don't specialize clear shaders for RT +- panvk: Remove unused pushmaps +- panvk: Clear Z/S attachments without a shader +- panvk: Fix stencil clears of combined ZS images +- panvk: Simplify depth clear preload condition +- panvk: Use common CmdBeginRenderPass +- panfrost: Disable QUADS on Valhall +- panfrost: Fix batch state changes on Valhall +- panfrost: Add tristate_is_defined helper +- panfrost: Implement provoking vertices on Valhall +- pan/va: Move VA_NUM_GENERAL_SLOTS to common +- pan/va: Simplify insert flow tests +- pan/va: Workaround quirk of barrier handling +- pan/va: Unit test barrier handling +- pan/bi: Mark bi_postra_liveness_ins as MUST_CHECK +- pan/bi: Constify bi_is_staging_src argument +- pan/va: Add header guards to valhall_enums.h +- pan/va: Move va_flow_is_wait_or_none to common +- pan/va: Include BLEND for va_swap_12 +- pan/va: Use validate_register_pair for BLEND pack +- pan/va: Mark last register reads +- pan/va: Unit test va_mark_last +- pan/bi: Fix LD_BUFFER.i16 definition +- pan/bi: Align accesses with packed TLS +- pan/va: Lower FADD_RSCALE.f32 to FMA_RSCALE.f32 +- pan/va: Implement sample positions FAU packing +- pan/va: Rename RSCALE to LDEXP +- pan/bi: Remove FRSCALE from IR +- pan/bi: Model MKVEC.v2i8 +- pan/bi: Constant fold MKVEC.v2i8 +- pan/va: Pack MKVEC.v2i8 byte lanes +- pan/va: Replace MKVEC.v4i8 with MKVEC.v2i8 +- pan/bi: Tune lower_vars_to_scratch +- panfrost: Enable more FP16 caps on Valhall +- panfrost: Enable CAP_INDIRECT_TEMP_ADDR on Valhall +- panfrost: Bump ESSL_FEATURE_LEVEL on Valhall +- v3d: Drop workaround for u_blitter bug +- agx: Handle loop { if { loop { .. } } } +- r600g: Remove streamout-based buffer copy path +- u_blitter: Remove util_blitter_copy_buffer +- lima,panfrost: Do not ralloc contexts against the screen +- pan/va: Handle terminal barriers +- ttn: Set nir->info.separate_shader +- pan/bi: Fix unpack_32_2x16 definition +- pan/va: Add missing <roundmode/> to V2F32_TO_V2F16 +- pan/bi: Implement f2f16{_rtz, _rtne} +- panfrost: Separate core ID range from core count +- panfrost: Move bifrost_lanes_per_warp to common +- panfrost: Fix PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS +- panfrost: Fix PIPE_COMPUTE_CAP_SUBGROUP_SIZE +- panfrost/ci: Disable 0ad trace on T860 +- panfrost: Clear with a quad to avoid flushing +- panfrost/ci: Disable T760 jobs +- nir/opt_shrink_vectors: Round to supported vec size +- panfrost: Revert provoking vertex assertion +- panfrost: Only key points to point coord origin +- pan/va: Dump unencodable instructions +- pan/va: Use invalid_instruction in more places +- pan/va: Print instructions with pack assert fails +- pan/va: Handle BIFROST_MESA_DEBUG=nosb +- panfrost: Make pan_select_max_tile_size O(1) +- panfrost: Use implementation-specific tile size +- panfrost: Protect pandecode by a mutex +- pan/bi: Require ATEST coverage mask input in R60 +- pan/va: Stall after ATEST +- panfrost: Fix shader_modifies_coverage on Valhall +- panfrost: Add decoupled early-ZS helpers +- panfrost: Unit test early-ZS helpers +- panfrost: Add zsa->zs_always_passes flag +- panfrost: Use early-ZS helpers +- util: Enable list_assert in debugoptimized builds +- util: Gate simple_mtx_assert_locked on !NDEBUG +- agx: Port minifloat tests to GTest +- pan/bi: Assert that we use the correct CLPER +- pan/bi: Rename CLPER_V6.i32 to CLPER_OLD.i32 +- pan/decode: Fix overrun decoding planes +- nir/lower_idiv: Be less creative about signs +- agx: Fix packing of samplers in texture instrs +- panfrost: Don't segfault on unknown models +- pan/bi: Don't reorder image loads across stores +- pan/bi: Don't allow ATEST to take a temporary +- pan/mdg: Print 3 sources for CSEL +- pan/bi: Fix dual texturing with uniforms +- pan/bi: Fix out-of-bounds write in va_lower_split_64bit +- pan/bi: Consider all dests in helper_block_update +- agx: Fix float copyprop of neg(neg) case +- panfrost: Respect buffer offset for OpenCL + +Andres Calderon Jaramillo (2): + +- nir: Account for YUV range. +- mesa/st: Account for YUV color space and range. + +Andres Gomez (5): + +- radv/ci: update vkd3d-proton results for AMD's Kabini +- ci: install a more recent version of Wine in the test-vk image +- ci: install gfxinfo-mupuf in the test-vk image +- ci: move b2c artifacts to its own directory +- ci: add wrapper script for Valve's traces runner + +Anuj Phogat (2): + +- include/uapi: Update drm_fourcc.h from kernel +- isl,iris: Add I915_FORMAT_MOD_4_TILED support for XeHP + +Arvind Yadav (7): + +- nir: Add a lowering pass for polygon and line smoothing +- radeonsi: Add nir_lower_poly_line_smooth pass for polygon and line smoothing +- nir: add load_point_coord_maybe_flipped intrinsics for point smoothing +- nir: Add a lowering pass for point smoothing +- ac/llvm : Adding Number of all interpolated inputs in ac_shader_abi +- ac/llvm: Implement nir_intrinsic_load_point_coord_maybe_flipped opcodes +- radeonsi: Enable nir_lower_point_smooth lowering pass for point smoothing + +Autumn on Tape (5): + +- gallivm: add subgroup shuffle support +- lavapipe: enable subgroup shuffle operations +- gallivm: use shufflevector for shuffles when index is constant data +- gallivm: use VPERMPS (x86/AVX2) for 32-bit 8-element shuffles +- lavapipe: enable subgroup quad operations + +Axel Davy (6): + +- frontend/nine: Skip invalid swvp calls +- frontend/nine: Fix buffer tracking out of bounds +- frontend/nine: Fix ATOC handling +- frontend/nine: Fix cso restore bug +- frontend/nine: Fix shader multi-use crash +- frontend/nine: Fix ff position_t fallback when w = 0 + +Bas Nieuwenhuizen (39): + +- vulkan/wsi/x11: Ensure we have the required number of images for acquire. +- radv: always enable NGG on GFX11 +- radv: Mark all formats as DCC compatible with each other on gfx11. +- radv: gfx11 register changes. +- radv: Adjust for new SWIZZLE_ENABLE. +- radv: Add gfx11 DCC fast clear support. +- radv: Disable transform feedback for gfx11. +- radv: update the initialization of SGPR0/1 registers for HS and GS on GFX11 +- radv: Deal with derefs from opaque types in function parameters. +- radv: Add performance counter reg write. +- radv: move ac_perfcounters to physical_device. +- radv: Add pstate locking for perfcounters. +- radv: Rename SPM functions. +- radv: Make inhibiting clockgating not static. +- radv: Add helper BO for perf counters. +- radv: Add perf counter lock/unlock commandbuffers. +- radv: Add a list of performance counters. +- radv: Implement support for querying performance counters. +- radv: Implement listing performance counters. +- radv: Expose VK_KHR_performance_query. +- radv: Add ability to override the build id for the cache. +- radv: Add a 32bit memory type. +- radv: Skip setting empty index buffers to avoid hang +- radv: Expose function to write vertex descriptors for dgc. +- radv: Always store stride in the vbo descriptor. +- radv: Require 32bit memory for indirect buffers. +- radv: Expose helper for base pa_su_sc_mode_cntl. +- radv: Make radv_get_vgt_index_size non-static. +- radv: Add helper to write scissors. +- radv: Add flushing for DGC. +- radv: Add DGC meta shader. +- radv: Implement DGC generated command layout structure. +- radv: Implement DGC cmdbuffer generation. +- radv: Implement CmdExecuteGeneratedCommandsNV. +- radv: Add stub for vkCmdBindPipelineShaderGroupNV. +- radv: Expose VK_NV_device_generated_commands. +- vulkan/wsi: Take max extent into consideration for modifier selection. +- amd/common: Don't rely on DCN support checks with modifiers. +- amd/common: Disable DCC retile modifiers on RDNA1 + +Ben Skeggs (3): + +- nv50/ir: fix OUT.FINAL on ampere +- nv50/ir: fix scope of memory ops on ampere +- nv50/ir: fix ALD/AST where address comes from AL2P + +Benjamin Cheng (5): + +- radv: fix memory leak of descriptor set layout +- radv: refactor image binding into struct +- radv: consolidate descriptor set buffer counts +- radv: implement disjoint memory for multiplane images +- radv: expose disjoint image support + +Boris Brezillon (159): + +- dzn: Skip binding entries with zero descriptors +- dzn: Make sure sampler heaps don't contain more than 2048 samplers +- microsoft/compiler: Add a dunmmy SV_SampleIndex when needed +- microsoft/spirv_to_dxil: Allow forcing per-sample shading +- dzn: Add support for sampleRateShading +- dzn: Pass the right type to CreateCommandList() in the reset path +- dzn: refactor error-handling +- ci/windows: Add a variable to globally disable jobs using windows runners +- dzn: Use core helpers to fill physical device features/properties +- dzn: Report actual device limits +- d3d12: Move the D3D12ResourceState code to the gallium driver +- ci/windows: Split the windows-related tests +- dzn: Add support for EXT_vertex_attribute_divisor +- dzn: Rename a bunch of dzn_descriptor_set helpers +- dzn: Pass layout objects to various dzn_descriptor_set_ptr_xxx helpers +- dzn: Split the write desc helpers in two halves +- dzn: Add support for KHR_descriptor_update_template +- microsoft/spirv_to_dxil: Add a draw_id field to vertex_runtime_data +- dzn: Fix exec_buf_stride calculation +- dzn: Simplify the triangle_fan_rewrite_index shader +- dzn: Update the draw_id sysval when issuing indirect draws +- dzn: Advertise KHR_shader_draw_parameters +- dzn: Pass an image view to dzn_cmd_buffer_clear_attachment() +- dzn: Set MAX_RTS to D3D12_SIMULTANEOUS_RENDER_TARGET_COUNT +- dzn: Move all render-related states to a substruct +- dzn: Inline dzn_cmd_buffer_clear_attachments() +- dzn: Add support for dynamic rendering +- dzn: Get rid of the render pass logic +- dzn: Allow multiDrawIndirect and drawIndirectFirstInstance +- dzn: Add KHR_draw_indirect_count support +- dzn: Add missing VKAPI_{ATTR,CALL} specifiers to BeginCommandBuffer() +- dzn: Pass the dzn_event pointer to _mesa_hash_table_insert() +- dzn: Fix the STATIC_ASSERT() in dzn_meta_blits_get_context() +- vulkan/wsi: Make wsi_win32_surface_get_capabilities() return the current extent +- dzn: Fix dzn_cmd_buffer_collect_queries() +- dzn: Use the vk_sync_binary abstraction +- spirv: Fix windows build +- dzn: Use the correct ICD path on Win32 +- microsoft/spirv_to_dxil: Fix push_constants type declaration +- microsoft/spirv_to_dxil: Fix the push_constant UBO size calculation +- dzn: Emit missing transition barriers for rendering attachments +- ci/windows: Bump spirv-samples commit-ID +- microsoft/spirv2dxil: Add a debug option +- microsoft/compiler: Make sure all SV_Position components are written +- microsoft/spirv_to_dxil: Extract NIR passes out of spirv_to_dxil() +- microsoft/spirv_to_dxil: Constify the descriptor binding mapping fields +- microsoft/compiler: Make input_mappings immune to driver_location holes +- microsoft/spirv_to_dxil: Remove dead IO vars at the end of dxil_spirv_nir_passes() +- microsoft/spirv_to_dxil: Add a linking helper +- microsoft/compiler: Allow vulkan users to handle varyings linking +- microsoft/spirv_to_dxil: vectorize IOs +- dzn: Fix maxVertexInputAttributes limit +- dzn: Handle varying linking explicitly +- dzn: Fix unitialized var access in dzn_meta_compile_shader() +- dzn: Fix vertex input handling +- microsoft/spirv_to_dxil: Let spirv_to_dxil() handle the linking +- microsoft/compiler: Drop the auto_link parameter passed to nir_to_dxil() +- dzn: Switch to ID3D12Device2 +- dzn: Drop an unused argument passed to dzn_graphics_pipeline_translate_vi() +- dzn: Use CreatePipelineState() +- dzn: Enable depth-bounds testing +- microsoft/compiler: Lower fquantize2f16 +- nir: Fix flat new_var assignment in create_new_io_vars() +- microsoft/compiler: Pick a type that matches interpolation mode for structs +- microsoft/compiler: Truncate function names when needed +- vulkan/util: Fill VkPhysicalDeviceIDProperties::deviceNodeMask +- spirv: Add a dst/src type to image deref loads/stores coming from OpAtomic{Load,Store} +- microsoft/compiler: Images are no longer reprensented by uniform vars +- microsoft/compiler: Set typed_uav_load_additional_formats when appropriate +- microsoft/compiler: s/assert()/unreachable()/ in emit_intrinsic() +- microsoft/compiler: Support load_sample_id_no_per_sample +- dzn: Decorrelate external image info and external image properties +- dzn: Return empty external properties +- dzn: Add a dummy GetDeviceGroupPeerMemoryFeatures() +- dzn: Add a dummy GetImageSparseMemoryRequirements2() +- dzn: Add dummy {Create,Destroy}SamplerYcbcrConversion() implementations +- dzn: Check that no export is requested in the memory allocation path +- dzn: Lower alignment requirements when allocating buffers or single-sample images +- dzn: Add a dzn_desc_type_has_sampler() helper +- dzn: Implement GetDescriptorSetLayoutSupport() +- dzn: Vulkan 1.1 wants maxPerSetDescriptors >= 1024 +- dzn: Prepare support for pipeline variants +- dzn: Fix primitiveRestart support +- dzn: Fix triangle-fan emulation +- dzn: Support dynamic depth testing parameters properly +- dzn: Support dynamic line width +- ci/windows: Allow skipping deqp tests if we have to +- ci/windows: Add dEQP-VK.dynamic_state.* to the test list +- ci/windows: Skip dEQP-VK.api.command_buffers.record_many_draws* +- d3d12: Keep access masks up-to-date when removing vars in d3d12_disable_multisampling() +- microsoft/compiler: Move d3d12_fix_io_uint_type() to dxil_nir.c +- microsoft/compiler: Fix Layer type +- microsoft/compiler: Support gl_Layer +- dzn: Disable rasterization if the last geometry stage doesn't write the position +- dzn: Enable support for geometry shaders +- ci/dzn: Move to a .toml description +- dzn: Fix the 2DMSArray case in dzn_image_get_dsv_desc() +- dzn: Fix CmdPushConstants() +- dzn: Clamp depthBiasConstantFactor when doing the float -> int conversion +- dzn: Transition resource to RENDER_TARGET/DEPTH_WRITE before clears +- microsoft/compiler: Fix emit_ubo_var() +- ci/dzn: Copy testlog.{css,xsl} to the result dir +- nir/serialize: Put dest last in packed_instr::tex +- dzn: Advertise anisotropic filtering support +- dzn: Advertise shaderImageGatherExtended support +- dzn: Enable dynamic indexing on all kind of descriptors +- dzn: Enable shader{Clip,Cull}Distance +- dzn: Enable the depthClamp feature +- nir/serialize: Support texop >= 16 +- nir/serialize: Silence integer-overflow false positive +- vulkan: Add a vk_pipeline_hash_shader_stage() helper +- anv: Use vk_pipeline_hash_shader_stage() +- v3dv: Fix nir_shader leaks in v3dv_meta_{clear,copy}() +- v3dv: Use vk_pipeline_hash_shader_stage() +- tu: Use vk_pipeline_hash_shader_stage() +- radv: Kill unused fs_m local var in radv_create_shaders() +- radv: Use vk_pipeline_hash_shader_stage() +- dzn: Hook-up device-lost detection +- dzn: Fix indentation +- dzn: Initialize UUIDs +- dzn: Drop dzn_pipeline_cache.c +- dzn: Add missing D3D12_CACHED_PIPELINE_STATE to MAX_COMPUTE_PIPELINE_STATE_STREAM_SIZE +- dzn: Fix assertion in d3d12_pipeline_state_stream_new_desc() +- dzn: Use d3d12_compute_pipeline_state_stream_new_desc() in the compute path +- dzn: Pass the maximum stream size to d3d12_pipeline_state_stream_new_desc() +- dzn: Drop useless while(link_mask != 0) +- dzn: Drop unused allocator passed to dzn_graphics_pipeline_create() +- dzn: Let dzn_pipeline_init() initialize the root signature in the stream +- dzn: Fix potential nir_shader leak +- dzn: Use vk_to_mesa_shader_stage() +- dzn: Save a few indentation levels in graphics_pipeline_compile_shaders() +- dzn: Don't delegate binding translation to dxil_spirv_nir_passes() +- dzn: Hash pipeline layout stages +- dzn: Add a generic cached blob wrapper +- dzn: Drop unneeded goto statement in dzn_compute_pipeline_create() +- dzn: Move the compute shader compilation logic to a sub-function +- dzn: Cache NIR shaders +- dzn: Caculate a binding translation hash +- dzn: Cache DXIL shaders +- dzn: Cache pipeline info +- ci/dzn: Add pipeline cache tests to the include list +- dzn: Cache and pack transition barriers +- dzn: Don't call dzn_cmd_buffer_queue_transition_barriers() prematurely +- dzn: Rename dzn_cmd_buffer_queue_image_subres_range_transition() +- dzn: Introduce dzn_cmd_buffer_queue_image_range_state_transition() +- dzn: Allow shaders to read from depth/stencil attachments +- dzn: Fix copies of block-based formats +- dzn: Fix unused var warnings +- dzn: Fix 'statement with no effect' warning +- dzn: Fix printf() format in dxgi_get_factory() error message +- dzn: Fix {back,front}_wr_uses_ref assignment +- dzn: Fix -Wswitch warnings +- dzn: Use the vk_descriptor_set_layout base struct +- dzn: Use the vk_pipeline_layout base struct +- dzn: Mark transition barriers as executed when we execute barriers +- dzn: Query D3D12_FEATURE_D3D12_OPTIONS3 features +- dzn: Provide a helper to check if 2 formats are compatible +- dzn: Support native image copies when formats are compatible +- dzn: Take VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT into account + +Boyuan Zhang (9): + +- radeonsi/vcn: add vcn 4.0 encode support +- radeonsi/vcn: add vcn 4.0 encode fw interface version +- radeonsi/vcn: add encode context ib for vcn 4.0 +- radeonsi/vcn: add session init ib for vcn 4.0 +- radeonsi/vcn: enable vcn 4.0 encode for gfx11 asic +- radeonsi/vcn: use calculated max hierarchy depth for hevc enc +- vl: add cabac init idc for h264 encode +- frontends/va: get entropy mode info for h264 encode +- radeonsi/vcn/enc: enable cabac entropy mode + +Brian Paul (101): + +- gallivm: assorted clean-ups in lp_bld_nir_aos.c +- gallivm: whitespace fixes in lp_bld_tgsi.h +- gallivm: asst. clean-ups in lp_bld_sample.h +- llvmpipe: asst. clean-ups in lp_linear_fastpath.c +- llvmpipe: asst. clean-ups in lp_setup.c +- llvmpipe: asst. clean-ups in lp_state_setup.c +- gallivm: asst. clean-ups in lp_bld_type.[ch] +- gallivm: cleanup lp_bld_arit.c +- gallivm: clean-up in lp_bld_nir.h +- gallivm: add const qualifier in lp_bld_nir_soa.c +- llvmpipe: asst. clean-up in lp_state_derived.c +- llvmpipe: asst. clean-ups in lp_rast_linear.c +- llvmpipe: fix comment typo in lp_linear_fastpath.c +- llvmpipe: asst. clean-ups in lp_rast.c +- llvmpipe: asst. clean-ups in lp_linear.c +- llvmpipe: asst. clean-ups in lp_scene.h +- llvmpipe: asst. clean-up and add comments in lp_state_fs.h +- gallivm: clean-up in lp_bld_sample.c +- llvmpipe: asst. clean-up in lp_setup_context.h +- gallivm: asst. clean-ups in lp_bld_sample_soa.c +- gallivm: clean-ups in lp_bld_tgsi.[ch] +- llvmpipe: asst. clean-ups in lp_state_fs.c +- llvmpipe: minor clean-ups in lp_jit.h +- llvmpipe: asst. clean-ups in lp_state_fs_linear_llvm.c +- llvmpipe: asst. clean-ups in lp_state_fs_linear.c +- llvmpipe: asst. clean-ups in lp_setup_tri.c, lp_setup_rect.c +- llvmpipe: whitespace clean-ups in lp_rast.h +- llvmpipe: minor whitespace/formatting cleanups in lp_state_setup.h +- llvmpipe: minor clean-ups in lp_state_fs_analysis.c +- llvmpipe: misc clean-up in lp_rast_tri.c +- llvmpipe: asst. clean-ups in lp_bld_blend_aos.c +- llvmpipe: asst. clean-ups in lp_rast_tri_tmp.h +- llvmpipe: asst. clean-ups in lp_rast_priv.h +- gallivm: minor clean-ups in lp_bld_sample_soa.c +- llvmpipe: misc clean-ups in lp_bld_interp.c +- gallivm: asst. clean-ups in lp_bld_nir.c +- llvmpipe: fix comment typo in lp_bld_interp.h +- llvmpipe: asst. clean-ups in lp_linear_sampler.c +- llvmpipe: asst. clean-ups in lp_tex_sample.c +- llvmpipe: asst. clean-ups in lp_scene.c +- llvmpipe: minor clean-up in lp_state_blend.c +- llvmpipe: use enum blend types instead of unsigned +- llvmpipe: s/unsigned/enum pipe_compare_func/ +- gallivm: asst. clean-ups in lp_bld_logic. +- llvmpipe: asst. clean-ups in lp_state_fs.c +- draw: s/unsigned/enum pipe_prim_type/ +- draw: asst. clean-up in draw_context.h +- draw: asst. clean-up in draw_gs.h +- draw: asst. clean-up in draw_pipe.c +- draw: asst. clean-up in draw_pipe_validate.c +- draw: asst. clean-up in draw_private.h +- draw: asst. clean-up in draw_pt.h +- draw: asst. clean-up in draw_pt.c +- draw: asst. clean-up in draw_pt_fetch_shader_emit.c +- draw: asst. clean-up in draw_pt_fetch_shader_pipeline.c +- draw: asst. clean-up in draw_pt_util.c +- draw: asst. clean-up in draw_pt_vsplit.c +- draw: asst. clean-up in draw_vbuf.h +- gallivm: use enum types instead of unsigned in lp_bld_sample.[ch] +- llvmpipe: add minor comments in lp_rast.h, lp_setup-rect.c +- llvmpipe: remove lp_rast_cmd_arg::state field +- gallivm: s/unsigned/enum pipe_swizzle/ +- llvmpipe: add simple assertion in generate_fragment() +- util/bitset: add BITSET_SIZE() +- lavapipe: rework code to compute textures_used, samplers_used +- llvmpipe: minor code re-org in lp_state_fs_analysis.c +- nir: add const qualifiers, move some decls in nir_to_tgsi_info.c +- llvmpipe: replace if/then with switch in llvmpipe_nir_fn_is_linear_compat() +- gallivm: s/0/LP_BLD_TEX_MODIFIER_NONE/ +- llvmpipe: replace GET_A0() macro w/ inline function +- llvmpipe: add missing tex_info->texture_unit assignment +- llvmpipe: fix invalid memory used in lp_fs_linear_run +- llvmpipe: fix texcoord analysis in llvmpipe_nir_fn_is_linear_compat() +- llvmpipe: replace LP_RAST_OP\_ #defines with enum type +- gallivm: increase LP_MAX_TGSI_SHADER_IMAGES from 16 to 32 +- lavapipe: fix logicop, independent blend enable/disable +- llvmpipe: don't allow texture/resource swizzles on linear path +- llvmpipe: initialize a local var to fix compiler warning in release build +- lavapipe: fix incorrect sv[] array size +- llvmpipe: fix texcoord analysis regression +- llvmpipe: asst. clean-up in lp_bld_depth.c +- llvmpipe: minor white-space clean-up in lp_setup.h +- llvmpipe: asst. clean-ups in lp_setup_rect.c +- gallivm: asst. clean-ups in lp_bld_const.c +- llvmpipe: asst. clean-ups in lp_linear_interp.c +- llvmpipe: minor optimization in lp_linear_init_interp() +- llvmpipe: asst. clean-ups in lp_state_setup.c +- llvmpipe: remove unneeded casts +- llvmpipe: replace 'unsigned' with enum types +- gallivm: asst. clean-ups in lp_bld_swizzle.c +- gallivm: simplify shuffling in emit_store_reg() +- gallivm: fix incorrect memset() in SOA emit_load_const() +- gallivm: rename 'var' param in emit_store_reg(), emit_store_var(). +- llvmpipe: make opaque field a 1-bit field +- llvmpipe: minor whitespace clean-up in lp_state_fs.h +- llvmpipe: pass frontfacing param to jit_function() call +- llvmpipe: asst. clean-up in lp_rast_linear_fallback.c +- llvmpipe: remove unneeded casts in lp_rast_linear_fallback.c +- llvmpipe: consolidate code in lp_rast_linear_fallback.c +- llvmpipe: tighten up some code in lp_rast_linear_fallback.c +- llvmpipe: add some const qualifiers in lp_rast_linear_fallback.c + +Caio Oliveira (1): + +- intel/compiler: In XeHP prefer <1;1,0> regions before compacting + +Chad Versace (2): + +- venus: Refactor vn_fix_graphics_pipeline_create_info +- venus: Don't encode ignored pTessellationState + +Charlie Turner (4): + +- ci: fix shellcheck violations in the test-vk building script +- ci: add Linux build of apitrace +- ci: upgrade gfxreconstruct build to v0.9.10 +- ci: upgrade to DXVK v1.10.1 + +Charmaine Lee (20): + +- gallium/util: Increase the debug_flush map depth to 64 +- draw: fix double free of NIR IR +- svga: fix shader IR type passed to draw create shader function +- svga/nir: enable PIPE_CAP_TGSI_TEXCOORD +- svga: enable PIPE_CAP_TGSI_TEXCOORD for vgpu10 and up +- svga/tgsi: add translation for DTRUNC +- svga/nir: enable lower double option +- svga: workaround for unexpected double swizzle +- svga: fix clip distance shadow copy writemask +- svga: add a helper function for common shader creation +- svga: create svga_shader_info +- svga: add svga_compile_shader helper function +- svga: add need_texcoord_semantic to tgsi_add_point_sprite & tgsi_add_aa_point +- svga: fix aa point +- svga: fix reference to NULL offset for atomic counter +- svga: fix shared memory index +- svga: fix mksstats build +- mesa/st: fix reference to nir->info after nir_to_tgsi +- svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context +- svga: fix invalid component access of domain location + +Chia-I Wu (61): + +- turnip: fix drm modifier support with planar formats +- util: Keep quiet NaNs quiet when converting to half float. +- anv: advertise rectangularLines only for Gen10+ +- turnip: let modifier takes precedence over TU_DEBUG=noubwc +- freedreno/fdperf: make refresh rate configurable +- freedreno/fdperf: support dumping counters +- turnip: fix sampledImageIntegerSampleCounts +- turnip: emit VPC_SO_DISABLE in xfb begin/end +- ir3: fix mem_ctx for ir3_disasm_info::nir +- turnip: fix off-by-one in border color bitset +- turnip: fix tu6_pack_border_color for z24 +- turnip: disable UBWC for SNORM formats +- turnip: if-checks autotune debug macros +- turnip: base pipeline draw costs on colorWriteMask +- turnip: include stencil test for drawcall costs +- turnip: estimate render pass costs +- turnip: consider render pass costs in autotune +- ci/turnip: add a link to VkPhysicalDeviceMemoryProperties failure +- turnip: force sample interpolations for sample shading +- turnip: advertise VK_EXT_border_color_swizzle +- freedreno/fdl: use XYZW swap for PIPE_FORMAT_X24S8_UINT +- vulkan/device_select: remove get_pdevice_proc_addr +- freedreno/drm-shim: add a660 +- turnip: fix an assertion with drm-shim +- ir3: update ir3_const_state comment +- ir3: copy req_local_mem for MESA_SHADER_KERNEL +- ir3: fix output_loc size +- ir3: remove unused patch_vertices_in +- ir3: increment constoff right after it is assigned +- ir3: fix tess param allocation +- ir3: set UL flag before ir3_lower_subgroups +- turnip: remove shared_consts from tu_compiled_shaders +- turnip: shared_consts and push_consts are mutually exclusive +- turnip: add tu_common.h as the common header +- turnip: remove includes that are already in tu_common.h +- turnip: add tu_drm.h +- turnip: add tu_suballoc.h +- turnip: update tu_cs.h +- turnip: add tu_query.h +- turnip: add tu_image.h +- turnip: add tu_formats.h +- turnip: update tu_descriptor_set.h +- turnip: add tu_shader.h +- turnip: add tu_pipeline.h +- turnip: add tu_clear_blit.h +- turnip: add tu_dynamic_rendering.h +- turnip: add tu_lrz.h +- turnip: add tu_pass.h +- turnip: add tu_wsi.h +- turnip: update tu_autotune.h +- turnip: add tu_device.h +- turnip: add tu_cmd_buffer.h +- turnip: add tu_android.h +- turnip: update tu_util.h +- turnip: move away from tu_private.h +- turnip: remove tu_private.h +- turnip: remove headers from libtu_files +- turnip: use SPDX-License-Identifier +- turnip: fix a use-after-free in autotune +- turnip: lower the queue priority to 1 +- ir3: fix predicate splitting in scheduler + +Christian Gmeiner (12): + +- isaspec: Extend split_bits(..) to accept a bitsize +- isaspec: Handle patterns bigger then 64 bit +- nir: Fix unused-variable compile warnings +- freedreno: Wait for async compile jobs to finish +- ir3: Create at least one thread for async shader compile +- gtest: Fix maybe-uninitialized compiler warning +- r600: Switch to unreachable(..) +- svga: Clear query before usage +- ci: Remove -Wno-error=maybe-uninitialized +- etnaviv: factor out shader screen related init/deint +- etnaviv: async shader compile +- etnaviv: Move nir_shader_compiler_options to compiler + +Chuansheng Liu (1): + +- iris,anv: correct the max thread number for DG2+ + +Clément Guérin (1): + +- venus: implement VK_EXT_physical_device_drm + +Connor Abbott (69): + +- tu: Delete unused tu_clear_blit GS handling +- ir3, freedreno, tu: Make ir3_shader_variant store stream_output +- ir3, fd, tu: Copy misc. info from ir3_shader to ir3_shader_variant +- ir3: Use ir3_shader_variant::type more often +- ir3: Allocate disasm_info under variant +- ir3: Add ir3_shader_variant::compiler +- tu: Keep original blit shaders separately +- ir3: Remove ir3_shader_variant::shader +- ir3: Add functions to serialize variants +- ir3: Support disabling the pipeline cache +- ir3: Add ir3_shader_create_variant() +- tu: Hash pipeline layout contents +- tu: Rewrite cache UUID based on radv +- tu: Include turnip debug flags in pipeline cache UUID +- tu: Implement pipeline caching with shared Vulkan cache +- tu: Zero-initialize compute driver key +- tu: Implement VK_EXT_pipeline_creation_feedback +- tu: Implement VK_EXT_pipeline_creation_cache_control +- tu: Convert to sync2 entrypoints +- tu: Expose VK_KHR_synchronization2 +- tu: Implement VK_EXT_image_2d_view_of_3d +- tu: Enable shaderTessellationAndGeometryPointSize +- tu: Enable VK_KHR_swapchain_mutable_format +- tu: Fix partial-write LRZ determination +- tu: Implement extendedDynamicState2LogicOp +- tu: Implement VK_EXT_color_write_enable +- tu: Enable uniform texture buffers with NPOT formats +- freedreno/a6xx: Support more 3-component formats +- tu: Fix 3d GMEM store swizzle in texture descriptor +- tu: Add missing WFI to the 3d GMEM store path +- tu: Save/restore \*_BIN_CONTROL in 3d GMEM store path +- ir3/sched: Fix could_sched() determination +- tu: Fix linemode for tessellation with isolines +- ir3: Fix vectorizer condition for SSBOs +- freedreno: Document a650-specific CP_REG_WRITE flag +- freedreno/a6xx: Document border color/swizzle interaction +- freedreno/a6xx: Fix border color swizzling +- freedreno/a6xx: Use fdl format swizzle +- freedreno/fdl: Note border color constraints +- freedreno/a6xx: Fix border color with Z24UINT_S8UINT +- tu: Fix stencil border color with has_z24uint_s8uint +- mesa/st: Fix border color type for stencil sampling +- nv50/ir/ra: Fix copying compound for moves +- freedreno/a6xx: VPC_SO_NCOMP is actually VPC_SO_BUFFER_STRIDE +- tu: Use incoherent CCU write for buffer accesses +- tu: Support resolving D24S8 -> S8 +- tu: Fix resolving d32s8 into s8 on fast path +- tu/lrz: Do not use framebuffer when inheriting LRZ +- tu: Treat CP_WAIT_FOR_ME as a cache invalidate +- tu: Rewrite tess factor emission +- tu: Move TU_DONT_CARE_AS_LOAD into attachment_set_ops() +- tu: Disable GMEM for multiview inside tu_render_pass_gmem_config +- tu: Move tu_render_pass definition up +- tu: Remove usage of RenderPassBeginInfo +- tu: Initial support for dynamic rendering +- tu: Remove has_subpass_predication +- tu: Update more state with secondaries +- tu: Split out some state into a separate struct +- util/trace: Fix u_trace_*_iterator with no tracepoints +- tu: Add support for suspending and resuming renderpasses +- tu: Use common vk_image struct +- tu: Use common vk_image_view base struct +- tu: Add debug option to use emulated renderpass support +- tu: Restore formatting of tu_clear_blit.c +- tu: Fix sysmem depth attachment clear flushing +- ir3/spill: Fix extracting from a vector at the end of a block +- tu/lrz: Fix multiple subpass case with secondaries +- tu/lrz: Fix multiple depth attachment case with secondaries +- tu: Don't preload variable-count descriptors + +Constantine Shablya (1): + +- anv: set image_read_without_format NIR option on Vulkan 1.3 + +Corentin Noël (5): + +- mesa: Make sure to fallback to handling the original choose texture format +- virgl/ci: Update virglrenderer +- virgl/ci: Update virglrenderer +- virgl: Add support for passing the supported number of components from virgl +- virgl: Only progagate the uniform numbers if the numbers are actually right + +Cristian Ciocaltea (4): + +- virgl/ci: Rename virgl-lava-* jobs to virgl-iris-* +- virgl/ci: Update checksums for some virgl iris traces +- ci: Add CPU frequency adjustment capability +- ci: Limit Intel CPU scaling frequency for performance tests + +Daniel Schürmann (36): + +- nir/lower_alu_to_scalar: don't set the nir_builder cursor +- aco: remove 'max_waves' and use 'num_waves' to adjust for LDS and workgroup size +- aco: split num_waves adjustment into separate function +- aco: adjust num_waves for LDS before scheduling +- aco/optimizer: prevent any overflow between SGPR and const offset on MUBUF +- aco/ra: fix live-range splits of phi definitions +- aco: don't re-create vectors for load_barycentric_* intrinsics +- aco/ra: improve split_vector register assignment if the operand is not killed +- aco/optimizer: re-combine and copy-propagate p_create_vector(p_split_vector) +- radv: move two NIR passes out of tight optimization loop +- radv: call nir_lower_vars_to_ssa later +- aco: fix spilling of phis without temp operands +- nir/opt_vectorize: add callback for max vectorization width +- nir: introduce new nir_alu_alu_width() with nir_vectorize_cb callback +- nir/opt_vectorize: refactor src rewriting to avoid unnecessary mov instructions +- aco: correctly validate v_fma_mixhi_f16 register assignment +- radv: don't lower vectorized instructions to 32bit +- radv: use callback for nir_lower_to_scalar +- radv: vectorize nir_op_fdiv & nir_op_flrp & nir_op_ineg +- radv: vectorize nir_op_fabs +- radv: fix swizzles after nir_opt_algebraic_late +- radv: reorder some NIR passes +- nir/opt_shrink_vectors: fix re-using of components for vecN +- radv/shader_info: fix load_frag_coord and load_sample_pos read masks +- aco: fix packed 16bit fneg/fsat optimization +- aco: avoid unnecessary copies in emit_wqm() +- aco: Avoid live-range splits in Exact mode +- aco: fix assertion in insert_exec_mask +- spirv: handle phis decorated with RelaxedPrecision +- radv: Only create noop-fs for graphics pipelines +- radv: create RT traversal as separate shader +- radv/rt: use derefs for the traversal stack +- radv/rt: use stage ID as handle for general and closestHit shaders +- radv/rt: Don't load ClosestHit SBT on every hit, but only once after traversal +- radv/rt: fix nir_builder cursor in lower_rt_instructions() +- radv/rt: simplify lower_rt_instructions() + +Daniel Stone (10): + +- dzn/ci: Don't spam conformance warnings +- CI: Disable Windows jobs +- ci: Allow lima jobs to be globally disabled +- ci: Also disable Windows container builds when down +- ci/windows: Eliminate duplicate script definition +- ci/zink: Add glx-tfp to skips +- ci/lava: Sleep before, not after, API calls +- ci: freedreno farm is down +- CI: Disable Collabora lab +- Revert "CI: Disable Collabora lab" + +Danylo Piliaiev (53): + +- turnip: Try harder to keep LRZ valid and fix a few edge cases +- freedreno/a6xx: Add UNK fields to CP_REG_TEST and CP_COND_REG_EXEC +- turnip: Skip load/stores for tiles with no geometry +- turnip: Add debug option to print gmem load/store skip stats +- turnip: Fix tu_debug_flags values clashing +- tu: Implement VK_EXT_image_view_min_lod +- tu: Do not flush ccu in clear/blits during renderpass +- pps: Open writable renderer node in DrmDevice::create +- tu: Fix indices of drm_msm_gem_submit_cmd when filling them +- freedreno/a6xx: Add EARLYPREAMBLE flag to all a6xx_sp_xs_ctrl_reg0 +- util/u_trace: Add json output +- docs/u_trace: document u_trace usage +- ir3: handle gl_Layer and gl_ViewportIndex when there is TES + GS +- ir3: Force late-z if FS has global store/atomic +- tu: Fix D32S8 resolves into D32 or S8 in sysmem mode +- spirv: Workaround for RelaxedPrecision on OpLogical* in 3DMark +- turnip: Allow nested CP_COND_REG_EXEC +- turnip: Fix out-of-bounds builder->shader_iova access +- tu/autotune: Prevent division by zero +- turnip: copy disasm strings for pipeline executables +- tu: Disable sample counting for 3d blits during occlusion query +- tu: Reset xfb_used at the end of a renderpass +- meson: Define _GNU_SOURCE for android host system +- meson/pps: Check if libdrm exists to compile pps +- meson/tu: Don't compile libdrm paths if KGSL is selected +- tu: Do not expose storage image/buffer features for PACK16 formats +- tu: Use hw binning or sysmem with QUERY_TYPE_PRIMITIVES_GENERATED +- tu: Don't count 3d blits in QUERY_TYPE_PRIMITIVES_GENERATED +- freedreno: Document rest of GRAS_LRZ_CNTL, clarify UNK_25 event +- freedreno: document GRAS_UNKNOWN_810A +- tu: Overhaul LRZ, implement on-GPU dir tracking and LRZ fast-clear +- ir3: Use NIR's info.writes_memory to detect when when to force late-z +- tu,freedreno: Refactored START/STOP events for pipeline stats +- tu: Fix prim gen query and pipeline stats query interaction +- zink: re-enable EXT_primitives_generated_query for Turnip +- util/u_trace: Fix iteration over config_control +- turnip: Refactor tracepoints generation to reduce duplication +- turnip: Add TU_GPU_TRACEPOINT envvar to toggle tracepoints +- freedreno: Add the rest of tracepoints with start/end to perfetto +- freedreno: Refactor tracepoints generation to reduce duplication +- freedreno: Add FD_GPU_TRACEPOINT envvar to toggle tracepoints +- docs/u_trace: Add documentation for \*_GPU_TRACEPOINT envvars +- turnip: Disable LRZ fast-clear for gen1 and gen2 +- tu: Do not dereference descriptorSetLayout in push descriptors tmpl +- zink/turnip: Add comments to already triaged failures for tu baseline +- turnip: Simple breadcrumbs implementation to debug hangs +- tu: WFI after PC_TESSFACTOR_ADDR update +- freedreno: WFI after PC_TESSFACTOR_ADDR update +- ir3/ra: Always insert interval for precolored inputs +- ir3: Never remove GS_HEADER_IR3 sysval input +- tu: Disable LRZ write when alpha-to-coverage is enabled +- freedreno: Disable LRZ write when alpha-to-coverage is enabled +- ir3: Prevent reordering movmsk with kill + +Dave Airlie (82): + +- clc: add simple llvm initialise API +- clc: initialise one more llvm stage +- u_blitter/stencil: take dstbox x/y into accounts for dst fb width +- util/stencil: fix stencil fallback blit shader texture types. +- llvmpipe: add user memory resources to the debug global list. +- gallium/vl: wrap codec support checks in a common function. +- gallium/omx: add video codec supported hook for decode paths. +- meson: add a video codec support option +- gallium/vl: respect the video codecs configure in meson +- vulkan/wsi: keep allocate queue families in image, just don't fill them +- gallivm: move to new pass manager to handle coroutines change. +- radv: precalculate hs offchip parameters. +- radv: precalculate tess ring sizes/offsets. +- radv/ac: introduce a new common function for hs calcs. +- ac: port radeonsi tess factor calculations to common code. +- radeonsi: port tess ring calcs to the common helper. +- zink: avoid extra submits for empty wait cmdbufs. +- zink: don't finish barriers cmd buffer if not used. +- radv: cache the slab bo pointer in the pipeline. +- radv: only dirty line stipple when necessary. +- radv: optimised command buffer reset of vertex bindings. +- aco: move info pointer to a copy. +- aco: move to a minimal aco shader info struct. +- aco/info: reduce the gs ring info to what is needed. +- aco: remove radv specific streamout info +- aco: convert vs and so info over to aco structs. +- aco/info: add some more fields. +- aco/radv: convert to aco shader info at the radv level. +- aco: remove radv vs prolog key from aco internals. +- aco: drop unused radv include +- ac/radv: drop info pointer from the ac and radv shader structs +- aco/radv: drop radv_nir_compiler_options from aco. +- llvmpipe: align scratch size to 64-bit size. +- meson: add build-id to pipe libraries +- vl: fix codec checks to disable properly +- lavapipe: fix depth bias offset flag enables. +- llvmpipe: flush resources for kms swrast path. +- llvmpipe: handle timespec overflow on fence waits. +- nouveau: move codegen into a standalone library. +- nouveau: move codegen to a common higher level directory. +- nouveau/codegen: drop all ubytes from codegen. +- nouveau/codegen: drop gallium headers from the interface. +- llvmpipe: disable alpha test branch if multisample is enabled. +- egl/x11: split large put image requests to avoid server destroy +- egl/x11: add missing put_image cookie cleanups +- wsi/x11: add xcb_put_image support for larger transfers. +- glx/drisw: use xcb instead of X to query connection +- draw/cliptest: add support for guard band + full Z +- zink: workaround depth sampler border color when z24 is z32 +- aco: refactor the radv binary builder out of the core aco fn. +- aco/radv: provide a callback from aco shader building to build binary +- aco/radv: provide a vs prolog callback from aco to radv. +- aco: drop radv_shader.h include +- radv: add a dynamic vertex format cache. +- lavapipe: drop unreachable pNext checks. +- lavapipe: enable variablePointers +- kms/dri: add mutex lock around map/unmap +- crocus: fail query begin if upload allocation fails. +- lavapipe: state latest conformance tests passed +- llvmpipe: fix aniso cube map arrays. +- crocus: fix leak in query code. +- llvmpipe: make last_fence a screen/rast object not a context one. +- llvmpipe: keep context list and use to track resource usage. +- gallivm: use progress from subgroup lowering. +- gallivm: fix casting around read invocation. +- llvmpipe: export ARB_shader_ballot +- lavapipe: expose some subgroup extensions +- zink: fixup subgroup vote/ballot enables. +- zink/llvmpipe/ci: skip broken CTS ballot test. +- util: add reallocarray wrapper for win32 +- llvmpipe: Allocate scene tiles dynamically +- gallium: add a rasterizer state bit for unrestricted depth values. +- mesa/st: disable unrestricted fragment depth values for GL/GLES +- util/pack_color: clamp depth values outside range for unorm formats. +- llvmpipe/fs: handle unrestricted depth values. +- lavapipe: enable VK_EXT_depth_range_unrestricted. +- util/reallocarray: add errno.h include. +- lavapipe: enable VK_FORMAT_R5G5B5A1_UNORM_PACK16 +- gallivm: fix printf hook for cached shaders. +- draw: don't touch info values that aren't valid. +- nir_to_tgsi_info: drop const_buffers_declared +- llvmpipe: finish rendering before flushing frontbuffer resources. + +David Heidelberg (58): + +- ci: skqp: patch skqp to report also GL results +- ci: skqp: upgrade to skqp based on Android CTS 11.0 (r7) +- ci: skqp: update URL +- ci: intel: Merge anv and iris into src/intel/ci +- ci/iris: Enable SKQP on Tiger Lake boards +- ci: traces: switch to Valve trimmed traces +- ci: traces: re-enable disabled traces which are now trimmed traces +- ci: traces: virgl: reenable previously crashing Godot trace +- ci: uprev piglit 2022-05-10 +- ci/iris: skqp: add default GLES rendertests for TGL +- ci/iris: skqp: remove flaking atlastext for TGL +- ci/virgl: fix checksum for valve traces which run on iris +- ci/freedreno: enable ROR and Nheko traces +- ci/i915: add entries for RoR and Nheko traces +- ci/intel: add RoR and Nheko traces and reenable most of Valve traces +- ci/crocus: add RoR and Nheko traces +- ci/llvmpipe: add RoR and Nheko traces +- ci/radeonsi: add RoR and Nheko traces +- ci/virgl: traces: add RoR and Nheko traces +- ci/panfrost: add RoR and Nheko traces +- ci/iris: skqp: remove flaking atlastext for TGL (gl version) +- ci: traces: temporarily disable nheko trace +- ci/virgl: traces: temporarily disable nheko trace +- ci: uprev piglit 2022-06-09 +- ci/etnaviv: sort fails.txt +- ci/etnaviv: add recent fails and remove 3 unexpected passes +- ci/etnaviv: disable tests which takes too long +- ci/etnaviv: add recent flakes for gc2000 +- ci/etnaviv: gc2000: drop flakes from fails +- ci/freedreno: add Blender, Warzone2100, Freedoom and Unvanquished traces +- ci/freedreno: fix A530 glmark2@ideas:speed=10000 trace +- ci/llvmpipe: add Blender, Warzone2100, Freedoom and Unvanquished traces +- ci/iris: add Blender, Warzone2100, Freedoom and Unvanquished traces +- ci/virgl: add Blender, Warzone2100, Freedoom and Unvanquished traces +- ci/virgl: iris: disable Counter Strike 1.6 trace flaking +- ci/crocus: add Blender, Warzone2100, Freedoom and Unvanquished traces +- ci/panfrost: add Blender, Warzone2100, Freedoom and Unvanquished traces +- ci/iris: disable Blender Diorama trace (incompatible with perf. traces) +- ci/freedreno: update restricted traces to the brotli compressed version +- ci/freedreno: smuggle myself into restricted traces access list +- ci/freedreno: disable non-existing trace +- ci: Uprev apitrace to 790380e05854 +- ci: traces: switch to brotli compressed traces +- ci/freedreno: add more restricted traces +- ci/freedreno: temporary disable AmnesiaTDD +- ci/freedreno: disable Stellaris trace +- ci/freedreno: disable SpecOps trace, each run flaky +- Revert "ci/freedreno: temporary disable AmnesiaTDD" +- ci/traces: piglit, be more verbose +- docs: briefly document Gallium Nine +- ci: add script for manage Mesa CI +- ci: Allow disabling the whole of the Igalia farm +- ci: move apitrace instalation outside of x86_test-vk +- ci: wine-apitrace bump to 11.1 +- ci/freedreno: 3 pixel change in Raven restricted trace +- nine: char can be unsigned on non-x86 platforms +- mailmap: fix order of emails for Axel Davy +- ci: set reasonable amount of wget retries for each download + +Dawn Han (6): + +- Add iub features and properties passthrough +- Refactor the descriptor enums to be extensible +- Add iub type to vn_descriptor_type Verified the pNext VkWriteDescriptorSet is not NULL on iub type +- Add the iub binding count tracking +- Refactor and add template support for iub +- venus: enabled iub in physical device + +Denis Pauk (1): + +- panvk: Return VK_ERROR_INCOMPATIBLE_DRIVER for Midgard + +Dmitry Osipenko (1): + +- virgl: Fix unmapping of blob resources + +Dylan Baker (74): + +- VERSION: bump to 22.2-devel for next cycle +- docs: truncate new_features.txt +- docs: add release notes for 22.0.2 +- docs: Add sha256sums for 22.0.2 +- docs: update calendar and link releases notes for 22.0.2 +- docs: update calendar for 22.1.0-rc1 +- docs: update calendar for 22.1.0-rc2 +- winsys/radeon: remove unused simple_list.h +- r300: remove unused simple_list.h +- gallium/rbug: replace simple_list.h with list.h +- gallium/driver_trace: remove simple_list.h +- util/list.h: add a function to move an item in a list +- gallium/util/u_cache.c: remove trailing whitespace +- gallium/util/u_cache: replace simple_list.h with list.h +- gallium/draw: remove trailing whitespace +- gallium/draw: use list.h instead of simple_list.h +- llvmpipe: cleanup trailing whitespace +- llvmpipe: replace uses of simple_list.h with list.h +- util: remove simple_list +- docs: Extend calendar entries for 22.0 by 1 releases. +- docs: update calendar and link releases notes for 22.0.3 +- docs: Extend calendar entries for 22.1 by 1 releases. +- docs: update calendar for 22.1.0-rc3 +- docs: update calendar for 22.1.0-rc4 +- docs: add release notes for 22.0.3 +- docs: Add sh256sum for mesa 20.0.3 +- docs: update calendar and link releases notes for 22.1.0 +- docs: add release notes for 22.1.0 +- relnotes: Add sha256sum and fix minor formatting issues +- docs: Add calendar entries for 22.1 release. +- docs: add release notes for 22.0.4 +- docs: add sha256sum to 22.0.4 notes +- docs: Extend calendar entries for 22.0 by 1 releases. +- docs: update calendar and link releases notes for 22.0.4 +- docs: add release notes for 22.1.1 +- docs: Add sh256sums for 22.1.1 +- docs: update calendar and link releases notes for 22.1.1 +- docs: add release notes for 22.0.5 +- docs: update sha256 for 22.0.5 +- docs: update calendar and link releases notes for 22.0.5 +- docs: add release notes for 21.3.9 +- docs Add sha256 sums for 21.3.9 +- docs: link releases notes for 21.3.9 +- docs: add release notes for 22.1.2 +- docs: add sha256sum to 22.1.2 notes +- docs: update calendar and link releases notes for 22.1.2 +- docs: drop 22.1.4 calendar entry. +- docs: Add calendar entries for 22.2 release candidates. +- docs: Extend calendar entries for 22.1 by 3 releases. +- docs: add release notes for 22.1.3 +- docs: add sah256sum for mesa 22.1.3 +- docs: update calendar and link releases notes for 22.1.3 +- docs: Bump 22.2 branchpoint by two weeks +- anv: add gfx version 12.5 flushes to CCS path +- iris|anv: gfx version 12.5 data cache flush is not a workaround +- VERSION: bump 22.3.0-devel +- VERSION: bump for 22.2.0-rc1 +- .pick_status.json: Update to 8e6bdb2ed31f27c28d258859ec48e8e67f51ff41 +- .pick_status.json: Update to 0a0205f04552ab21971c431a5f8bb7486d664422 +- .pick_status.json: Update to 1a3b086b06c6c1ff63a4abe3752ca0a7a854dc77 +- .pick_status.json: Update to c67e60ae8fefaeadbe06bf0cbd8ad2ee267a6c0e +- .pick_status.json: Update to 70891edd9728e7dd73115171fdb03622ce6450ba +- VERSION: update to 22.2.0 +- Revert "VERSION: update to 22.2.0" +- VERSION: bump to 22.2.0-rc2 +- .pick_status.json: Update to a3bf0da1cbd4b10043c80bf44609a3024b5fcc36 +- .pick_status.json: Update to 24b9ad7cd5ebc7cfa5d03cf0f243ea4841c971b9 +- .pick_status.json: Update to 74fc367127ccf945f4c649dd6ddff955c802e36e +- .pick_status.json: Mark 11ab6087797f805cf158048915c67945613c9a72 as denominated +- VERSION: bump to 22.2.0-rc3 +- .pick_status.json: Update to 0c6fbfca0c91ef012e8ab767a317c07f1f6dc5e6 +- .pick_status.json: Update to 8eac45b27446cd9b9eaeb147af97fff1e09832cb +- .pick_status.json: Update to baf24dea943202b3a92cad0c9f9648597040955a +- .pick_status.json: Update to 3d4c36a3bcc51ed441b2667d92291bea30ef7449 + +Eli Schwartz (1): + +- meson: add various generated header dependencies as order-only deps + +Ella Stanforth (1): + +- v3dv: Implement VK_KHR_performance_query + +Emil Velikov (1): + +- c11: reinstate the original license and authorship + +Emma Anholt (232): + +- nouveau/nir: Don't try to emit OP_FMA pre-nvc0. +- Revert "ci: Disable Google's lab" +- ci/virgl: Add piglit and GL4.3 testing to the virpipe path. +- nir: Add lowering for fround_even on r300. +- nir: Skip fround_even on already-integral values. +- nir: Avoid generating extra ftruncs for array handling. +- turnip: Add TU_DEBUG=layout for dumping image layouts. +- freedreno/ir3: Make sched nodes before adding deps. +- turnip: Add nir_opt_conditional_discard. +- freedreno/ir3: Call nir_opt_find_array_copies(). +- turnip: Ignore TOP/BOTTOM_OF_PIPE bits in subpass src/dst dep flags. +- turnip: Don't disable LRZ in subpasses after the first in the easy case. +- nir_to_tgsi: Add a workaround for virglrenderer TG4. +- nir_to_tgsi: Fix assertion failures handling 64-bit vec3/vec4 ssa undefs. +- nir_to_tgsi: Don't forget to split 64-bit store_per_vertex_output. +- nir_to_tgsi: Avoid swizzling from undefined channels in load_output. +- nir_to_tgsi: Allocate the primid sysval to num_inputs, not num_outputs. +- tgsi: Emit ureg HW_ATOMIC decls in range order. +- tgsi_to_nir: Emit load_ubo_vec4 instead of load_ubo on non-integer HW. +- nine: Disable optional use of TTN when MUL_ZERO_WINS is available. +- r600: Fix up some mis-indentation of blocks. +- ci/r600: Manual run updates. +- r600: Fix reading back from a temp array immediately after writing on RV770. +- r600/sb: Avoid causing an exception when getting the reciprocal of 0u. +- r600: Use nir-to-tgsi instead of TGSI when the NIR debug opt is disabled. +- nouveau/nv50: Print the number of loops in shader-db output. +- ci/nouveau: Add expectations files for GM206. +- ci/nouveau: Add nv92 xfails. +- nouveau/nir: Disable bitfield ops pre-nvc0. +- nouveau/nv50: Enable mesa/st alpha test lowering on nv50 with NIR. +- nouveau/nv50: Set the primid sysval flag if it's in the sysval list, too. +- nouveau/nir: Fix the inverted sense of usesSampleMaskIn. +- nouveau/nir: Fix edgeflag input detection. +- nouveau/nir: Set the input for vertex/instance ID like TGSI does. +- ci/dzn: Cut the runtime by running 1/3 of the enabled tests. +- nir_to_tgsi: Make vec_to_movs avoid unsupported coalescing for 64-bit. +- nir_to_tgsi: Lower FS input array indexing since we don't declare input arrays. +- tgsi: Extend array lengths when merging usage_masks. +- virgl: Switch to nir-to-tgsi by default. +- nir/lower_tex: Make the adding a 0 LOD to nir_op_tex in the VS optional. +- nir_to_tgsi: Drop the txl(lod=0)->tex nir_lower_tex() fixup. +- nouveau/nir: Move FS output stores to the end of the last block. +- ci/nouveau: Add MESA_GLES_VERSION_OVERRIDE=3.1 baseline state. +- nouveau/nir: Add support for pre-GF100 images and ssbos. +- nouveau/nir: Put the UBO offset indirect into the address reg. +- nouveau: Enable the NIR backend by default. +- util: Add some unit tests of the half-float conversions. +- nir: Eliminate out-of-bounds read/writes in local lowering. +- nir: Don't assert on tg4 offset range. +- svga: Add support for requesting NIR and translating to TGSI. +- ci/iris: Demote APL deqp to manual-only for now. +- ci/iris: Add a bunch of APL and KBL flakes recently. +- mesa/st: Always generate NIR from GLSL, and use nir_to_tgsi for TGSI drivers. +- mesa/st: Remove st_glsl_to_tgsi. +- mesa/st: Remove TGSI shader program variant handling. +- mesa/st: Drop the attrib remapping in st_prepare_vertex_program(). +- mesa/st: Remove TGSI-only shader lowering code. +- tgsi_scan: Remove unused input_array_last/output_array_last. +- glsl: Remove UBO reference lowering. +- mesa: Remove unused Mesa IR PROGRAM_* register files. +- gallium/tgsi: Remove tgsi_emulate. +- mesa/st: Remove now unused TGSI paths from disk cache. +- glsl: Retire the non-NIR GLSL linking paths. +- gallium: Remove now-unused shader caps. +- glsl: Remove the unused lower_if_to_cond_assign. +- glsl: Drop the dead MOD_TO_FLOOR path. +- mesa/st: Clean up a bit of st_prepare_vertex_program(). +- compiler/glsl: Remove the dead parts of build_program_resource_list(). +- glsl: Remove EmitNoLoops and the associated lower_jumps(lower_break=true) code. +- glsl: Remove unused lower_variable_index_to_cond_assign. +- svga: Set lower_bitops for vpu9 screen. +- glsl: Stop lowering ir_quadop_vector. +- nouveau: disable fences when running under drm-shim. +- Revert "ci: remove nouveau from shader-db runs" +- ci/iris: Cut the glk-deqp test coverage in half. +- glsl: Make all drivers take the GLSLOptimizeConservatively path. +- turnip: Add a TU_DEBUG=perf debug option. +- ci/crocus: Manual CI updates after CI was down for a bit. +- ci/crocus: Merge the piglit runs with the deqp runs. +- i915g/ci: Add depth-clear-precision-check xfails like everyone else. +- ci/crosvm: Simplify the CID setup. +- ci/crosvm: Rename VSOCK_TEMP_DIR -> VM_TEMP_DIR. +- ci/crosvm: Terminate the previous crosvm after a deqp-runner timeout. +- freedreno/ir3: Fix 16-bit bit_count. +- spirv: Use nir_vec_scalars() to simplify matrix transpose. +- spirv_to_nir: Cast RelaxedPrecision ALU op dests to mediump. +- turnip: Make RelaxedPrecision-decorated ALU ops 16-bit. +- freedreno/ir3: Add support for 16-bit nir_texop_lod. +- freedreno/ir3: Lower texture instructions used only for f2f16 to 16-bit. +- mesa/st: Only scalarize for doubles lowering if we're lowering doubles. +- glsl,nir: Move i/umulExtended lowering to NIR. +- panfrost: always print the bad ALU op if we're failing to translate. +- ci/turnip: Add missing xfails for a618 full VK run +- nir_lower_mediump: Drop assertion about not containing movs. +- freedreno/ir3: Enable load/store vectorization for SSBO access, too. +- freedreno/ir3: Make the types of tex coord padding match. +- freedreno/ir3: Fix validation of half-precision image store values. +- freedreno/ir3: Fold 16-bit conversions into image load/store src/dsts. +- i915g: Ignore the new SEPARABLE_PROGRAM property. +- ci/i915: Merge the piglit and deqp runs. +- ci/i915: Update manual piglit job expectations. +- turnip: Don't bother creating tile_load/store_cs for sysmem rendering. +- turnip: Refactor a bit of tu6_emit_tile_select(). +- turnip: Move binning decisions from FB usage time to FB creation time. +- turnip: Use fb->binning_possible to decide on conditional tile load/stores. +- ci/turnip: Add a little forced touch-testing of XFB with no binning requested. +- turnip: Refactor a bit of repeated code for subpass setup. +- turnip: Include 3d-based CmdClearAttachments() in binning visibility. +- turnip: Refactor a bit of subpass attachment processing. +- turnip: Allow load/store skipping in vkCmdClearAttachments(). +- turnip: Move tile loads back into the draw CS. +- nouveau/nv30: Make sure fsat is lowered in the VS. +- gallium+glsl: Remove EmitNoSat/PIPE_CAP_VERTEX_SHADER_SATURATE +- glsl: Remove stale lower_instructions comments. +- glsl: Move exp/log-to-exp2/log2 lowering to glsl-to-NIR. +- tgsi_exec: Fix inf/nan handling for divide by zero. +- glsl: Drop the div-to-mul-rcp lowering for floats. +- nir: Fix idiv lowering on !NativeIntegers when lower_fdiv is also set. +- glsl: Drop INT_DIV_TO_MUL_RCP lowering. +- ci/crocus: Drop g41's xfail for rgba_half_float_oes. +- ci/iris: Disable skqp until it can be stabilized. +- turnip: Reverse the order of walking pipes or tiles on odd rows. +- mesa/st: Abort the linking on driver link failure. +- .gitignore: Qualify the path for the ignored build directory. +- nir: Rename is_arb_asm to use_legacy_math_rules and document its meaning. +- gallium: Rename MUL_ZERO_WINS to LEGACY_MATH_RULES. +- nir_to_tgsi: Set LEGACY_MATH_RULES cap for use_legacy_math_rules shaders. +- r600/sfn: Implement 0*x=0 behavior for use_legacy_math_rules. +- nouveau/nir: Implement mul_zero_wins behavior for use_legacy_math_rules. +- tgsi_to_nir: Set use_legacy_math_rules for TGSI_PROPERTY_LEGACY_MATH_RULES. +- gallium: Rename PIPE_CAP_TGSI_LEGACY_MATH_RULES to drop "TGSI" +- Revert "nine: Disable optional use of TTN when MUL_ZERO_WINS is available." +- iris: Enable PIPE_CAP_LEGACY_MATH_RULES. +- crocus: Enable PIPE_CAP_LEGACY_MATH_RULES. +- ci/dzn: Skip a test that intermittently times out at a minute. +- ci/iris: Disable blender-demo-cube_diorama on APL. +- ci: Terminate capture-devcoredump before tarring up artifacts. +- turnip: Document some fields about resolves. +- ci/bare-metal: Apply autopep8 to our python scripts. +- ci/bare-metal: Add per-boot-stage timeouts for fastboot and poe. +- ci/bare-metal: Get rid of servo's serial feed threads. +- ci/bare-metal: Add handling of netboot firmwares for servo boards. +- ci/bare-metal: Add test phase timeouts to all boards. +- ci: Remove the unused .test-manual +- ci/vc4: Merge quick_shader in with deqp-gles +- ci/vc4: Turn on deqp-egl testing by default. +- vc4: Propagate txf_ms's dest_type to the lowered txf. +- freedreno/ir3: Disable image/ssbo 16-bit conversion folding pre-a6xx. +- freedreno/a5xx: Set the buffer bit appropriately in XS_CTRL_REG0. +- ci/freedreno: Turn a530 back on by default and update expectations. +- ci/bare-metal: Remove "stage: test" from .baremetal-test. +- ci/bare-metal: Consolidate needs declarations in .baremetal-test-\*. +- ci/bare-metal: Collapse artifacts wget by default. +- ci/traces: GC unused code for DXVK trace replay. +- ci/traces: Drop ZINK_USE_LAVAPIPE ICD override. +- ci/freedreno: Add vulkan+gl integration testing in piglit. +- ci/turnip: Test traces on turnip using zink. +- ci/turnip: Trim the a630 VK run a bit. +- ci: Use "!references" to manage scheduled pipeline rules. +- ci: Pull out farm disables to a reused rule. +- ci: Pull common zink frontend rules to a shared rule list. +- ci/freedreno: Use !references to clean up restricted traces rules. +- ci: Simplify vulkan rules using !references. +- ci: Split core GL from core VK-or-GL rules. +- ci/freedreno: Filter when we run test jobs for VK or GL changes. +- ci/zink: Reuse anv-rules. +- ci/zink: Simplify lavapipe rules setup, and clarify what the rules are for. +- ci: Remove the trailing "when: never"s from rules. +- ir3: Retire the cp postsched pass now that we do RA in SSA. +- ci: Uprev deqp to 1.3.3.0. +- ci/llvmpipe: Skip ssbo.layout.random.all_shared_buffer.36 +- ci: Make the retry policy default for all jobs. +- ci: Rename .ci-run-policy rules to .build-rules. +- ci: Drop .build-rules from container jobs. +- ci: Remove .build-rules from core test job definitions. +- ci: Fix non-freedreno performance jobs running during Marge merges. +- ci: Remove GIT_STRATEGY from .test-manual-mr. +- mesa/st: Disable OES_texture_3D for PIPE_CAP_MAX_TEXTURE_3D_LEVELS==0. +- vc4: Disable OES_texture_3D being exposed. +- vc4: Move previous existing 3D xfails up to the group of 3d xfails. +- vc4: Add notes on the remaining dEQP failures. +- turnip: Account for additional_cs_reserve_size for both bin and render. +- turnip: Fix up per-stage additional size accounting. +- turnip: Fix the reservation for vertex inputs. +- turnip: Refactor vertex input setup a little. +- ir3: Make sure to pass the interp_mode through in our load_bary lowering. +- nir+ir3: Rename load_size_ir3 to load_center_rhw_ir3. +- freedreno: Rename the "SIZE" regs for interpolateAtOffset to "CENTERRHW" +- ir3: Clarify what's happening in the interpolateAtOffset() math. +- ir3: Use non-persp interpolation when appropriate for interpolateAtOffset. +- zink: Do the timestamp-to-ns math in a double to have better precision. +- mesa: Fix the error check for VertexAttrib*. +- vc4: Work around a HW bug with 2-vert line loops. +- vc4: Add links to test bug reports. +- kopper: Fix the return value of kopperSetSwapInterval(). +- kopper: Use the swap interval that was set at swapchain creation time. +- glx: Unify dri2/dri3 vblank_mode logic. +- kopper: Respect the vblank_mode env var. +- egl: Just include driconf.h instead of redefining its values. +- loader: Just include driconf.h instead of redefining its values. +- ci/freedreno: Mark an occasional flake pass that happens on a530. +- ir3: Fix the no-emitted-vertex condition emission in geom lowering. +- mesa/arbprog: Move the GLSLFragCoordIsSysVal handling to prog_to_nir. +- mesa/arbprog: Use nir_lower_io_to_temporaries. +- mesa/arbprog: Stop doing optimization in the ARB program IR. +- ci/crocus: Update portal 2 trace shas for the recent fix. +- ci/crocus: Disable the blender trace. +- ci/crocus: Drop xfails for the recent image external fix. +- freedreno/ir3: Enable core NIR's 16-bit ALU optimizations. +- intel/fs: Simplify brw_barycentric_mode() args. +- ci/bare-metal: Add timeouts to the shell commands called in fastboot. +- ci/freedreno: Switch a630 to manual/disabled for lab maintenance. +- glsl: Use the same NIR path for shared mem lowering as SPIRV does. +- glsl: Remove optimize_swizzles. +- ci/turnip: Add a bit of spilling-vs-ballot testing on a618. +- ci/freedreno: Add some more known flakes for a630 from our IRC logs. +- ci/turnip: Bump up the a630 full run timeout. +- ci/turnip: Add a couple of missing a630 fails. +- Revert "ci/freedreno: Switch a630 to manual/disabled for lab maintenance." +- nir: Split usub_sat lowering flag from uadd_sat. +- ci/crocus: Fix the commented YML for blender-demo-cube_diorama. +- gallivm: Refactor a bit of UBO/SSBO range checking. +- gallivm: Optimize single-invocation SSBO stores. +- ci/virgl: Make the VM-based virgl tests report to #virgl-ci, too. +- nir: Add an opt pass for phis after if choosing between true/false. +- glsl: Remove opt_conditional_discard(). +- nir: Use nir_foreach_phi_src consistently. +- ci/freedreno: Update a630 s8 resolve xfails. +- turnip: Split the tiling config into separate layouts based on CCU usage. +- turnip: Use the GMEM CCU space for attachments when the stores won't. +- zink: Make sure that we keep the existing ici pNext chain on inserts. +- spirv: Mark phis as mediump instead of directly lowering them to 16 bit. + +Enrico Galli (6): + +- microsoft/spirv_to_dxil: Add pass to lower dynamic accesses on ubo[1] +- microsoft/compiler: Fix for arrays of 1 element +- microsoft/compiler: Add support for unnamed ubos +- microsoft/compiler: Fix when using a shadow sampler more than once +- microsoft/spirv_to_dxil: Fix discard semantics +- microsoft/spirv_to_dxil: Fix discard to demote + return pass + +Eric Engestrom (36): + +- meson: replace deprecated dep.get_pkgconfig_variable(...) with dep.get_variable(pkgconfig : ...) +- docs: show and link to the new macOS CI +- broadcom/rpi4-skips: drop duplicated lines +- mailmap: update my email to Igalia +- vulkan: add script to detect & replace identifiers that have been renamed +- vulkan/runtime: use updated tokens from vk.xml +- vulkan: use updated tokens from vk.xml +- nir: use updated tokens from vk.xml +- radv: use updated tokens from vk.xml +- v3dv: use updated tokens from vk.xml +- turnip: use updated tokens from vk.xml +- zink: use updated tokens from vk.xml +- lavapipe: use updated tokens from vk.xml +- pvr: use updated tokens from vk.xml +- anv: use updated tokens from vk.xml +- dzn: use updated tokens from vk.xml +- panvk: use updated tokens from vk.xml +- venus: use updated tokens from vk.xml +- zink: fix portability_subset usage after rename from EXTX to KHR +- zink: drop unused VkPhysicalDevicePortabilitySubsetPropertiesKHR +- zink: add missing guards around \`have_{ext}` +- docs/isl: fix typos & formatting +- vk/util: handful of pythonic cleanups +- vk/util: simplify extensions gen code +- util/list: rename LIST_ENTRY() to list_entry() +- nouveau: use existing list_first_entry() macro instead of re-writing it +- docs/features.txt: mark VK_ANDROID_native_buffer as supported on turnip +- docs/features.txt: mark VK_ANDROID_native_buffer as supported on v3dv +- docs/ci: fix Emma's name +- turnip: expose support for VK_EXT_acquire_drm_display +- ci: add \`--print-errorlogs` to \`meson test` +- bin/gen_release_notes.py: bump advertised vulkan version to 1.3 +- vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties +- wsi/x11: fix memleak in wsi_x11_connection_create() +- meson: replace manual compiler flags with meson arguments +- broadcom: fix dependencies in static_library() calls + +Erico Nunes (11): + +- ci: put lima farm back online +- docs/ci: fix literalinclude of caching setup files +- lima/ppir: optimize branch comparison when possible +- lima/ppir: merge branch nodes in simple cases +- egl: Fix DETECT_OS macro usage +- ci: update linux kernel to v5.17 +- ci: update docs for linux kernel uprevs +- lima/ci: update piglit ci expectations +- lima/ci: reenable lima-mali450-piglit-gpu:arm64 +- lima/ci: Don't skip piglit opengl 1 tests +- ci: Revert "CI: Lima farm is offline" + +Erik Faye-Lund (157): + +- gallium: rename continue shader-cap +- gallium: rename dround shader-cap +- gallium: rename dfracexp/dldexp shader-cap +- gallium: rename ldexp shader-cap +- vulkan: explicitly cast object-type enum +- meson: turn on -Wno-unused-function project-wide +- nir: introduce and use nir_component_mask +- dzn: fixup indent +- dzn: remove all usage of ComPtr<T> +- dzn: always use ID3D12Device1 +- dzn: always use ID3D12GraphicsCommandList1 +- dzn: use ID3D10Blob instead of ID3DBlob +- dzn: pass IDXGIAdapter1 to d3d12_create_device +- dzn: port to d3d12 c-api +- dzn: add a bunch of missing struct-keywords +- dzn: drop auto usage +- dzn: drop using references +- dzn: no more reinterpret_cast +- dzn: use define instead of constexpr +- dzn: use c-style for-statement +- dzn: use c-style initialization +- dzn: c-style casts +- dzn: do not set unused default member initializer +- dzn: port code to plain c +- dzn: merge util sources +- dzn: remove needless defines +- dzn: remove some needless casts +- dzn: clean up pointer syntax +- dzn: add space before += operator +- dzn: drop needless enum-casts +- dzn: fixup indent +- dzn: remove stale cpp_args +- nir/lower_int64: do not try to clamp floats to int-range +- vulkan: do not depend on alignof(void) +- lavapipe: quiet non-conformant warning on ci +- panvk: quiet non-conformant warning on ci +- vulkan: use c_msvc_compat_args for shared code +- vulkan: drop empty vulkan_wsi_args +- microsoft/clc: fixup indentation +- microsoft/compiler: fixup indentation +- meson: remove unused defines +- meson: deprecate specifying osmesa-bits +- gallium/xlib: fix stale comment +- mesa: add missing error-path +- util: fix test on msvc +- lavapipe: tolerate NULL UBO and SSBO descriptors +- lavapipe: tolerate NULL image view descriptors +- lavapipe: tolerate NULL image-buffer view descriptors +- lavapipe: tolerate NULL sampler-buffer view descriptors +- lavapipe: add support for null-descriptors from EXT_robustness2 +- zink: pass screen to extension getter macro +- zink: initialize dispatch-table before queue-init +- zink: call vk-functions through dispatch-table +- zink: pass screen to zink_create_instance +- zink: dynamically load a few functions +- zink: use run-time linking to loader +- dzn: reset correct list +- vulkan/wsi: unbreak win32-support +- d3d12: add missing cmath includes +- tests/graw: rename shaders from .sh to .txt +- dzn: add a debug-flag to wait for the debugger +- dzn: add debug option to redirect stdout/stderr +- editorconfig: remove scons-ism +- editorconfig: remove html-rule +- editorconfig: remove m4-rule +- editorconfig: remove pl-rule +- radeonsi: port amdgcn_glslc build to meson +- freedreno: remove stale makefile +- intel: remove stale makefile +- meson: modernize win_flex stdint.h logic +- c99_compat.h: remove inline and __func__ shims +- util: add missing c99_compat.h includes +- util: remove needless c99_compat.h includes +- egl: remove needless c99_compat.h includes +- gallium/util: remove needless c99_compat.h includes +- mapi: remove needless c99_compat.h includes +- gallium: remove needless c99_compat.h includes +- include: remove needless c99_compat.h includes +- intel/isl: remove needless c99_compat.h includes +- vulkan/device_select: remove needless c99_compat.h includes +- include: drop c99_math.h +- include: drop c11_compat.h +- pvr: do not use c_msvc_compat_args +- turnip: do not do STATIC_ASSERT on a variable +- freedreno: do not use variable in STATIC_ASSERT +- freedreno: do not do STATIC_ASSERT on variables +- iris: do not do STATIC_ASSERT on variables +- freedreno: degrade STATIC_ASSERT to assert +- v3dv: do not do STATIC_ASSERT on variables +- util: add IS_POT macro +- intel/compiler: use macro for power-of-two check +- broadcom/compiler: use macro for power-of-two check +- util: use static_assert directly +- util: implement STATIC_ASSERT using c++11 / c11 primitives +- meson: add back -Werror=vla to msvc compat flags +- ci/windows: add back build-error detection +- d3d12: move cubemap-lowering to common-code +- microsoft/compiler: mark image-functions as such +- microsoft/compiler: make sampler-lowering optional +- microsoft/spirv_to_dxil: lower cube-images to 2d arrays +- dzn: enable binding_model tests +- dzn: correct assert-condition +- dzn: handle stencil-attachment-optimal layout +- dzn: use a null-rtv to handle no-attachment +- dzn: enable two more exts +- util: sort includes +- llvmpipe: assume c99 +- c99_compat.h: assume c99 support +- util: use c11 alignof instead of our own +- glcpp: remove outdated msvc hack +- vulkan/util: assume c11 +- util: use c11 alignas instead of rolling our own +- gallium: remove stale comment +- gallium: reorganize includes +- llvmpipe: use c11 alignas instead of PIPE_ALIGN_VAR +- gallium: use c11 alignas instead of PIPE_ALIGN_VAR +- gallium: use c++11 alignas instead of PIPE_ALIGN_VAR +- gallium: remove repeated check +- gallium: remove unused macros +- gallium: clean up PIPE_ALIGN_STACK-definition +- docs/zink: correct requirement for tbos +- dzn: correct order of src_subres and dst_subres args +- docs: set language to english +- dzn: fill in driver name and info +- dzn: fill in float-control details +- dzn: fill in non-uniform-indexing props +- dzn: fill in bindless props +- dzn: fill in depth/stencil resolve props +- dzn: fill in minmax props +- dzn: fill misc props +- dzn: expose VK_KHR_driver_properties +- zink: remove pointless test +- zink: remove pointless assert +- zink: drop pointless comment +- llvmpipe: merge all rasterizer-bind functions +- llvmpipe: consolidate legacy_points and multisample state +- llvmpipe: clean up bresenham implementation +- docs/zink: correct name of extension +- docs/zink: document required VK_KHR_swapchain_mutable_format +- docs/zink: document required VK_EXT_border_color_swizzle +- zink: warn about missing image2DViewOf3D feature +- zink: fix EXT_color_write_enable check +- gallium/hud: do not use texture-rect for font +- llvmpipe: remove unused function +- zink: clean up extension emitting +- zink: set dynamic indexing caps +- zink: more accurately set {Sampled,Image}1D caps +- zink: more accurately set {Sampled,Image}Buffer caps +- zink: more accurately set ImageQuery cap +- docs: fixup link to virgl docs +- zink: add have_D32_SFLOAT_S8_UINT boolean +- zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking +- zink: type_main -> type_void_func +- zink: add spirv_builder_function_call +- zink: wrap discard in a function +- zink: clamp miplodbias when creating sampler +- docs/zink: document rgtc requirement + +Fabrice Fontaine (1): + +- src/util/futex.h: fix build on 32-bit architectures using 64-bit time_t + +Feng Jiang (1): + +- virgl: do not share virgl_screen between different drm_files + +Filip Gawin (5): + +- r300: Print warning when stubbing derivatives +- r300: keep negation if w is an inline constant +- r300: don't check for unitialized reads when rewriting register +- r300: prefer old not native swizzle in constant folding +- r300: don't read from output transform_r300_vertex_SEQ/SNE + +Francisco Jerez (2): + +- intel/dev: Compute pixel pipe information based on geometry topology DRM query. +- intel/fs/xehp+: Emit scheduling fence for all NIR barriers on platforms with LSC. + +Frank Binns (3): + +- pvr: replace p_compiler.h boolean with stdbool +- pvr: shorten error to err in label names +- pvr: Add TI AM62 as a supported device. + +Georg Lehmann (74): + +- aco: Remove some old comments in aco_opcodes.py. +- aco: Reuse previous -1 result in find_msb to avoid using VOP3. +- nir/lower_mediump: Add a new pass to fold 16bit image load/store. +- aco: Implement image_store d16. +- aco: Implement image_load d16. +- radv: Use nir_fold_16bit_image_load_store_conversions. +- radv: Run copy_prop and dce after folding 16bit sampling/load/store. +- aco: Remove D16 zero components from image stores. +- meson: Tell glslang to be quiet. +- radeonsi: Use nir_fold_16bit_image_load_store_conversions. +- wsi/x11: Don't leak xcb_get_geometry_reply_t. +- nir/opt_algebraic: Fix mask in shift by constant combining. +- nir/opt_algebraic: Simple xor/ishr optimizations. +- radv: Don't retile read-only images. +- nir/opt_algebraic: Optimize Doom Eternal's word extract by LSB. +- aco: Implement uclz. +- aco: Implement usub_sat. +- aco: Implement isub_sat. +- ac/llvm: Implement usub_sat and isub_sat. +- ac/llvm: Implement uclz. +- nir: Add optional lowering for mul_32x16. +- radv: Lower mul_32x16. +- radv: Lower hadd/hradd. +- radv: Lower 8bit isub_sat/usub_sat. +- radv: Enable VK_INTEL_shader_integer_functions2. +- radv, aco: Packed usub_sat/isub_sat. +- zink: Use VK_USE_64_BIT_PTR_DEFINES to check for 64bit platforms. +- radv, aco: Lower txf offset in NIR. +- nir/lower_tex: Add lower_array_layer_round_even option. +- radv, aco: Round texture array layer in NIR. +- vulkan: Spec update to 1.3.217. +- radv: Implement VK_EXT_non_seamless_cube_map. +- aco: Print r128/a16 MIMG bits separately. +- aco: Remove r128_a16 MIMG builder option. +- anv: Implement VK_EXT_non_seamless_cube_map. +- radv: Remove redundant memset in radv_descriptor_set_create. +- radv: Check descriptor pool entry count before allocating a new set. +- radv: Don't check if we need to copy immutable samplers for non push templates. +- aco/ra: Static assert that changing instruction type to VOP2 is valid. +- aco/ra: Move mac encoding optimization to its own function. +- aco/ir: Pad SOP2 and SOPC to the same size as SOPK. +- aco/ra: Optimize some SOP2 instructions with literal to SOPK. +- aco/optimizer: Convert s_add_u32 with literals to s_add_i32 if carry is not used. +- aco/optimizer: Optimize SOPC with literal to SOPK. +- r600/sfn: Add missing std::array include. +- aco: Fix swapping sources in SOPC -> SOPK optimization. +- aco/assembler: Fix s_bitreplicate_b64_b32 on GFX9. +- nir_to_tgsi: Lower uadd_sat/usub_sat. +- zink: Lower uadd_sat/usub_sat. +- r600: Lower uadd_sat/usub_sat. +- ir3: Implement [iu]sub_sat. +- nir/lower_io: Fix array length of buffers larger than INT32_MAX. +- nir/opt_algebraic: Optimize [ui](add|sub)_sat with 0. +- radv,aco: Don't lower and vectorize 16bit iabs. +- radv: Remove ineg from lower_bit_size_callback callback. +- radv, aco: Don't lower 16bit isign. +- ir3: Only run 16bit tex NIR passes on a5xx+. +- ir3: Lower alu to scalar if nir_legalize_16bit_sampler_srcs made progress. +- nir: Rewrite and merge 16bit tex folding pass with 16bit image folding pass. +- nir/lower_mediump: Add an option to only fold if all tex sources can be folded. +- ir3: Stop using nir_legalize_16bit_sampler_srcs. +- aco: Add G16 opcodes. +- aco: Support 16bit sources for texture ops. +- radv: Fold 16bit tex sources. +- aco: Add a note about G16 without A16 to README-ISA.md. +- aco/ir: Generalize (un)ordered_swapped. +- aco/ir: Add integer get_cmp_info. +- aco/ir: Fix swapped nle. +- aco/ir: Add vcmpx opcode to get_cmp_info. +- aco/ir: Add v_cmp_class to get_cmp_info. +- aco/ir: Add swapped opcode for v_cmp_u/v_cmp_o. +- radeonsi: Stop using nir_legalize_16bit_sampler_srcs. +- nir/fold_16bit_tex_image: Default to only_fold_all. +- aco: Force tex operand to have the correct sub dword size before packing. + +Gert Wollny (85): + +- virgl: Extend integer write out output fix to all non-move integers ops +- r600/sfn: Fix store_shared_r600 write masks +- ntt: Add option to not optimized register allocation +- nir: Add pass to split 64 bit vec3 and vec4 variable access and phis +- ntt: Split 64 bit vec3 and vec4 local variables and phis +- virgl: Clear all indirect flags when writing to a temporary +- nir: Don't optimize to 64 bit fsub if the driver doesn't support it +- ntt: remove dead input variables before lowering FS IO +- r600/sb: Don't optimize float GT and GE +- r600: tune nir options +- r600: Add support for TGSI_OPCODE_ATOMIMIN and IMAX +- virgl: don't move input gl_SampleMaskIn to a temp +- r600: Allow eight bit, three channel formats for vertex buffers +- nir: honor lower_double options for ffloor and ffract +- r600/sb: Don't create three source ops with all kcache values +- r600/sb: Fall back to un-optimized shader if scheduling fails +- virgl: Don't let ntt optimize the register allocation +- tgsi: Don't bother setting the TGSI_PROPERTY_SEPARABLE_PROGRAM +- virgl: Submit the TGSI_PROPERTY_SEPARABLE_PROGRAM when the host understands it +- r600: Add number of ALU groups to statistics +- r600: Make sure that LDS instructions only use bank swizzle 012 +- r600: Update nir options +- r600/sfn: rewrite NIR backend +- r600: enable sb by default also for NIR +- r600: Cleanup nir options +- r600/sfn: support nir_op_mulz and legazy math rules +- r600/sfn: Add missing copyright information +- r600/sfn: Don't use assert to test success of operation +- r600: limit loops when trying to merge alu groups +- r600/sfn: Never consider an op with register dest as dead +- r600/sfn: Add support for fdph +- r600/sfn: lower CLIPVERTEX to clip planes +- r600/sfn: emulate pmr::monotonic_buffer_resource if needed +- r600/sfn: clean up some commented out code +- r600/sfn: fix storing shader output array locations +- r600/sfn: lower undef to zero +- r600/sfn: count LDS queue pop reads separately in assembler +- r600/sfn: Fix the kcache failure handling +- r600/sfn: Fix emmision of LDS instructions +- r600/sfn: Fix used of local shared write mask +- r600/sfn: Fix streamout with non-zero component write +- r600: don't dump shader info to files on debug +- r600/sfn: make sure the memory pool is released after shader translation +- r600/sfn: lower the right shader variant +- r600: Delete possible old NIR variant when translating from TGSI +- r600: don't clone the nir shader, just use it +- r600ß/sfn: Don't fall through with VS outputs +- r600/dri-tweak: Disable ARB_shader_image_load_store for "Tomb Raider 2013" +- r600/sb: fall back to unoptimized shader if RA fails +- r600: Don't set TES_EVAL UCP dirty when it is not supported +- virgL: lower fneg and fabs +- virgl: honor host caps in shader cash sha evaluation +- r600/sfn: Factor out fragement shader class for EG +- r600/sfn: Support pre-EG style FS input +- r600/sfn: Prepare StreamOut instruction for pre EG opcodes +- r600/sfn: Handle slot differences between pre-EG and EG +- r600/sfn: Handle chipclass with kcache allocation +- r600/sfn: Handle buffer size query on pre-EG +- r600/sfn: Handle pre-EG GS input fetch +- r600/sfn: Handle primitive input handling for pre-EG +- r600/sfn: Handle non-compressed MS texel fetch +- r600/sfn: Set trans flag for flt_to_int on pre-EG +- r600/sfn: Handle pre-EG buffer fetch +- r600/sfn: Disable FP64 for pre-CEDAR +- r600/sfn: Advertise support for fmulz +- r600/sfn: Enable NIR for pre RG hardware +- r600/sfn: remove leftover debug output +- r600/sfn: Handle R600 class sin/cos +- r600: increase possible stack size in binary code +- r600/sfn: Schedule shift instruction on R600 in t-slot +- r600/sfn: Add GS thread fix just like the TGSI code path +- r600/sfn: Sort FS inputs to make interpolated values come first +- r600/sfn: Fix color outputs when color0 writes all +- r600: Force NOPs when loading AR on R600 class hardware +- r600/sfn: Handle R600 scratch read +- r600: Don't use SB with R600 style scratch reads +- r600/sfn: Handle color0 writes all on R700 like on EG +- r600/sfn: Don't tag mem-ring and stream instructions as exports +- r600/sfn: Don't schedule GDS instructions early +- r600/sfn: Don't scan the whole block for ready instructions +- r600/sfn: Use a heuristic to keep SSBO setup and store close +- r600: Fix reporting TGSI IR support +- r600/sfn: Use a low number for unused target register +- virgl: when reading back wait first, then do the transfer +- r600/sfn: override register ID when it doesn't matter + +Glenn Kennard (1): + +- nv30: Fix non-scissored clears after a scissor has been set + +Greg Depoire--Ferrer (5): + +- util/queue: add missing space to comment in util_queue_destroy +- zink: check for error when initializing util_queue +- lavapipe: check for error when initializing lvp_queue +- util/queue: handle thread cration failure in util_queue_adjust_num_threads +- zink: fix crash on dlclose when zink_internal_create_screen failed + +Guilherme Gallo (48): + +- ci/lava: Encapsulate job data in a class +- ci/lava: Filter log lines from LAVA return +- ci/lava: Return test-suite result as exit code +- ci/lava: Fix LAVA job validation +- ci/lava: Set lava-signal to kmsg +- ci/lava: Always validate the lava job +- ci/lava: Use lava-test-case to run custom scripts in LAVA +- ci/lava: Trap init-stage2.sh background processes +- ci/lava: Fix shebang in deqp-runner.sh +- ci/lava: Parse all test cases from 0_mesa suite +- ci/lava: Let LAVA job submitter run without JWT file +- ci/lava: Reduce LAVA boot phase timeout to 3 minutes +- ci/lava: Cancel the job if the script is interrupted +- ci/lava: Improve exception handling +- ci/lava: Retry when data fetching log RPC call is corrupted +- ci: Fix init-stage2 exit code +- ci: Update trace after CI fix +- ci: Fix tests expectations +- ci/lava: Make job submitter parse the job result +- ci/lava: Fix colored LAVA outputs +- ci/lava: Fix Gitlab Section markers +- ci/lava: Add support for more complex color codes +- ci/lava: Filter out undesired messages +- ci/lava: Improve result parsing regex +- ci/lava: Create Gitlab log sections handler +- ci/lava: Create LogFollower and move logging methods +- ci/lava: Follow job execution via LogFollower +- ci/lava: heartbeat: don't consider kernel message logs +- ci/lava: Wrap job info into a collapsed section +- ci/lava: Wrap job definition dump into a collapsed section +- ci/lava: Add an integration test for LAVA jobs +- ci/lava: Don't print LAVA debug messages +- ci/lava: Highlight kernel messages in bold +- ci/lava: Highlight job retrying message +- ci/lava: Stop printing after the result line +- ci/lava: Flexibilize section marker regexes +- ci/lava: Rename console color names +- ci/lava: Update license header +- ci/lava: Split lava_log into modules +- ci/lava: Detect R8152 issues preemptively and retry +- ci/lava: Make hung job status yellow +- ci/lava: Color red for fatal and yellow for warning +- ci/lava: Add \`slow` pytest marker +- ci/lava: Add canceled job status +- ci/lava: Add delay before calling lava-test-case +- ci/lava: Increase boot timeout +- ci/lava: Only parse result within testcase section +- ci/lava: Remove MINIO_RESULTS_UPLOAD if requested + +Hans-Kristian Arntzen (4): + +- radv: Fix RTPSO hashing of pGroups. +- vulkan: Update Vulkan XML and headers to 1.3.219. +- vulkan: Add common code for VK_EXT_shader_module_identifier. +- radv: Implement VK_EXT_shader_module_identifier. + +Hoe Hao Cheng (4): + +- zink/codegen: remember the fields in feats/props structs of extensions +- zink/codegen: do not include compilation structs with extension structs +- zink/codegen: do not automatically consider extensions promoted +- zink/codegen: use the updated identifiers + +Hyunjun Ko (15): + +- radv: fix assertion for the count of xfb output +- turnip: add an assertion for max descriptor set count. +- ir3: handle intrinsic_load_draw_id when scanning driver constants +- turnip: clamp to 1 for negative bottom-right of viewport. +- turnip: fix wrong size for lrz cs when lrz_track_quirk is enabled. +- freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE +- turnip: keep the depth_clip_disable state at the pipeline builder. +- zink: remove the workaround for depth_clip_enable on turnip +- turnip: Remove an unnecessary assert. +- turnip: expose VK_EXT_shader_module_identifier +- ir3: change maximum size of const files. +- ir3: handle shared consts. +- turnip: clean up unused parameters for user consts. +- turnip: Change to use shared consts for PushConstants +- tu: increase maxPushConstantsSize to 256. + +Iago Toral Quiroga (79): + +- v3dv: ignore no-op barriers +- v3dv: drop obsolete comment +- v3dv: don't emit final noop job if we don't have anything to signal +- v3dv: document Vulkan requirements for signaling operations +- v3dv: ignore barriers for image layout transitions from undefined layouts +- v3dv: drop default sampler states if not used +- v3dv: flag BCL barriers for all-graphics and all-commands stages +- v3dv: skip binning sync if binning shaders don't access external resources +- v3dv: use wait stage for wait semaphores to decide about binning syncs +- v3dv: check input attachment usage as sampled usage +- v3dv: implement VK_EXT_separate_stencil_usage +- v3dv: trivially implement VK_KHR_separate_depth_stencil_layouts +- docs/features: flag VK_KHR_timeline_semaphore as implemented +- v3dv: expose separateDepthStencilLayouts +- v3dv: implement vkGetPipelineExecutableInternalRepresentationsKHR +- v3dv: implement vkGetPipelineExecutablePropertiesKHR +- v3d/compiler: add more stats to prog_data +- v3dv: implement vkGetPipelineExecutableStatisticsKHR +- v3dv: expose VK_KHR_pipeline_executable_properties +- v3dv: don't leak variant QPU when pipeline compile fails +- v3dv: drop unsused struct +- v3dv: expose VK_KHR_shader_float_controls +- v3dv: expose VK_KHR_spirv_1_4 +- v3dv: expose VK_KHR_vulkan_memory_model +- v3dv: fix format usage checks when extended usage flag is set +- broadcom/compiler: don't promote early fragment tests when writing sample mask +- v3dv: consume barriers at the right stages +- v3dv: only clear BCL barrier state if we don't have pending graphics barriers +- v3dv: use an explicit struct type to track barrier state +- v3dv: track sources of barriers +- v3dv: limit sync for barriers to hw queues selected by source mask +- v3dv: don't be too aggressive disabling early Z +- v3dv: use the global RCL EZ disable if we don't have any EZ draws in the job +- broadcom/compiler: disable flags optimization for loop conditions +- v3dv: merge pending secondary barrier state into primary command buffers +- v3dv: handle barriers at the end of a command buffer +- v3dv: fix leak +- broadcom/compiler: fix postponed TMU spills with multiple writes +- broadcom/compiler: don't predicate postponed spills +- v3dv: fix pool descriptor count for inline uniform buffers +- v3dv: fix merge jobs +- nir: fix documentation for uadd_carry and usub_borry opcodes +- v3dv: don't lower uadd_carry and usub_borrow +- v3dv: remove obsolete comment +- nir/serialize: fix missing divergence info after deserialization +- broadcom/compiler: don't try to hide TMU latency at QPU scheduling +- broadcom/compiler: don't over-estimate latency of TMU instructions +- v3dv: fix comment in texel buffer shader copy path +- v3dv: remove unused lowering for nir_intrinsic_load_layer_id +- v3dv: fix comment for point_sprite_mask filed in shader key +- v3dv: clean up get_internal_type_bpp_for_image_aspects +- v3d,v3dv: stop hardcoding various image limits +- v3d: stop using a smaller texture limit in OpenGL +- v3d,v3dv: stop copying and pasting the translate_swizzle helper +- v3dv: only save/restore push constant data for meta operations if needed +- v3dv: move push constant data to the command buffer state +- v3dv: limit upload of indirect push constant data +- v3dv: don't allocate MAX_PUSH_CONSTANTS_SIZE bytes for the push constants UBO +- v3dv: stop tracking push constant buffer references +- v3dv: allocate more push constant buffers if needed +- v3dv: add a dirty state for pending push constants UBO updates +- v3dv: explain why we clear certain state after a draw call +- v3dv: remove duplicate condition +- nir: add nir_address_format_2x32bit_global +- broadcom/compiler: implement 2x32 global intrinsics +- broadcom/compiler: track if a shader uses global intrinsics +- v3dv: implement VK_KHR_buffer_device_address +- v3dv: handle VK_STRUCTURE_TYPE_DRM_FORMAT_MODIFIER_PROPERTIES_LIST_2_EXT +- v3dv: remove assert that poolSizeCount must be larger than 0 +- v3dv: fix offset reported by vkGetImageSubresourceLayout +- v3dv: remove unnecesary FIXME comment +- v3d,v3dv: add a common v3d_hw_prim_type helper +- v3dv: only check binning sync for semaphores for the first CL job +- v3dv: explicitly set sampler min/max filter properties to false +- v3dv: fill in Vulkan 1.2 missing device properties +- v3dv: enable missing Vulkan 1.2 features for supported extensions +- v3dv: enable missing mandatory Vulkan 1.2 features +- broadcom/ci: update list of slow tests +- v3dv: expose Vulkan 1.2 + +Ian Romanick (26): + +- glsl: Fix mixed tabs and spaces in lower_mat_op_to_vec.cpp +- glsl: Add flag to disable part of do_vec_index_to_cond_assign +- nir: Use nir_vector_extract to generate code for ir_binop_vector_extract +- glsl: Delete lower_extracts code +- anv: Remove FS executables when applying the null FS optimization +- intel/fs: Add missing synchronization for WaW dependency +- nir: i32csel opcodes should compare with integer zero +- nir/algebraic: Fix NaN-unsafe fcsel patterns +- nir: Add and use algebraic property "is selection" +- nir/range_analysis: Teach range analysis about fdot opcodes +- intel/compiler: Rename vec4 state URB opcodes to have VEC4\_ prefix +- intel/eu: Validate some aspects of URB messages +- intel/compiler: Move logical-send lowering to a separate file +- intel/fs: Add _LOGICAL versions of URB messages +- intel/fs: Lower URB messages to SEND +- intel/fs: Remove non-_LOGICAL URB messages +- intel/vec4: Set lower_usub_sat +- spirv: Fix array length of buffers larger than INT32_MAX. +- intel/fs: Make logical URB write instructions more like other logical instructions +- intel/fs: Make logical URB read instructions more like other logical instructions +- intel/fs: Don't pass flags to lower_urb_read_logical_send or lower_urb_write_logical_send +- intel/fs: Eliminate "masked" and "per slot offset" URB messages +- intel/fs: Use canonical form for "work around" tags +- nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident +- radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering +- glsl: Remove lower_offset_arrays pass + +Icecream95 (45): + +- clc: Use stringstream for printing spirv errors +- panfrost: Stop overallocating compressed textures +- panfrost: Enable NIR lowering of half float packing +- panfrost: Fix pack_32_2x16 implementation +- pan/bi: Use texture index instead of sampler for message preloading +- pan/va: Improvements to LEA_TEX instructions +- pan/va: Add more transcendental operations +- pan/va: Add SEG_ADD/SEG_SUB operation +- pan/va: Add absneg modifier to V2F32_TO_V2F16 +- pan/va: Improve texture instructions +- nir/lower_tex: Copy more fields in lower_tex_to_txd and friends +- pan/mdg: Keep min_bound at 16 when alignment requires it +- pan/mdg: Use MAX2 to set min_alignment +- pan/mdg: Fix mask usage when filling before a spill +- pan/mdg: Fix disassembly of store instructions +- pan/mdg: Return the instruction from mir_insert_instruction_*_scheduled +- pan/mdg: Fix multiple spilt writes in the same bundle +- panfrost: Copy blend constant into variant even when reusing it +- panfrost: Increase the limit for blend shader variants +- panfrost: Move patched_s out of the pan_blitter_views struct +- nir: Add store_combined_output_pan BASE back +- pan/bi: Read base for combined stores +- pan/mdg: Read base for combined stores +- panfrost: Only write depth / stencil once if MRT is used +- pan/bi: Reverse linear constraint bits +- pan/bi: Add nodearray datastructure +- pan/bi: Use nodearrays for linear constraints +- panfrost: Constant stencil value tracking +- panfrost: Stop using sparse_array for batch BOs +- panfrost: Use a macro for checking for a shared bind type +- panfrost: Remove sync arguments from panfrost_batch_submit +- panfrost: Enable QUAD_STRIP and POLYGON on v6 +- pan/va: Use the _safe iterator when adding blend shader calls +- pan/decode: fflush buffers after dumping and before aborts +- pan/decode: Use tag bits for resource entry count +- pan/decode: Change indent when decoding resources +- panfrost: Only emit images when they are present +- panfrost: Emit the correct number of attributes +- panfrost: Clear the GENERAL varying buffer field if unused +- panfrost: Correctly calculate prefetch suppresion varying index +- panfrost: Add a debug option for checking overflows on pool uploads +- panfrost: Don't unbind recently bound streamout targets +- panfrost: Allow NULL streamout targets +- nine: Make vdecl_index_map always signed +- nine: Only enable thread_submit by default on x86 + +Igor Torrente (2): + +- venus: Add support to VK_KHR_maintenance4 extension +- venus: Use maintenance4 to get max_size_buffer + +Ikshwaku Chauhan (1): + +- Revert "radeon: add EFC support to only VCN2.0 devices" + +Illia Abernikhin (1): + +- vulkan/runtime: repair thread safety in method vk_queue_start_submit_thread() + +Indrajit Das (5): + +- radeonsi/gfx11: VRS changes +- radeonsi: save the fs constant buffer to the util blitter context +- gallium/u_blitter: clear color buffers using color from a constant buffer +- mesa/st: add nir shader to clear color buffers using constant value +- mesa/st: clear color buffers using color from a constant buffer + +Italo Nicola (3): + +- mesa/st: call pipe->link_shader even if ir loaded from cache when linking +- tgsi: Add SEPARABLE_PROGRAM property +- virgl: overpropagate precise flags + +Iván Briano (6): + +- vulkan/wsi/wayland: Fix double free on error condition +- anv: re-alloc push constants after secondary command buffers +- anv: do not get rid of empty/useless fragment shaders +- anv: assert inheritance_info is not NULL +- anv: emit scissors when the pipeline changes +- anv: pipelineStageCreationFeedbackCount is allowed to be 0 + +James Zhu (5): + +- amd: update headers to support decode software ring +- radeonsi/gfx11: update codec support for gfx11 +- radeonsi/vcn: add decode software ring support for gfx11 +- radeonsi/vcn: add jpeg decode support for gfx11 +- amd/common: some ASICs with gfx9 use compute rings for render + +Jan Beich (1): + +- vulkan/wsi: treat EBADF as missing DMA_BUF_IOCTL_{EXPORT,IMPORT}_SYNC_FILE + +Jan Palus (1): + +- pvr: use UINT64_C for 64-bit constant + +Jason Ekstrand (312): + +- clc: Only initialize LLVM once +- clc: Rework logging a bit +- clc: Declare LLVMContexts on the stack +- intel/compiler: Set lower_fisnormal +- nir,microsoft: Move scale_fdiv into a common NIR pass +- nir/opcodes: fisfinite32 should return bool32 +- util: Remove util_cpu_detect +- util/blob: Add align helpers +- util/set: Respect found in search_or_add_pre_hashed +- vulkan: Add a common VkPipelineCache implementation +- vulkan/pipeline_cache: Add helpers for storing NIR in the cache +- anv: Rename a fail label in CreateDevice +- anv: Switch to the new common pipeline cache +- anv: Clean up pipeline cache helpers a bit +- nir: Lower all bit sizes of usub_borrow +- vulkan: Set signals[i].stageMask = ALL_COMMANDS for QueueSubmit2 wrapping +- vulkan: Use ALL_COMMANDS_BIT for waits/signals instead of ~0 +- spirv: Handle Op*MulExtended for non-32-bit types +- vulkan/log: Allow but warn if called with a NULL object +- vulkan/log: Allow but warn for client-invisible objects +- nir/constant_folding: Break TXB folding into a helper function +- nir: Constant fold sampler/texture offsets +- panvk: Take buffer offsets into account in BindVertexBuffers +- panvk: Eliminate unused vertex attributes +- nir/deref: Add an alu-of-cast optimization +- panvk: Add address/range helpers for panvk_buffer +- vulkan/wsi: Close file descriptors in wsi_destroy_image +- vulkan/wsi: Only use a single fd per wsi_image +- vulkan/wsi: Hang on to file descriptors +- vulkan/wsi: Reset the image fence right before vkQueueSubmit +- vulkan/wsi: Set the right stage flags for semaphore waits +- vulkan/wsi: Add signal_fence/semaphore_for_image helpers +- nir/cf: Return a cursor from nir_cf_extract as well +- nir: Fix constant folding for non-32-bit ifind_msb and clz +- nir: Set image_buffers and msaa_images in lower_samplers_as_deref +- nir: Stop assuming shader_info::textures_used is 32-bit +- shader_info: Make images_used a bitset +- nir: Gather samplers_used separately from textures +- util/bitset: Support larger ranges in BITSET_TEST/CLEAR_RANGE +- shader_info: Bump the number of images and textures supported +- gallium/u_threaded_context: Use PIPE_MAX_SHADER_SAMPLER_VIEWS for sampler_buffers +- mesa/st: Clamp MaxImageUniforms to MAX_IMAGE_UNIFORMS +- gallium: Bump PIPE_MAX_SHADER_IMAGES to 64 +- panvk: Call nir_lower_io_arrays_to_elements_no_indirects +- nir/lower_blend: Be more explicit about deref assumptions +- nir/lower_blend: Stop passing the whole options object around +- nir/lower_blend: Support SNORM and integer formats for logic ops +- nir/builder: Add a nir_trim_vector helper +- nir/builder: Add a nir_resize_vector helper +- midgard: Handle FB fetch from non-vec4 output variables. +- nir/lower_blend: Expand or shrink output variables as needed +- panvk: Advertise support for logicOp +- iris: Set BindingTableEntryCount for compute shaders +- nir: Mark negative re-distribution on fadd as imprecise +- clover: Set images/samplers_used when lowering images +- lavapipe: Set shader_info::samplers_used +- gallium: Set shader_info::samplers_used in pstipple_fs +- ttn: Set shader_info::samplers_used +- mesa/st: Set samplers_used in lower_tex_src_plane +- gallium/draw: Properly handle nr_samplers != nr_sampler_views in keys +- llvmpipe: Fill out samplers even if nr_samplers = 0 +- lavapipe: Set images_used in lvp_lower_pipeline_layout +- ntt: Don't gather samplers_declared twice +- panvk: Move CreateDescriptorSetLayout to per-arch +- panvk: Break descriptor lowering into its own file +- panvk: Add a buffer to each descriptor set +- panfrost,panvk: Make fixed_sysval_ubo < 0 mean compiler-assigned +- panfrost: Add some sanity checking for sysvals +- bifrost,midgard: Allow providing a fixed sysval layout +- panvk: Get rid of the per-pipeline sysvals BO +- panvk: Use a flat sysvals struct +- panvk: Stop calling lower_uniforms_to_ubo +- panvk: Put the sysval and push const UBOs at fixed indices +- panvk: Interleave UBOs with multiple descriptor sets +- panvk: Put SSBO addresses in the descriptor buffer +- vulkan: Start collecting enabled features in vk_device +- panvk: Enable robustBufferAccess +- panvk: Advertise VK_KHR_variable_pointers +- panvk: Add an elems field to panvk_buffer_view +- pavnk: Pass bind layouts to texture and image descriptor helpers +- panvk: Implement texture/image queries +- FIXUP: Use 16-bit things for texture sizes +- panvk: Re-arrange descriptor set functions +- panvk: Rewrite the write portion of vkUpdateDescriptorSets +- panvk: Set immutable samplers properly up-front +- panvk: Implement descriptor copies properly +- panvk: Drop panvk_descriptor +- panvk: Round FillBuffer sizes down to a multiple of 4 +- nir: Preserve metadata if remove_dead_derefs makes no progress +- nir: Add a var set version of lower_indirect_derefs +- mesa/st: Use nir_shader_instructions_pass for st_nir_lower_builtin +- mesa/st: Check deref modes in lower_builtin_instr +- mesa/st: Use lower_indirect_var_derefs in st_nir_lower_builtin +- nir: Put the builder first in lower_input_attachments helpers +- nir: Use nir_shader_instructions_pass in nir_lower_input_attachments +- bifrost: Run nir_lower_global_vars_to_local before nir_lower_vars_to_scratch +- panvk: So more nir_lower_tex before descriptor lowering +- panvk: Lower blending after lower_var_copies +- v3dv: Add a create_image_view helper for internal views +- vulkan: Only be clever about vk_image_view::view_format for normal views +- v3dv: Don't use color aspects for depth/stencil images +- v3dv: Drop the region temporary from blit_shader +- vulkan,v3dv: Add a driver_internal flag to vk_image_view_init/create +- radv: Only use PLANE_0 in meta when actually needed +- radv: Use vk_image as the base for radv_image +- radv: Use vk_image_view as the base for radv_image_view +- glsl: Drop this != NULL assertions +- vulkan/pipeline_cache: Implement deserialize for raw objects +- v3dv: Loosen an assert in copy_buffer_to_image_shader +- vulkan: Fall back to raw data objects when deserializing if ops == NULL +- lavapipe: Use the correct ICD path on Win32 +- vulkan,anv,turnip: Add a common CmdBindVertexBuffers wrapper +- radv: Add a sqtt entrypoint for CmdBindVertexBuffers2 +- radv: Use the common CmdBindVertexBuffers wrapper +- lavapipe: Use the common BindVertexBuffers wrapper +- nir/algebraic: Add two more pack/unpack rules +- intel/fs: Drop fs_visitor::emit_alpha_to_coverage_workaround() +- intel/fs: Copy color_outputs_valid into wm_prog_data +- anv: Drop alpha_to_coverage from the NULL FS optimization +- anv: Handle the null FS optimization after compiling shaders +- anv: Don't disable the fragment shader if XFB is enabled +- intel: Only set VectorMaskEnable when needed +- vulkan: Add a base struct for buffers +- anv: Use the base vk_buffer struct +- panvk: Use the vk_buffer base struct +- vulkan/runtime: Add min_lod to vk_image_view +- intel/isl: Add isl_view::min_lod_clamp for IVB+ +- anv: Implement VK_EXT_image_view_min_lod +- isl: Add some asserts about multisampled surfaces +- nir: Handle register sources in lower_phis_to_regs_block +- nir: Add a correctness note for nir_lower_phis_to_regs_block +- nir: Rename nir_gather_xfb_info to nir_shader_get_xfb_info +- nir: Add a nir_xfb_info to nir_shader +- anv: Use nir_shader_gather_xfb_info +- lavapipe: Use nir_shader_gather_xfb_info +- turnip: Use nir_gather_xfb_info +- radv: Use nir_gather_xfb_info +- nir: Drop nir_shader_get_xfb_info +- glsl/nir: Populate nir_shader::xfb_info after linking varyings +- glsl/nir: Stop leaking varyings_info +- glsl/nir: Stash the xfb_info in the nir_shader when linking XFB +- st,nir: Use nir_shader::xfb_info in nir_lower_io_passes +- nir/glsl: Use rzalloc for nir_xfb_info +- nir: xfb_buffer_info::stride is in bytes +- ttn: Populate the images/textures/samplers_used fields in shader_info +- vulkan,anv: Move the image offset/extent sanitize helpers to common code +- radv: Use the common image offset/extent sanitize helpers +- vulkan,radv: Move vk_format_get_plane_format to common code +- vulkan/format: Add a vk_format_get_aspect_format helper +- vulkan: Add a helper for image<->buffer copies +- anv: Use the common image<->buffer copy helper +- vulkan,radv: Steal some image offset/extent helpers from radv +- radv: Use vk_image_buffer_copy_layout +- panvk: Use vk_image_buffer_copy_layout +- lavapipe: Use vk_image_buffer_copy_layout +- nir/deref: Break out a helper for compare_deref_paths +- nir/deref: Use an index in compare_deref_paths +- nir/deref: Make compare_deref_paths take a stop callback +- nir/deref: Re-arrange variable checks in compare_deref_paths +- nir: Increase nir_variable_data::mode to 16 bits +- nir/vars_tests: Use nir_var_mem_global instead of ssbo +- nir/deref: Handle SSBO array bindings specially +- nir/deref: Handle RESTRICT for SSBO deref bindings +- compiler/types: Don't place members in the previous substruct's end padding +- anv/wsi: Stop resetting semaphores +- vulkan/wsi: Fix a signal_semaphore_with_memory check +- wsi: Always signal semaphores and fences in wsi_common_acquire_next_image +- panvk: Use the common AcquireNextImage implementation +- v3dv: Use the common AcquireNextImage implementation +- radv: Drop create_sync_for_memory +- lavapipe: Use the common AcquireNextImage implementation +- pvr: Use the common AcquireNextImage implementation +- turnip: Use the common AcquireNextImage implementation +- turnip: Use the common QueuePresent implementation +- turnip: Set supported_sync_types before wsi_init +- vulkan/wsi: Use a STACK_ARRAY for stage_flags +- vulkan/wsi: Reset fences earlier in wsi_common_queue_present +- vulkan/wsi: Re-arrange QueueSubmit code in wsi_common_queue_present +- vulkan/wsi: Set memory ownership after signaling fences and semaphores +- vulkan/wsi: Signal semaphores and fences from the dma-buf +- intel/compiler: Use NIR_PASS(_, ...) +- anv: Use NIR_PASS(_, ...) +- anv: Properly clamp attachment layer counts +- vulkan/wsi: Use HAVE_LIBDRM to detect DRM instead of !_WIN32 +- vulkan: Add some border color helpers +- panvk: Use the new border color helpers +- lavapipe: Use the new border color helper +- turnip: Use the new border color helpers +- vulkan/nir: Make spirv_data const in vk_spirv_to_nir +- vulkan: Add a vk_pipeline_shader_stage_to_nir helper +- gallium: Add a u_default_clear_buffer helper +- iris: Use u_default_clear_buffer +- panfrost: Use u_default_clear_buffer +- nir/opt_memcpy: lower copies to/from tightly packed types +- nir: Add an options parameter to deref_instr_has_complex_use +- nir/opt_memcpy: Add another case for function_temp +- vulkan/render_pass: Add a better helper for render pass inheritance +- anv: Use CmdBeginRendering for resumes in BeginCommandBuffer when possible +- vulkan/render_pass: Use a special layout for self-dependencies +- vulkan/render_pass: Pass sample locations to barriers +- vulkan/render_pass: Allow for mixed sample counts +- vulkan/render_pass: Support VkAttachmentSampleCountInfoAMD +- vulkan: Fix a comment +- vulkan: Add a base struct for descriptor set layouts +- vulkan: Add a common implementation of pipeline layouts +- lavapipe: Use the vk_descriptor_set_layout base struct +- lavapipe: Use the vk_pipeline_layout base struct +- panvk: Use the vk_descriptor_set_layout base struct +- panvk: Use the vk_pipeline_layout base struct +- vulkan: Depend on vk_pipeline_layout in vk_cmd_enqueue +- anv: Replace an assert() with unreachable() +- util: Re-indent util_sign_extend, comment, and add asserts +- util: Use shifts in util_sign_extend +- util: Use util_sign_extend in fast_idiv_by_const +- mesa/bptc: Use util_sign_extend +- util,nir: Move mask_sign_extend from opt_load_store_vectorize to util +- isl: Use util_sign_extend +- intel/decoder: Use util_mask_sign_extend() +- panfrost,asahi: Use util_sign_extend for unpacking +- nir: Use util_mask_sign_extend when serializing constants +- vulkan/wsi: Pass the size to MapMemory in the SW path +- vulkan/wsi/wayland: Use a single memcpy in the SW path +- vulkan/wsi: Move select_memory_type to common and rework it a bit +- vulkan/wsi: Add a helper for creating CPU images +- vulkan/wsi/x11: Use wsi_configure_cpu_image +- vulkan/wsi/wayland: Use wsi_configure_cpu_image +- vulkan/wsi: Delete SW support from configure_native_image +- vulkan/wsi: Persistently map CPU images +- vulkan/wsi: Compute stride and size in configure_buffer_image +- vulkan/wsi: Align buffer image strides to optimalBufferCopyRowPitchAlignment +- vulkan/wsi: Support tiled CPU images +- vulkan/wsi/x11: Don't leak shm_reply if we don't have dri3 or present +- vulkan/wsi/x11: Only use MIT_SHM if the device supports EXT_external_memory_host +- vulkan/wsi/wayland: Use host pointer import when available +- dozen: Increase optimalBufferCopy*Alignment +- dozen: Stop asking for semaphore/fence signaling +- vulkan/wsi/wayland: Only memcpy if the swapchain is actually software +- vulkan/wsi: Fix structure chaining in wsi_create_buffer_image_mem +- shader_info: Move subgroup_size out of cs and make it an enum +- vulkan: Re-order pipeline hashing +- vulkan: Constify vk_spirv_version +- spirv,vulkan: Set shader_info::subgroup_size +- mesa,glsl,ttn: Set subgroup_size to UNIFORM +- anv: Drop unnecessary parameters to anv_pipeline_compile_cs +- anv: Put a VkPipelineShaderStageCreateInfo* in anv_pipeline_stage +- anv: Use vk_pipeline_shader_stage_to_nir +- vulkan: Add a vk_pipeline_shader_stage_is_null() helper +- anv: Use vk_pipeline_shader_stage_is_null() +- intel,anv,iris,crocus: Drop subgroup size from the shader key +- anv: Implement VK_EXT_shader_module_identifier +- anv: Init supported extensions before WSI +- vulkan/wsi/wayland: Add a BUFFER_TYPE flag +- vulkan/wsi: Add debug variables to force the SW and PRIME buffer blit paths +- intel/blorp: Set uses_sample_shading for MSAA blit shaders +- radv: Set uses_sample_shading for copy shaders +- intel/fs: Use shader_info::fs::uses_sample_shading +- nir: Add a pass for lowering shaders to single-sampled +- intel/fs: Use nir_lower_single_sampled +- glsl/nir: Set uses_sample_shading from glsl_to_nir +- spirv/nir: Set uses_sample_shading from spirv_to_nir +- nir/gather_info: Stop gathering uses_sample_shading +- glsl/nir: Set sample_shading if a FS output ever shows up as an rvalue +- intel/fs: Simplify persample_dispatch +- intel/fs,anv: Rework handling of coarse and sample shading +- panfrost: Simplify sample_shading +- anv: Stop compacting surface state tables +- vulkan: Detect pNext chain loops in vk_foreach_struct() +- anv: Don't use the wrong ARRAY_SIZE +- Revert "vulkan: Detect pNext chain loops in vk_foreach_struct()" +- vulkan: Add a vk_limits.h file for runtime limits +- vulkan: Add an enum for all dynamic graphics states +- vulkan: Add standard sample locations +- vulkan: Add data structures to store all graphics state +- util: Constify __bitset_test_range +- vulkan: Add a common data structure for dynamic states +- vulkan: Add a common implementation of vkCmdSet* +- vulkan: Copy the depth/stencil state optimization code from ANV +- Use vk_foreach_struct_const where needed +- vulkan: Detect pNext chain loops in vk_foreach_struct() (v2) +- anv: Add an instance multiplier to anv_pipeline +- anv: Handle vertex buffer sizes in anv_CmdBindVertexBuffers2 +- anv: Stop recording sample locations per-sample-count +- anv: Convert to using vk_graphics_pipeline_state +- anv: Rework setting primitive topology +- anv: Stop merging DEPTH_STENCIL state +- anv: Switch to using common dynamic state tracking +- vulkan/graphics_state: Improve the depth/stencil optimization code +- anv: Do depth/stencil optimization for dynamic depth/stencil +- vulkan: Call lower_clip_cull_distance_arrays in vk_spirv_to_nir +- vulkan/nir: Don't remove dead XFB outputs +- vulkan: Call gather_xfb_info in vk_spirv_to_nir +- lavapipe: Use more Vulkan NIR heleprs +- vulkan: Allow up to 12 pointers in multialloc +- vulkan: Record shader stages in vk_graphics_pipeline_state +- vulkan: Multisample state isn't always included in fragment shader state +- vulkan: Depth/stencil isn't fragment output state but multisample is +- vulkan: Add a fully_dynamic_state_groups() helper +- vulkan: Fix pipeline libraries with dynamic-only VI or FSR state +- vulkan: Input assembly and depth/stencil can also be fully dynamic +- vulkan: Don't assert VkGraphicsPipelineLibraryCreateInfoEXT::Flags == 0 +- zink: Mark depth/stencil visual tests as flakes +- lavapipe: Use shader_info::uses_sample_shading +- lavapipe: Use the vk_graphics_pipeline_state +- vulkan: Append subpass structures to VkRenderingInfo last +- vulkan: Include self-dep info in rendering continues +- intel/eu: Don't throw validation errors on float MOV_INDIRECT +- lima: Lower undefs to zero and run DCE after from_ssa +- nir/from_ssa: Ignore undef sources +- vulkan: Always populate vk_render_pass_state::render_pass +- vulkan: Improve the docs for vk_subpass_info +- radv: Use both aspects for depth/stencil blit destinations +- vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors + +Jason Volk (4): + +- r600: Fix userspace pointer support for evergreen compute. +- r600: Improve compute memory pool performance; reduce fragmentation. +- r600: Elide downloads for discarded and immutable compute memories. +- radeon: Support shared memory user pointers. + +Jeffrey Knockel (1): + +- egl: dereference XCB drawable pointers + +Jesse Natalie (158): + +- d3d12: Don't block DISPLAY based on format for non-Windows +- wgl: Implement WGL_ARB_create_context_robustness +- d3d12: Hook up robustness extensions +- d3d12: Plug context leaks +- d3d12: Plug screen leaks +- d3d12: Attempt screen reset during context create +- d3d12: Add a unit test for context reset recovery +- microsoft/compiler: Unload DXIL validator library \*after* calling Release() +- d3d12: Correctly key off of polygon stipple enable cap +- mesa: Ensure ARB programs end in a newline +- bin: Add a script for filtering/sorting refcount logs +- u_debug_refcnt: Don't loop for initial refcounts if the initial value is huge +- u_debug_symbol: Use correct sizeof for DbgHelp +- u_debug_symbol: Allow re-init of DbgHelp to work around already-initialized issues +- u_debug_stack: Lock around stack dumps to prevent interleaving +- u_primconvert: Refactor to remove recursion from util_primconvert_draw_vbo +- u_primconvert: Handle take_index_buffer_ownership +- d3d12: Use a pipe_reference in d3d12_bo +- d3d12: Fix leaks in map with do-not-wait +- d3d12: Fix a couple over-releases from incorrect take-ownership flag +- d3d12: Destroy empty residency bo set during early-return +- d3d12: Include windows.h on Windows before dxgicommon.h +- d3d12: Don't use VLAs +- d3d12: MSVC warning around operator precedence causing uint32_t==bool +- dzn: Add new parameter to vk_image_view_init call +- d3d12: Fix forward decl for sw_winsys +- microsoft/compiler: Fixup sampler derefs in tex instrs that don't \*need* samplers +- nir_lower_task_shader: Fix return from lower_task_intrin (bool, not void*) +- d3d12: Re-add missed bitmasks to shader key comparison +- CI/d3d12: Add skips and update baselines +- CI/windows: Install Vulkan SDK from LunarG directly isntead of Chocolatey +- CI/windows: Delete comment for installing vulkan-runtime +- CI/windows: Disable chocolatey progress printing to shrink log size +- CI/windows: Disable LLVM CMake developer warnings to shrink log size +- CI/windows: Move --fraction for deqp-runner to run command line +- dzn: Update fails/flakes +- CI: Re-enable Windows builds +- nir: Consider PNTC to be a varying +- wgl, d3d12: Handle front buffer writes for double-buffered MSAA surfaces backed by swapchains +- meson: Use a C99 STDC_VERSION for flex +- d3d12: Report PIPE_CAP_ACCELERATED as 0 on WARP +- d3d12: Properly set HS input control point count +- dzn: Init sync types before wsi +- mesa: Use C11 alignas for cross-platform SSE support +- CI: Lima farm is offline +- dzn: Skip another test that times out occasionally +- util/disk_cache: Implement disk_cache_get_function_identifier for Windows +- microsoft/clc: Enable tests that pass on server 2022 +- microsoft/clc: Fix test double free in the case of compilation failure +- microsoft/clc: Remove dead image vars +- microsoft/clc: Add a unit test for unused image kernel args +- vulkan_entrypoints_gen: For Windows ARM64EC, prefix symbols with '#' +- dzn: Support Vulkan loader interface v5 +- gallium/util: Move u_dl and u_pointer to src/util +- dzn: Annotate sometimes-unused variables. +- dzn: Define COBJMACROS before any includes +- dzn: Use os_time_sleep instead of Win32 Sleep +- dzn: Use u_dl to load D3D12 +- dzn: Add an eventfd sync implementation +- dzn: Fix maybe-uninitialized warning +- dzn: Skip dxil validator and some Windows-only debug flags for Linux +- dzn: Use IUnknown instead of IDXGIAdapter1 as the stored adapter +- dzn: Don't hash adapter LUID as part of device UUID +- dzn: Use a custom adapter desc instead of DXGI adapter desc +- dzn: Move DXGI code to a separate file and only build it on Windows +- dzn: Add a DXCore enumeration path +- dzn: Add ABI helpers for D3D12 functions returning structs +- vulkan/wsi/win32: Use the new helpers and persistent map +- dzn: Fix winsys reporting +- dzn: Only support high/normal queue priorities +- dzn: Add for condition to break nested loop +- dzn: Re-order includes in dzn_nir to prevent TRUE redefinition +- dzn: Only add Werror flags that the compiler supports +- dzn: Disable Wunused-value for C++ files +- dzn: Fix comparison order for format enums +- dzn: Re-order dzn_physical_device_desc so = { 0 } doesn't initialize a struct +- dzn: Use fabs on floating point input +- CI: Add dzn to debian vulkan/clang/release builds +- mesa: Support D3D11/D3D12 memory imports +- mapi: Add more EXT_external_objects_win32 functions/enums +- gallium: Add a 'name' field to winsys_handle +- mesa: Implement ImportMemoryWin32NameEXT +- gallium: Add 'name' field to Win32 semaphore import +- driver_noop: Remove infinite recursion from create_fence_win32 +- gallium: Add a new fence type with a pipe cap to indicate it can be imported +- mesa: Support importing D3D12 fences as timeline semaphores +- mesa: Implement ImportSemaphoreWin32NameEXT +- gallium, mesa: Support setting timeline semaphore values +- d3d12: Store the rest of the device IDs in the screen +- d3d12: Compute UUIDs required by external objects extension +- d3d12: Add pipe getters for Win32 and base external objects device matching +- d3d12: Support B4G4R4A4 format +- d3d12: Get adapter LUID after device creation +- d3d12: PIPE_BIND_SHARED doesn't mean linear and is always on opened resources +- d3d12: Support creating memory objects +- d3d12: Implement resource_from_memobj +- d3d12: Support opening resources and memobj by name +- d3d12: Support importing fences / timeline semaphores +- d3d12: Implement server signal/wait +- d3d12: Implement fence opening and value setting +- d3d12: Fix up resource import validation +- docs: Update features.txt and new_features.txt +- u_atomic: Implement p_atomic_xchg for Windows +- u_atomic: Fix MSVC p_atomic_add_return +- u_atomic: Add p_atomic_fetch_add which returns the old value +- zink: Use p_atomic_fetch_add +- simple_mtx: Replace GCC sync intrinsics with u_atomic ops +- Fix static glapi on Windows +- dzn: Missed ABI fixes for GetCustomHeapProperties +- dzn: Remove the cast when the SDK version is high enough +- d3d12: Add helpers to build with correct ABI for MinGW +- microsoft/clc: Add helpers to build with correct ABI for MinGW +- subprojects: Point DirectX-Headers at a specific tag +- CI: Update DirectX-Headers dependency for MinGW/Debian +- util: Add a Win32 futex impl +- d3d12: Support clip halfz +- d3d12: Add a blank d3d12_resource_state.h/cpp +- d3d12: Move desired resource state to new files +- d3d12: Move current resource state to new files +- d3d12: Give bos a unique identifier to be used for state tracking +- d3d12: Add a list of contexts alive for the current screen +- d3d12: Hold lock when removing resources from residency list +- d3d12: Notify contexts about deletion of bos +- d3d12: Add a context state tracking structure +- d3d12: Treat depth/stencil as planar for plane count helper +- d3d12: Create/free context state entries +- d3d12: Track a global resource state for non-simultaneous-access resources +- d3d12: Record a state fixup command list when necessary +- d3d12: Swap the remainder of state tracking to new method +- d3d12: Rename bind invalidate options to transition flags +- d3d12: Extract core barrier logic +- d3d12: Optimize transition_subresource_states that covers a whole resource +- d3d12: Add a transition flag indicating that state accumulation is needed +- d3d12: Remove state tracking implementation details from header +- nir_lower_io_to_scalar: Support arrayed (per-vertex) I/O +- microsoft/compiler: Add a max shader model option +- microsoft/compiler: Add a max validator version +- microsoft/compiler: Emit metadata based on long-lived signature data +- microsoft/compiler: Split signature processing into two parts +- microsoft/compiler: Ensure 4-component position writes via NIR +- microsoft/compiler: Lower I/O to scalar +- microsoft/compiler: Support DXIL validator version 1.5 +- microsoft/compiler: Compute correct usage masks +- microsoft/compiler: Correctly compute dynamic indexing I/O masks +- microsoft/compiler: DXIL validator 1.6 counts UAVs differently +- microsoft/compiler: Set the barycentrics flag for attribute_at_vertex +- microsoft/compiler: DXIL validator 1.6 uses a new PSV struct version +- microsoft/compiler: DXIL validator 1.6 uses a new PSV resource struct +- microsoft/compiler: Support DXIL validator 1.6 +- microsoft/compiler: Support DXIL validator 1.7 +- microsoft/compiler: Add a getter for a validator version from DXIL.dll +- microsoft/compiler: Blacklist DXIL validator 1.6 from 20348 SDK +- d3d12: Retrieve validator version +- dzn: Retrieve validator version +- egl/wgl: Delete unused variables/code +- egl/wgl: Fix some awkward sizeof formatting +- microsoft/compiler: Discard shouldn't be marked readnone +- microsoft/compiler: Fix PSV struct when numthreads is 0 + +Jiang Feng (1): + +- virgl: Set res->maybe_busy to true when creating resources + +Jonathan Gray (2): + +- intel/dev: sync ADL-S pci ids with linux +- intel/dev: add RPL-S pci ids from drm-intel-next + +Jonathan Marek (2): + +- freedreno/registers: add a7xx registers for drm/msm kernel driver +- freedreno/registers: add missing varset="chip" for new enum values + +Jonathan Weinstein (1): + +- lavapipe: support instance rate zero for VK_EXT_vertex_attribute_divisor + +Jordan Justen (48): + +- intel/dev: Add device info for RPL-P +- vulkan/wsi/x11: Use atomic_int with sent_image_count +- drm-uapi/i915_drm.h: Update from drm-next (2022-04-28) +- intel/dev: Read hwconfig from i915 +- intel_dev_info: Add --hwconfig command line parameter +- iris: Fix assertion meant to only target the clear-color stride +- nir/divergence: handle more \*_intel intrinsics +- intel/dev: Add INTEL_PLATFORM_DG2_G12 +- intel/dev: Add DG2 G12 PCI IDs +- intel/dev: Enable first set of DG2 PCI IDs +- intel: Fix build of mi_builder_tests by including c99_compat.h +- intel: Build mi_builder_test whenever build-tests is set +- drm-uapi/i915_drm.h: Update from drm-next (2022-05-06) +- intel/gem: Add support for I915_ENGINE_CLASS_COMPUTE +- anv: Add support for I915_ENGINE_CLASS_COMPUTE in init_device_state() +- anv: Move STATE_BASE_ADDRESS programming into init_common_queue_state() +- anv, iris: Add support for I915_ENGINE_CLASS_COMPUTE +- anv, iris: Enable compute engine with INTEL_COMPUTE_CLASS=1 +- intel/dev: Add devinfo::mem to store i915 regions information +- anv: Use memory info from devinfo +- iris/bufmgr: Use memory info from devinfo +- intel/tools: Print memory info in intel_dev_info +- vulkan/wsi: Disable dma-buf sync file if ENOSYS is returned +- intel/compiler: Don't create vec4 reg-set for gen8+ +- intel/dev: Add vram.unmappable.size region info +- intel/tools: Print unmappable region info in intel_dev_info +- intel/dev: Add intel_vram_all_mappable() +- iris/resource: Assert that DG2 CCS buffers don't also try to set BO_ALLOC_SMEM +- iris/resource: Avoid mapping when not needed in iris_resource_init_aux_buf() +- iris/bufmgr: Add all_vram_mappable which is currently always true +- iris/bufmgr: Set mmap_mode to IRIS_MMAP_NONE for lmem in small-BAR mode +- iris: Set clear_color_unknown if the bo is not mappable +- anv/allocator: Add assert and TODO comment for future small BAR uapi +- iris/bufmgr: Add assert and TODO comment for future small BAR uapi +- drm-uapi/i915_drm.h: Update from drm-next (2022-07-22) +- iris/bufmgr: Add I915_GEM_CREATE_EXT_FLAG_NEEDS_CPU_ACCESS for vram mappable buffers +- intel/dev: Use i915 region probed_cpu_visible_size when non-zero +- intel/dev: Determine the amount of free vram using small BAR uapi +- intel/dev: Enable remaining DG2 and ATS-M device IDs +- anv: Fix PHYSICAL_DEVICE_MEMORY_BUDGET_PROPERTIES with large BAR +- intel/dev: Fill in system memory info when using INTEL_DEVID_OVERRIDE +- iris: Drop extra file-descriptor dup in iris_drm_screen_create() +- intel/pci_ids: Update ADL-S strings +- intel/pci_ids: Add 0x468b ADL-S PCI-id +- intel/pci_ids: Update ATS-M device names +- intel/pci_ids: Update DG2 device names +- intel/pci_ids: Add dg2 0x5698 pci-id +- intel/pci_ids: Drop non-upstream dg2 pci-ids + +Jose Maria Casanova Crespo (1): + +- v3dv/ci: increase fraction to 10 on v3dv ci jobs. + +Josh Billingsley (1): + +- driconf: add SD Gundam G Generation Cross Rays + +Joshua Ashton (1): + +- aco: Use movk for AddressHi bits in vertex prolog + +José Fonseca (6): + +- trace: Fix framebuffer state serialization. +- lavapipe: Prevent mapping buffers beyond their size. +- trace: Dump NIR. +- trace: Parse character data more efficiently. +- trace: Bring state dump up to speed. +- trace: Allow to control nir dumping via an environment variable. + +Juan A. Suarez Romero (37): + +- ci/v3dv: remove fixed test +- v3d: report the correct unsupported blit format +- v3d: do not tile 1D textures +- v3d: use surface format defined on pipe_blit +- gallium: add hook on getting canonical format +- v3d: define our own canonical supported formats +- v3d: enable GL_ARB_copy_image extension +- gallium: move get_canonical_format hook to pipe_screen +- v3dv: duplicate key on hashtable insert +- v3dv: store device_id on device init +- v3d/simulator: add support for AMD cards +- mesa: unref syncobj after wait_sync +- v3d: enable early Z/S clears +- v3d: disable early-Z on odd frame dimensions +- v3d: allow TFU blitting for single layer textures +- v3d: add hook to get on-disk shader cache +- v3d: add builtin support for white/black clamp-to-border +- v3dv: check BO mapping result +- docs: update extensions for V3D and VC4 +- ci: bump VK-GL-CTS to 1.3.2.0 +- v3d: fix blending for mixed RT formats +- v3d: expose GL_ARB_draw_buffers_blend +- v3d/ci: Add traces +- v3d: use function to initialize refcount +- v3d: save only required states in blitter +- v3d: release all color buffers on context destroy +- v3d: add ARB_polygon_offset_clamp extension support +- v3dv/ci: Update expected results +- v3d/ci: Add flake test +- v3d/v3dv/ci: update expected results +- v3d/ci: (partially) revert expected list +- v3d/ci: update expected list +- v3dv/ci: Update expected list +- v3d/vc4/ci: Fix typo in skipped list +- v3d/ci: enable ARB_texture_cube_map piglit tests +- vc4: properly restore vc4 debug option +- vc4: store tex sampler in proper register + +Kai Wasserbäch (1): + +- fix(gallivm): Replace LLVMConstF* with LLVMBuild* methods. + +Karmjit Mahil (26): + +- pvr: Fix clang-format errors caused by vk outarrays. +- pvr: Add PVR_WORKGROUP_DIMENSIONS define. +- pvr: Add initial implementation of vkCmdDispatch(). +- pvr: Add compute update shared kernel in vkCmdDispatch(). +- pvr: Add compute update kernel in vkCmdDispatch(). +- pvr: Update pvrsrvkm to fw 1.17 . +- pvr: Add csb enum helper funciton header. +- pvr: Write immutable descriptor words in vkAllocateDescriptorSets(). +- pvr: Fix incorrect samples to cr_isp_aa_mode conversion. +- pvr: Change compute hard coding infrastructure. +- pvr: Remove unused output_selects from vertex shader state. +- pvr: Add graphics pipeline hard coding infrastructure. +- pvr: Enable 33.15.11.3 (AXE-1-16M). +- pvr: Extend hard coding infra to allow per stage per device coding. +- pvr: Fix physical device limits. +- pvr: Handle vdm degen_cull_enable. +- pvr: Fix off by 1 error in buffer_id for ubo pds program. +- pvr: Rename loop iterator variable. +- pvr: Update pvrsrv build version for fixed size fw. +- pvr: Rename uniform pds program to descriptor pds program. +- pvr: Add static consts support in descriptor pds program. +- pvr: Add static consts to hard coding infra. +- pvr: Add initial vkCmdPipelineBarrier skeleton. +- pvr: Add IDF/WDF program for compute pipeline barrier. +- pvr: Add compute pipeline barrier handling. +- pvr: Fix cdm shared reg usage reported to fw. + +Karol Herbst (20): + +- nir: add 16 and 64 bit fisnormal lowering +- vtn: clamp SpvOpImageQuerySize dest to 32 bit +- nv50/nir: align tlsspace to 0x10 +- llvmpipe: PIPE_COMPUTE_CAP_GRID_DIMENSION is uint64_t +- llvmpipe/fence: make the fence id counter atomic +- iris/cs: take buffer offsets into account for CL +- nir: add a nir_remove_non_entrypoints helper +- ntt: make use of new samplers_used field +- clc: parse localSize and localSizeHint +- clc: drop parsingComplete check +- clc: speed up compilation by not relying on opencl-c.h +- clc: fix compiler features_macro CTS Test +- clc: disable opaque pointers until they are supported +- nv50/ir: recoginse Ampere +- nir/gather_info: allow to run it before inlining +- nir/gce: pin call instructions +- nouveau: use the contexts pushbuf and client where possible +- nouveau/mm: make code thread safe +- nv50/ir: fix OP_UNION resolving when used for vector values +- nv50: properly flush the TSC cache on 3D + +Ken Johnson (2): + +- d3d12: Reduce cost of shader key comparisons +- mesa/main: Correct alloca() use without #include "c99_alloca.h" + +Kenneth Graunke (41): + +- anv: Fix INTEL_DEBUG=bat on XeHP +- intel/decoder: Fix binding table pointer decoding with large offsets +- iris: Add FLUSH_HDC to PIPE_CONTROL_CACHE_FLUSH_BITS +- Revert "st/mesa: Transcode ASTC to BC7 (BPTC) where possible" +- intel: Drop Wa_1409226450 (stall before instruction cache invalidation) +- intel: Fix analysis invalidation in eliminate_find_live_channel +- intel/compiler: Move spill/fill tracking to the register allocator +- mesa: Split tx_compress_dxtn into per-format functions +- mesa: Avoid temp images in _mesa_texstore_rgb_dxt1 for GL_RGBA source +- mesa: Require temp images in _mesa_texstore_*_dxt* with SkipImages +- mesa: Avoid temp images in _mesa_texstore_*_dxt* for stride = 0 +- isl: Add an isl_format_supports_typed_atomics() helper. +- intel/compiler: Fix uncompaction of signed word immediates on Tigerlake +- intel: Set a more useful fake devinfo->gtt_size in no-hw mode +- intel/compiler: Fix A64 header construction with a uniform address +- iris: Update comment about 2GB dynamic state range +- iris: Use PIPE_* defines rather than ones from main/config.h +- crocus: Use PIPE_* defines rather than ones from main/config.h +- intel/compiler: Stop including src/mesa/main/config.h +- intel/compiler: Fix brw_gfx_ver_enum.h to be a proper header file +- intel/compiler: Split 3DPRIM_* defines out to a separate header. +- intel/tools: Stop malloc'ing device info in i965_disasm +- intel/compiler: Move opcode_desc handling to a separate header +- intel/compiler: Introduce a new brw_isa_info structure +- intel/compiler: Remove use of thread_local for opcode tables +- intel/compiler: Convert brw_eu.cpp back to brw_eu.c +- intel/compiler: Handle split-sends in EOT high-register pinning case +- intel/fs: Opportunistically split SEND message payloads +- intel/compiler: Avoid copy propagating large registers into EOT messages +- intel/compiler: Don't set SBID on EOT send messages +- mesa: Mark render-to-texture as unsafe if there's no pipe_resource +- intel/eu: Handle compaction when inserting validation errors +- intel/eu: Fix XeHP register region validation for hstride == 0 +- intel/eu: Clarify spec citations for XeHP region restrictions +- intel/eu: Mark header present in URB memory fences on XeHP +- intel/compiler: Lower FIND_[LAST\_]LIVE_CHANNEL in IR on Gfx8+ +- intel/compiler: Delete unused Gfx8+ code in brw_find_live_channel() +- iris: Fix PIPE_CAP_UMA +- iris: Use linear for exported resources if we can't convey tiling +- iris: Delete unused iris_screen::aperture_bytes field +- crocus: Fix memory leaks on iris_resource_create failure paths + +Konrad Dybcio (1): + +- freedreno: Enable A619 + +Konstantin Seurer (106): + +- radv: Fix ray queries with !15854 +- radv: Do not discard hits with t=tmax +- radv: Enable rt primitive culling for spirv2nir +- ci: Always install glslangValidator for build jobs +- radv: Use VAs in radv_meta_buffer.c +- radv: Make fill_buffer_shader non-static +- radv: Format radv_acceleration_structure.c +- radv: Add the fuchsia radix sort +- radv: Build accaleration structures using LBVH +- radv: Clean up the accel-struct build shaders +- radv: Fix lowering ignore_ray_intersection +- vulkan: Add a shader module clone helper +- radv: Copy shader modules to avoid use after free +- radv/radix_sort: Add missing entry points +- radv/radix_sort: Make variable names consistent +- radv: Use the entire morton code as sort key +- ac/nir: Do not set cursor in try_extract_additions +- radv: Move RADV_RT_STAGE_BITS to radv_private.h +- radv: Treat rt stages like compute stages +- radv: Use RADV_RT_STAGE_BITS more often +- nir: Add a ray launch size addr intrinsic +- radv: Allow indirect dispatches without bo +- radv: Indirect unaligned dispatches +- radv,aco: Use ray_launch_size_addr +- radv: Implement vkCmdTraceRaysIndirectKHR +- radv: Advertise indirect ray tracing +- radv: Allow radv_fill_buffer to work with VAs only +- radv: Use radv_fill_buffer for accel struct builds +- Revert "radv: Make fill_buffer_shader non-static" +- radv: Add and use radv_bvh_node_type +- radv: Replace nir_vector_extract with nir_channel +- radv/ci: Skip slow/crashing ray tracing tests +- radv: Enable ray queries by default +- radv: Add fail/flake lists for navi22 +- radv: Fix handling of primitiveOffset +- radv: Ignore transformOffset if transformData is 0 +- vulkan: Use the correct enum in vk_buffer +- radv: Use vk_buffer +- radv: Add and use radv_cp_dma_wait_for_stages +- radv: Some acceleration structure cleanups +- radv: Require an alignment of 64 for accel structs +- radv: Fix the accel struct size calculation +- vulkan: Spec update to 1.3.216 +- spirv: Add plumbing for ray_cull_mask +- radv: ray cull mask +- radv: Add accel struct queries for maintenance1 +- radv: ACCESS_2_SHADER_BINDING_TABLE_READ_BIT_KHR +- radv: PIPELINE_STAGE_2_ACCELERATION_STRUCTURE_COPY +- nir: Handle ray_launch_size_addr in opt_preamble +- radv: vkCmdTraceRaysIndirect2KHR +- radv: Advertise VK_KHR_ray_tracing_maintenance1 +- nir,spirv: Preserve inbounds access information +- nir/nir_lower_io: Optimize 32-bit inbounds access +- radv: Move some rt intrinsics to the top +- radv: Introduce an enum for the any hit status +- radv: Fix the t comparison for triangles +- radv: Refactor lower_rt_derefs +- nir: Add a nir_test_mask helper +- radv: Use nir_test_mask instead of i2b(iand) +- nir: Use nir_test_mask instead of i2b(iand) +- d3d12: Use nir_test_mask instead of i2b(iand) +- intel: Use nir_test_mask instead of i2b(iand) +- microsoft: Use nir_test_mask instead of i2b(iand) +- dozen: Use nir_test_mask instead of i2b(iand) +- radv: Use two bools for ahit_status +- radv: Skip pipeline_no_null_shaders_flag tests +- radv: Fix acceleration structure size queries +- radv: Use a global address for ray_launch_size +- radv: Use a global address for sbt_base +- radv: Only set rt stack size for dynamic stacks +- vulkan: Common vk_format_get_component_bits +- nir: Add a common gen_rect_vertices implementation +- radv: Use nir_gen_rect_vertices +- v3dv: Use nir_gen_rect_vertices +- radeonsi: Set vertex_id_zero_based +- aco: Remove dead nir_intrinsic_load_vertex_id case +- ac/llvm: Remove load_vertex_id handling +- util/format: Use an explicit length for the descs +- util/format: Assert that formats are valid +- radv: Remove format desc null checks +- radv: Use desc->format +- v3dv: Remove format desc null asserts +- turnip: Remove format desc null assert +- gallivm: Remove format desc null checks +- tgsi: Remove format desc null check +- util: Remove format desc null check +- agx: Remove format desc null check +- etnaviv: Remove format desc null check +- llvmpipe: Remove format desc null checks +- panfrost: Remove format desc null check +- r300: Remove format desc null check +- r600: Remove format desc null checks +- radeonsi: Remove format desc null checks +- softpipe: Remove format desc null check +- virgl: Remove format desc null checks +- pvr: Remove format desc null check +- util/format: Remove format desc null checks +- gallium/tests: Remove format desc null checks +- radv: Add meta saving/restoring for predicating +- radv: Use RADV_META_SUSPEND_PREDICATING +- radv: Simplify the meta init fail path +- radv: Fix out of bounds access in RTPSO creation +- vulkan: Fix descriptor set layout allocation scope +- vulkan: Handle descriptor set layout alloc fails +- radv: Use vk_descriptor_set_layout +- radv: Fix stack size calculation with stage ids + +Kristian Høgsberg (1): + +- util/u_debug: Use 'initialized' instead of 'first' + +Kyle Russell (1): + +- etnaviv: don't halve max_instructions for GC400 + +Leandro Ribeiro (1): + +- Revert "egl/wayland: deprecate drm_handle_format() and drm_handle_capabilities()" + +Lionel Landwerlin (120): + +- anv: use shadow surface for stencil input attachment on gfx7 +- intel: fix URB programming for GT1s +- intel/fs: require memory fence commit bit on Gfx9 +- anv: improve INTEL_DEBUG for submit +- anv/intel: add a new debug flag for stalling after every draw/dispatch +- anv: implement DEBUG_SYNC +- anv: allow getting the address of the beginning of the batch +- intel/fs: tidy up lower of ray queries +- ci/iris: mark another test as flaky +- anv: implement VK_EXT_primitives_generated_query +- nir/divergence: handle load_global_block_intel +- anv: fix INTEL_DEBUG=sync +- intel: fixup number of threads per EU on XeHP +- anv: remove unused enum +- anv: fix acceleration structure descriptor template writes +- anv: skip acceleration structure in binding table emission +- anv: reemit 3DSTATE_STREAMOUT after memcpy +- anv: fix clflush usage on utrace copy batch +- anv: limit clflush usage +- vulkan/runtime: 0-out pipeline cache object +- iris: use new kernel uAPI to compute video memory +- anv: fix invalid utrace memcpy l3 config on gfx < 11 +- anv: reset all dynamic state after secondary execution +- anv: add missing logic op set in pipeline dyn state +- Revert "anv: fix dynamic state emission" +- anv: rework sample location +- anv: never emit 3DSTATE_CPS in the pipeline batch +- anv: don't emit 3DSTATE_STREAMOUT in pipeline batch +- anv: don't emit 3DSTATE_WM in pipeline batch +- anv: don't emit 3DSTATE_BLEND_STATE_POINTERS in pipeline batch +- anv: don't emit 3DSTATE_DEPTH_BOUNDS in pipeline batch +- anv: don't emit 3DSTATE_VF_TOPOLOGY in pipeline batch +- anv: remove static_state_mask +- docs: explain state emission in Anv +- intel: fix stall debug option +- nir/cf: return cursor after insertion of cf_list +- nir/lower_shader_calls: don't use nop instructions as cursors +- nir/lower_shader_calls: don't insert code after break/continue +- nir/lower_shader_calls: put inserted instructions into a dummy block +- anv: fix primitives generated queries values +- intel/perf: add support for new opcodes in code generation +- intel/perf: store a copy of devinfo +- intel/perf: add support for dualsubslice count variable +- intel/perf: add support for GtSlice/GtSliceXDualsubsliceY variables +- intel/perf: stop overriding oa_format +- intel/perf: add OA A counter type +- intel/perf: add support new variable counting the number of EUs in slice0-3 +- intel/perf: add max vfuncs +- intel/perf: add new layout for Gfx12.5 products +- intel/perf: disable sseu setting on Gfx12.5+ +- intel/perf: deal with OA reports timestamp values on DG2 +- intel/ds: fix compilation +- u_trace/anv/iris: drop cs argument for recording traces +- intel/disasm: add missing handling of <1;1,0> +- util/debug: add an enable parsing helper +- util/perf: add options to enable/disable tracepoints +- anv: move internal RT shaders to the internal cache +- nir/divergence: handle load_scratch_base_ptr +- nir/divergence: handle load_ray_num_dss_rt_stacks_intel +- anv: don't expose EXT_border_color_swizzle on gfx7 +- intel/validator: validate dst/src types against devinfo support +- anv: fix index buffer emission +- vulkan/wsi: fix crash with debug names on swapchain +- intel/dev: add a helper to update memory info +- intel/dev: add a fallback when memory regions are not available +- anv: limit RT writes to number of color outputs +- anv: disable perf queries on non RCS engines +- anv: make apply_pipeline_layout/compute_push_layout visible to NIR debug +- anv: silence border color swizzle debug message +- clc: add new feature options for intel_clc +- intel/clc: enable fp16 & subgroups for GRL +- anv: disable non uniform indexing of UBOs +- anv: remove device features handling now in runtime +- anv: use the right helper to invalidate memory +- intel/fs: make sure memory writes have landed for thread dispatch +- intel/nir: temporarly disable opt_uniform_atomics for RT/CL +- anv/utrace: use a bo pool for utrace buffers +- intel/fs: ray query fix for global address +- nir/serialize: restore ray query variables +- isl: add new helper for format component compatibility +- anv: track if images can be fast clear with non-zero color +- anv: move gfx12+ CCS_E clear color restriction +- anv: deal with isl format swizzles for buffer views +- anv: validate image view lowered storage formats for storage +- anv: add a comment about handling buffer view swizzles on gfx7 +- intel: protect against empty invalidate ranges +- anv: skip flush/invalidate faster +- spirv: switch to uint64 for rayquery internal type +- anv: ensure tile flush before streamout writes +- intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it. +- anv: remove local computation of dynamic states +- anv: simplify dynamic buffer count in pipeline layout +- anv: break up anv_pipeline_compile_graphics() +- anv: split graphics nir loading +- anv: rename internal function for consistency +- anv: move CreateGraphicsPipelines to common code +- anv: move CreateComputePipelines to common code +- anv: move CreateRayTracingPipelines to common code +- anv: reorder & document fields of anv_graphics_pipeline +- anv: name non dynamic state fields correctly +- anv: fix primitive topology dynamic state emission on gfx7 +- anv: move restart index to gfx state +- anv: remove pipeline setting dynamic state +- intel/dev: deal with i915 unallocated_size on smem +- anv: allocate RT scratch in local memory +- anv: make use of the new smallbar uAPI +- anv: decode init batch with INTEL_DEBUG=bat +- intel/dev: move verification function to a header +- intel/tools/drm-shim: fixup eu_stride for topology +- intel/tools/stub: fixup parsing of --platform= +- intel/dev: add a test verifying that device override works +- intel/dev: remove INTEL_DEVID_OVERRIDE +- anv: fixup PIPE_CONTROL restriction on gfx8 +- anv: don't return incorrect error code for vkCreateDescriptorPool +- anv: fix GetPipelineExecutableStatistics for ray tracing pipelines +- anv: fixup assertions on lowered storage formats +- anv: fix assert in memory budget code when extension is not supported +- intel/fs: fix load_scratch intrinsic +- intel/fs: fixup scratch load/store handling on Gfx12.5+ +- intel/fs: fixup SEND validation check on overlapping src0/src1 + +Liviu Prodea (7): + +- dzn: Fix incompatible pointer type error affecting MSYS2 MINGW32 +- ci/MSVC: LLD is not really used anywhere +- ci/MinGW: LLD is not really used anywhere +- d3d12/dzn/spirv2dxil: Require version library +- Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can't load it +- meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover +- meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don't depend on it directly + +Louis-Francis Ratté-Boulianne (4): + +- d3d12: Move d3d12_nir_lower_vs_vertex_conversion() to a common place +- microsoft/compiler: Add support for more scaled formats +- dzn: Allow some non-native formats to be used as vertex inputs +- dzn: Enable dEQP-VK.pipeline.vertex_input.* test cases + +Lucas Stach (37): + +- etnaviv: update headers from rnndb +- etnaviv: set VIVS_GL_VERTEX_ELEMENT_CONFIG depending on prim type +- etnaviv: don't supertile textures if supertiling is disabled via debug option +- etnaviv: fill all minor GPU features from the kernel +- etnaviv: update headers from rnndb +- etnaviv: correct bits per tile and clear value for HALTI5 +- etnaviv: use new PE pipe address states on >= HALTI0 +- etnaviv: use feature flag to determine which RS states to use +- etnaviv: use feature bit for one const src per instuction limitation +- etnaviv: clean up tiling setup in etna_compile_rs_state +- etnaviv: add support for big tile RS states +- etnaviv: properly set additional DEC400 compression states +- etnaviv: use feature bit to check for big tile support +- etnaviv: add tile size helper +- etnaviv: reinstate accidentially commented assert +- etnaviv: improve single buffer setup +- etnaviv: add linear PE support +- etnaviv: switch depth mode when linear PE is used +- etnaviv: expose real map buffer alignment +- etnaviv: allow mapped buffers during execution +- etnaviv: drm: always use hash to track BO index +- etnaviv: drm: don't cache mmap offset +- etnaviv: drm: make etna_bo_map thread safe +- etnaviv: drm: rename etna_drm_table_lock +- etnaviv: add function to get resource status +- etnaviv: rework resource status tracking (again) +- etnaviv: drop scanout assumption for resources allocated via modifier interface +- kmsro: add 'imx-lcdif' driver support +- etnaviv: move dummy BOs to screen +- etnaviv: add texture descriptor suballocator +- etnaviv: tex_desc: make error handling more consistent +- etnaviv: tex_desc: remove descriptor patch TODO comment +- etnaviv: fix use after free in async shader compile +- etnaviv: remove etna_align_up() function +- etnaviv: move checking for MC2.0 for TS into screen init +- etnaviv: use linear PE rendering only on properly aligned surfaces +- etnaviv: add debug option to disable linear PE feature + +M Henning (7): + +- nouveau/nir: Allow up to 6 nested joins +- nouveau: Emit cctl to flush L1 cache for atomics +- nouveau: Skip cctl for atomic counters in tgsi +- nouveau/nir: Split fewer 64-bit loads +- ttn: Make ttn_optimize_nir match gl_nir_opts +- nv50: Fix tlsSpace alignment +- nouveau: Drop C++03 compat code + +Manas Chaudhary (2): + +- panvk: Add PANVK_DEBUG_DUMP dump mappings +- panvk: Allow PAN_BO_INVISIBLE to be mapped with dump flag + +Manuel Stoeckl (4): + +- dri_interface: add XBGR16161616 format +- gbm: add GBM_FORMAT_[AX]BGR16161616 +- dri: fix incorrect fourcc for R16G16B16A16_UNORM format +- dri: add [ax]bgr16161616 to format lookup tables + +Marcin Ślusarz (40): + +- anv: invalidate all metadata in anv_nir_lower_ubo_loads +- intel/compiler: invalidate all metadata in brw_nir_lower_intersection_shader +- intel/compiler: invalidate metadata in brw_nir_initialize_mue +- compiler: add VARYING_SLOT_CULL_PRIMITIVE +- intel/compiler: handle VARYING_SLOT_CULL_PRIMITIVE in mesh +- intel/compiler: handle gl_Viewport and gl_Layer in FS URB setup +- anv: update task/mesh distribution with the recommended values +- intel/compiler: print name of the unhandled intrinsic +- anv: disable streamout before emitting mesh shading state +- intel/compiler: implement primitive shading rate for mesh +- anv: handle primitive shading rate for mesh +- intel/tools: add support for GEM_CREATE_EXT in intel_dump_gpu +- intel/tools: clean up mmio regs definitions +- intel/tools: drop wrappers around mmio regs macros +- intel/tools/aubinator: add support for 2 "new" subopcodes +- intel/tools/aubinator: list all platforms in help message +- intel/tools: add macros for gfx12+ variant of VCSUNIT0 +- nir: add load_mesh_inline_data_intel intrinsic to nir_divergence_analysis +- nir: add load_task_payload intrinsic to nir_divergence_analysis +- intel/tools: fix 32-bit build +- anv: remove redundant code calculating dynamic states mask +- anv: remove invalid copy/pasted comment +- anv: mask out not applicable state flags when setting up mesh pipeline +- intel/dev: fixup max_cs_workgroup_threads after hwconfig is read +- nir/opt_load_store_vectorize: handle task payloads +- intel/compiler: vectorize task payload loads/stores +- nir/lower_task_shader: insert barrier before/after shared memory read/write +- intel/compiler: assert that base is 0 for [load|store]_shared intrins +- intel: fix typos found by codespell +- intel/common: allocate space for at least one task urb +- intel/compiler: adjust task payload offsets as late as possible +- anv: disable injection of primitive shading rate for mesh +- intel/compiler: print shaders after nir_remove_unused_varyings +- nir: convert unused mesh outputs to shared memory +- intel/compiler: emit URB fences for TASK/MESH +- intel/compiler: there are 4 types of fences on gfx >= 12.5 +- intel/compiler: insert URB fence before task/mesh termination +- intel/compiler: use NIR_PASS more +- intel/compiler: fix mesh urb write regression +- anv: disable task redistribution + +Marek Olšák (311): + +- gallium: move drivers/radeon/* into drivers/radeonsi/ +- gallium: move radeon_winsys.h into gallium/include/winsys/ +- gallium/winsys: move {amdgpu,radeon_drm}_public.h contents into radeon_winsys.h +- gallium/ddebug: implement pipe_vertex_state callbacks +- nir: fix an uninitialized variable valgrind warning in nir_group_loads +- ac/surface: fix an addrlib race condition on gfx9 +- radeonsi: set more state bits for disabled binning +- radeonsi: don't compile monolithic PS for dual source blending +- radeonsi: add an SQTT workaround for chips with disabled RBs +- radeonsi: set PA_SC_EDGERULE according to bottom_edge_rule state +- radeonsi: initialize line stipple config registers +- radeonsi: fix ALPHA_IS_ON_MSB determination +- radeonsi: handle an out-of-bounds sampler/image index better +- radeonsi: don't treat skipped GS copy shader compilation as successful +- radeonsi: dump shaders after si_shader_binary_upload to fix printed LDS stat +- radeonsi: ignore AMD_DEBUG=nodcc if the texture is imported +- radeonsi: run nir_io_add_const_offset_to_base for TES/TCS as late optimizations +- radeonsi: scalarize IO instructions +- radeonsi: add pipe_stream_output_info into si_shader_context +- radeonsi: use nir_shader_get_entrypoint in si_nir_scan_shader +- radeonsi: apply key.ge.opt.kill_{outputs,pointsize,clipdistance} in NIR +- radeonsi: gather pipe_stream_output_info from NIR intrinsics +- radeonsi: move most "info" fields from si_shader_selector into si_shader_info +- radeonsi: move si_shader_info::stage into si_shader_selector +- ac: add ac_nir_optimize_outputs, a NIR version of ac_optimize_vs_outputs +- radeonsi: assign param export indices before compilation +- ac/llvm: remove LLVM pass ac_optimize_vs_outputs +- amd: document chips +- ac/surface/tests: generalize and extend gfx10 tests +- ac: add more non-shadowed registers to the lists +- ac/llvm: add AC_WAIT_EXP for ac_build_waitcnt +- ac/llvm: remove immoffset parameter from ac_build_tbuffer_store +- ac/llvm: remove immoffset parameter from ac_build_tbuffer_load +- ac/llvm: remove inst_offset parameter from ac_build_buffer_store_dword +- ac/llvm: use soffset correctly in ac_build_buffer_load +- ac/llvm: remove inst_offset parameter from ac_build_buffer_load +- ac/llvm: remove immoffset parameter from ac_build_tbuffer_load_byte/short +- ac/llvm: remove unused code from ac_llvm_build.c/h +- ac/llvm: don't use tbuffer_store as a fallback for swizzled stores +- ac/llvm: rename tbuffer -> buffer where tbuffer instructions are not generated +- ac,radeonsi: rework how scratch_waves is used and move it to ac_gpu_info.c +- ac/gpu_info: remove old and unused fields from radeon_info +- ac/llvm: remove unused LLVM helpers +- ac/llvm: don't create the target machine if the LLVM processor is unsupported +- radeonsi: remove an obsolete comment about LLVM 14 +- radeonsi: fail to create pipe_screen if LLVM doesn't support the GPU +- winsys/amdgpu: fix a mutex deadlock when we fail to create pipe_screen +- radeonsi/ci: add polaris11 baseline +- util: add pipe formats R10G10B10X2_SINT, B10G10R10X2_SINT, G16R16_SINT +- util: change util_format_snorm8_to_sint8 -> snorm_to_sint for radeonsi +- ac/gpu_info: rework how num_se is derived +- ac: remove really_needs_scratch, parameter from ac_parse_shader_binary_config +- ac/llvm: set the correct cache policy for sparse buffer loads +- ac/llvm: don't set ac_dlc in visit_intrinsic +- ac: support GR channel order in ac_choose_spi_color_formats +- ac,radeonsi: report SCALED formats as unsupported by samplers and color buffers +- radeonsi: always print the lower-case GPU name for radeonsi-run-tests.py +- radeonsi: cleanup local includes +- radeonsi: align the tessellation ring address to 2MB +- radeonsi: remove unused si_cs_shader_state::uses_scratch +- radeonsi: simplify setting COMPUTE_PGM_RSRC2.SCRATCH_EN +- radeonsi: code style fix +- radeonsi: add si_can_use_compute_blit +- radeonsi: don't report R64_*INT as a sampler format because it doesn't work +- radeonsi: don't allow broken compute image copies into A8R8_UNORM +- radeonsi: add color buffer support for 2_10_10_10 pipe formats +- radeonsi: add color buffer support for R9G9B9E5_FLOAT on gfx10.3 +- radeonsi: implement compute_copy_image between 1D_ARRAY and other texture types +- radeonsi: cosmetic changes in si_compute_copy_image +- radeonsi: assert format support in si_compute_copy_image +- radeonsi: don't expose formats that CB supports, but not shaders +- radeonsi: rename si_test_blit -> si_test_image_copy_region +- radeonsi/test_image_copy: remove whole image copies +- radeonsi/test_image_copy: remove linear and power-of-two size testing +- radeonsi/test_image_copy: remove special SDMA codepaths +- radeonsi/test_image_copy: simplify texture size generation +- radeonsi/test_image_copy: separate generating a random pipe_resource +- radeonsi/test_image_copy: separate printing pipe_resource into a function +- radeonsi/test_image_copy: test all interesting texture targets +- radeonsi/test_image_copy: test copying a block-compressed format +- radeonsi/test_image_copy: test mipmaps +- radeonsi/test_image_copy: test all formats +- radeonsi/ci: update sienna_cichlid results +- radeonsi: handle compressed formats in si_compute_copy_image +- radeonsi: improve the subsampled copy_image calculation +- radeonsi: remove compressed and subsampled gfx copy from resource_copy_region +- radeonsi: remove si_create_sampler_view_custom and related code +- radeonsi: remove si_create_surface_custom +- mesa: consider the sample count when choosing a texture format +- frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers +- amd/registers: hardcode GC base offsets in the json generator +- amd/registers: add gfx11 to the json generator +- amd/registers: add gfx11.json +- amd/registers: add gfx11-rsrc.json +- amd: enable gfx11 in header generator, fix drivers with renamed gfx6-10 defs +- amd: add gfx11 to packet definitions +- amd: import gfx11 addrlib +- amd: add Mesa-only addrlib changes +- ac: don't align VGPRs to 8 or 16 for gfx11 +- ac: implement ac_get_tbuffer_format for gfx11 +- ac: scratch buffer register changes for gfx11 +- ac: implement register shadowing for gfx11 +- ac/llvm: export mrt0 instead of null on gfx11 +- ac/llvm: don't set DLC on gfx11 because it means something else there +- ac/llvm: update pknorm and waitcnt for gfx11 +- ac/llvm: don't set GLC for stores on gfx11 +- ac/gpu_info: set cu_mask correctly for gfx11 +- ac/surface: gfx11 changes +- ac/surface: define gfx11 modifiers +- ac/surface: add gfx11 support to modifiers tests +- radeonsi/gfx11: increase the hw screen offset alignment +- radeonsi/gfx11: always allow DCC stores +- radeonsi/gfx11: expect packed threadID VGPRs +- radeonsi/gfx11: enable NGG-only draw paths +- radeonsi/gfx11: enable arbitrary DCC format reinterpretation +- radeonsi/gfx11: use the new TCS WaveID SGPR to compute vs_rel_patch_id +- radeonsi/gfx11: program inst_pref_size for graphics +- radeonsi/gfx11: buffer descriptor changes +- radeonsi/gfx11: add CB deltas +- ac,radeonsi/gfx11: remove FMASK loads +- ac,radeonsi/gfx11: set SWIZZLE_ENABLE correctly +- radeonsi/gfx11: don't set non-existent SPI_SHADER_USER_DATA_VS_x +- radeonsi/gfx11: don't set COMPR for exports, use 0x3 channel mask instead +- radeonsi/gfx11: implement attributes through memory +- radeonsi/gfx11: scattered register deltas +- radeonsi/gfx11: don't set non-existent CP_COHER_START_DELAY +- radeonsi/gfx11: emit SQ_NON_EVENT for tessellation at the end of IBs +- radeonsi/gfx11: don't use FLUSH_AND_INV_DB_META +- radeonsi/gfx11: TF_RING_SIZE changed to a per-SE size +- radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order +- radeonsi/gfx11: update the initialization of SGPR0/1 registers for HS and GS +- radeonsi/gfx11: change LDS allocation granularity for PS +- radeonsi/gfx11: don't count the non-existent scratch_byte_offset SGPR +- radeonsi/gfx11: resolve MSAA using u_blitter +- radeonsi/gfx11: mark streamout as unimplemented for now +- radeonsi/gfx11: limit CP DMA to max 32KB sizes +- radeonsi: inline si_cp_dma_prefetch in si_draw_vbo for lower overhead +- radeonsi/gfx11: add a workaround for CB perf counters +- amd: add chip identification for gfx1100-1103 +- radeonsi: remove tautologies when setting CU_EN +- amd: rename ring_type --> amd_ip_type and match the kernel enum values +- amd: rename RING_* enums to AMD_IP_* +- amd: replace num_rings[type] with ip[type].num_queues +- ac/gpu_info: simplify HW IP querying +- ac/gpu_info: remove dead GDS query code +- ac/gpu_info: print all IP versions reported by the kernel +- ac/gpu_info: fix incorrect IP versions reported by the kernel +- radeonsi: report 10_10_10_2 scaled formats as unsupported on gfx6-9 as well +- radeonsi: report more missing/broken texture formats as unsupported on gfx6-9 +- ac/llvm: implement memory_barrier_tcs_patch properly to fix LLVM 15 +- radeonsi: wait before s_barrier in TCS epilog to fix LLVM 15 +- ac/llvm: inline emit_membar +- ac/llvm: replace LLVMBuildFence with ac_build_waitcnt(VSTORE) +- ac/llvm: add gl_shader_stage parameter into ac_build_s_barrier +- ac/llvm: move the gfx6 optimization for TCS barriers into ac_build_s_barrier +- ac/llvm: fix the remaining s_barriers for LLVM 15 +- radeonsi: add si_get_tcs_epilog_key +- ac/llvm: skip s_barrier if tess patches don't cross a wave boundary +- radeonsi: inline si_nir_build_llvm +- ac/llvm: remove ac_shader_abi::emit_outputs +- radeonsi: cleanups getting rid of the ambigous "epilogue" word +- radeonsi: get rid of the ambiguous "prologue" word +- radeonsi: try to group stage-specific code in si_llvm_translate_nir +- ac: replace 5 ac_shader_abi::load_* callbacks with 1 intrinsic_load callback +- amd: change chip_class naming to "enum amd_gfx_level gfx_level" +- radeonsi: rename RADEON_FLAG_UNCACHED -> RADEON_FLAG_GL2_BYPASS +- winsys/amdgpu-radeon: rework how we describe heaps +- winsys/amdgpu: handle RADEON_FLAG_ENCRYPTED as a heap modifier +- radeonsi: use PIPE_RESOURCE_FLAG_UNMAPPABLE and DRIVER_INTERNAL more +- winsys/amdgpu-radeon: clean up code related to heaps +- include: update amdgpu_drm.h +- radeonsi: use the new flag AMDGPU_GEM_CREATE_DISCARDABLE +- radeonsi: use AMDGPU_VM_PAGE_NOALLOC to disable MALL (infinity cache) +- radeonsi: fix a crash when failing to create a context +- radeonsi: print an error when failing to create a context +- radeonsi/gfx11: set BIG_PAGE for the attribute ring +- ac/gpu_info: silence a valgrind warning due to amdgpu_query_hw_ip_info +- radeonsi: remove GFX9_MERGED_NUM_USER_SGPR definition +- ac/gfx11: fix the scratch buffer +- radeonsi/gfx11: fix alpha-to-coverage with stencil or samplemask export +- radeonsi/gfx11: don't insert shader code for GS_PIPELINE_STATS_EMU +- radeonsi/gfx11: fix the value of VGT_GS_OUT_PRIM_TYPE at the beginning of IBs +- Revert "radeonsi/gfx11: limit MSAA color buffers to the RGBA channel order" +- radeonsi/gfx11: fix VM faults due to the attribute ring +- radeonsi/gfx11: optimize attribute stores +- radeonsi: merge all preamble states into one +- radeonsi: move CS preamble emission into the winsys +- winsys/amdgpu: initialize IB_PREAMBLE in advance +- radeonsi: fix resource_copy_region with ETC formats (e.g. for Stoney) +- Revert "frontend/dri: allow swapped BGR->RGB channel order for MSAA color buffers" +- Revert "mesa: consider the sample count when choosing a texture format" +- amd: rename fishes to Navi21, Navi22, Navi23, Navi24, and Rembrandt +- r300,r600,svga: save the FS constant buffer for u_blitter to fix a regression +- st/mesa: remove unused GENERIC input and output from the clear VS +- ac/gpu_info: always retile DCC on gfx10 and newer chips +- mesa: clamp UINT values greater than INT_MAX for glGetInteger +- mesa: change GL_MAX_COMPUTE_WORK_GROUP_COUNT to UINT +- mesa: change GL_MAX_UNIFORM_BLOCK_SIZE to UINT +- mesa: change GL_MAX_TEXTURE_BUFFER_SIZE to UINT +- mesa: change GL_MAX_SHADER_STORAGE_BLOCK_SIZE to UINT +- st/mesa: limit MaxComputeWorkGroupCount to INT_MAX - 1 due to failing tests +- st/mesa: round down MaxUniformBlockSize to a multiple of 4 for piglit +- gallium: rename PIPE_SHADER_CAP_MAX_CONST_BUFFER_SIZE to \*_BUFFER0_* +- gallium: add PIPE_CAP_MAX_CONSTANT_BUFFER_SIZE_UINT for UBOs +- gallium: rename PIPE_CAP_MAX_TEXTURE_BUFFER_SIZE->MAX_TEXEL_BUFFER_ELEMENTS_UINT +- gallium: rename PIPE_CAP_MAX_SHADER_BUFFER_SIZE -> \*_UINT +- radeonsi: increase the max compute LDS size to 64KB for gfx7+ +- radeonsi: report correct maximum compute grid sizes +- radeonsi: clamp against MAX_TEXEL_BUFFER_ELEMENTS correctly +- radeonsi: change max TBO/SSBO sizes again and rework max alloc size +- radeonsi: compute PIPE_CAP_MAX_TEXEL_BUFFER_ELEMENTS_UINT correctly +- radeonsi: set the max UBO size same as the max SSBO size +- winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+ +- radeonsi: print LDS size in bytes +- radeonsi: don't use info.gs.invocations if it's not GS +- radeonsi: fix uninitialized wait_mem_scratch_tmz +- radeonsi: remove unused dword from wait_mem_scratch +- radeonsi: fix polygon stippling without color and Z outputs (v2) +- radeonsi: restructure PS no-export fixups +- radeonsi: determine DB_SHADER_CONTROL in si_shader_ps +- radeonsi: set INTERPOLATE_COMP_Z to 0 to work around an EQAA bug +- radeonsi: add BREAK_BATCH at the beginning of IBs +- radeonsi: simplify how pipeline statistic offsets are computed +- radeonsi: rework how VS_STATE_BITS are set for VS, TES, and GS +- radeonsi: rename and regroup VS_STATE definitions +- radeonsi: move GS_STATE bits to the end to make space at the beginning +- radeonsi: rework how vs_state_bits is set and unpacked +- radeonsi: fix an NGG streamout hang with monolithic shaders +- radeonsi: fix a crash in gfx10_sh_query_get_result_resource +- radeonsi: unconditionally enable the streamout overflow query with NGG +- radeonsi: inline gfx10_emit_streamout_begin/end +- radeonsi: fix NGG streamout hang by allocating GDS in the right place +- radeonsi: remove streamout code from shaders if no streamout buffers are bound +- winsys/amdgpu: add a kernel GDS management workaround retrying on -ENOMEM +- radeonsi: allocate GDS only once per process +- radeonsi: allocate only 1 GDS OA counter for gfx10 NGG streamout +- ac,radeonsi: don't export null from PS if it has no effect on gfx10+ +- ac/gpu_info: clamp gart_size_kb and vram_size_kb to fix buggy kernel driver +- gallium/u_threaded: fix buffer busyness tracking +- ac/llvm: fix tcs_wave_id unpacking on gfx11 +- amd/gfx11: add PixelWaitSync packet fields +- radeonsi/gfx11: compile monolithic PS if it writes memory +- radeonsi/gfx11: allocate more space for pipeline statistics +- radeonsi/gfx11: synchronize correctly before setting SPI_ATTRIBUTE_RING_* +- radeonsi/gfx11: don't use memory for waiting for cache flushes +- radeonsi/gfx11: don't allocate unused wait_mem_scratch +- radeonsi/gfx11: drop the ES vertex count requirement +- radeonsi/gfx11: add missing register shadowing code +- radeonsi/gfx11: rework GDS streamout code to single-lane and enable streamout +- amd: update addrlib - trivial changes +- ac/surface: expose all 64K_R_X and 256K_R_X modifiers on gfx11 +- radeonsi: fix random PS wave size +- util: remove our redefinition of assert +- Change all debug_assert calls to assert +- util: remove now unused debug_assert +- ttn: set dest_type for TXQ +- gallium: add pipe_blit_info::dst_sample, update u_blitter to write that sample +- ac/gpu_info: rename info fields to num_cu, memory_bus_width, memory_freq_mhz +- ac/gpu_info: remove tabs +- ac/gpu_info: rework and extend device info to be more informative +- ac/gpu_info: use hw_ip::ip_discovery_version to set IP versions +- ac/gpu_info: use the kernel-reported GFX IP version to set gfx_level +- ac/gpu_info: remove vram_size and gtt_size in favor of \*_kb variants +- ac/gpu_info: remove redundant uvd_decode +- ac/gpu_info: remove redundant jpeg_decode +- ac/gpu_info: remove redundant uvd_encode +- ac/gpu_info: remove redundant vce_encode +- ac/gpu_info: remove redundant vcn_encode +- amd: require amdgpu DRM 3.2.0 from April 2016 +- gallium/radeon: require radeon DRM 2.45.0 from April 2016 +- radeonsi: remove workarounds for radeon DRM < 2.45.0 +- ac/gpu_info: use drm_amdgpu_device_info instead of amdgpu_gpu_info +- ac/gpu_info: remove amdgpu_gpu_info parameter from ac_query_gpu_info +- ac/gpu_info: remove unused has_unaligned_shader_loads +- gallium/radeon: require radeon DRM 2.50.0 (kernel 4.12) from July 2017 +- ac/gpu_info: require amdgpu DRM 3.15.0 (kernel 4.12) from July 2017 +- util/u_threaded: fix a CPU storage crash due to an invalid codepath taken +- glthread: don't ignore glPushAttrib/glPopAttrib when tracking GL_CULL_FACE +- glthread: don't sync on IsEnabled(GL_DEPTH_TEST) by tracking it in glthread +- util/format: add util_format_rgbx_to_rgba helper +- amd/addrlib: fix 3D texture allocation failures on gfx11 +- ac/surface: don't forbid 256KB swizzle modes on smaller gfx11 chips +- radeonsi: unify VGT_TESS_DISTRIBUTION programming +- radeonsi: move SI_MAX_VRAM_MAP_SIZE to si_debug_options.h +- radeonsi: call pipe->blit instead of util_blitter_blit after MSAA resolving +- radeonsi: make si_launch_grid_internal static +- radeonsi: add common helper si_launch_grid_internal_images that is more robust +- radeonsi: remove compute-based DCC decompression because it's broken +- radeonsi: don't do image stores with RGBX, L, LA, I, and SRGB formats +- radeonsi: follow shader_info.float_controls_execution_mode (mostly) +- radeonsi: add need_fmask_expand parameter into si_decompress_subresource +- radeonsi: check for 16-bit hw support instead of relying on options.fp16 +- radeonsi: move compute-related code from si_blit.c to si_compute_blit.c +- radeonsi: fold async_copy into the preceding conditional in si_blit +- radeonsi: cosmetic changes around do_hardware_msaa_resolve +- radeonsi/gfx11: use correct VGT_TESS_DISTRIBUTION settings +- radeonsi/gfx11: set SAMPLE_MASK_TRACKER_WATERMARK = 15 and clean up +- radeonsi/gfx11: use better PRIM_GRP_SIZE_GFX11 setting +- radeonsi/gfx11: skip code in si_update_shaders that has no effect +- radeonsi/gfx11: rename si_calc_inst_pref_size -> si_get_shader_prefetch_size +- radeonsi/gfx11: enable shader prefetch except for initial chip revisions +- radeonsi/gfx11: use a better workaround for the export conflict bug +- radeonsi: make various blit functions non-static +- radeonsi: allow texture_map to upload only 1 sample for MSAA instead of all +- radeonsi: add a randomized blit test +- radeonsi: move the no-AA small prim precision cull constant into an SGPR +- radeonsi: move small prim precision computation out of si_emit_cull_state +- radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 +- glthread: unbind framebuffers in glDeleteFramebuffers +- glthread: call _mesa_glthread_DeleteBuffers unconditionally +- ac/surface: disallow 256KB swizzle modes on gfx11 APUs + +Marek Vasut (3): + +- dri: Check buffer height and avoid overflow +- etnaviv: Rename etna_emit_tex() args +- etnaviv: Implement TXD + +Mark Collins (1): + +- tu: Clamp priority in DRM submitqueue creation + +Mark Janes (11): + +- intel: parse intel_measure environment without side effects +- intel: relax assertion in INTEL_MEASURE +- anv: fix performance bug in INTEL_MEASURE +- intel: provide a callback to clean up after intel_measure_gather +- iris: provide a callback to INTEL_MEASURE to clean up snapshots +- util: define helpful macros for compiler diagnostic features +- iris: pad all structures used in a shader key +- iris: reorder to minimize padding +- intel/compiler: pad all data structures used by shader cache keys +- intel/compiler: reorder shader cache keys to minimize padding +- intel/dev: drop warning for unhandled hwconfig keys + +Martin Roukala (né Peres) (27): + +- ci/b2c: fix the generation of the IMAGE_UNDER_TEST variable +- ci/b2c: bump the kernel to 5.17.1 +- ci/b2c: update b2c to enable monitoring support +- radv/ci: add a Van Gogh dEQP runner +- ci: add ACO_DEBUG to the list of variables to pass down for testing +- ci/radv: remove amdgpu.gpu_recovery=0 from the kernel cmdline +- radv/ci: reboot the DUTs when hitting a ring timeout +- radv/ci: reduce the execution timeout +- radv/ci: re-enable fossils testing +- radv/ci: enable the experimental support for mesh shaders +- radv/ci: update the list of flakes +- radv/ci: allow polaris10 to retry more times +- radv/ci: bump the timeout of vangogh/renoir +- radv/ci: update the vkcts expectations for the CI machines +- radv/ci: remove an expected fail from all families +- radv/ci: try to guess failure expectations for Fiji/Raven/Navi14/Navi22 +- ci/valve: do not extract the install tarball on the runner +- ci/valve: uprev boot2container +- Revert "ci/valve: uprev boot2container" +- radv/ci: test vkd3d on kabini +- ci/radv: move a test from the fail list to the flake list +- ci/radv: add a missing flake +- ci/radv: enable vkcts testing on kabini +- ci/x86: update to llvm 13 +- radv/ci: test the llvm backend on navi21 +- radv/ci: allow specifying a warning pattern +- radv/ci: do not reboot on soft-recovered hangs, just warn + +Matt Coster (20): + +- pvr: csbgen: Formatting pass (PEP-8 plus other minor changes) +- pvr: csbgen: Add a missing @staticmethod decorator +- pvr: csbgen: Make some loops more pythonic +- pvr: csbgen: Cleanup imports +- pvr: csbgen: Remove "dim" functionality from Group +- pvr: csbgen: Remove unused function parameters +- pvr: csbgen: Add typing information +- pvr: csbgen: Isolate "main" function with __name__ check +- pvr: csbgen: Replace map() and filter() with list comprehension +- pvr: csbgen: Fix "local variable may be undefined" lints +- pvr: csbgen: Add \*_unpack() functions for all generated struct types +- pvr: debug: Implement PVR_DEBUG variable +- pvr: csbgen: Make all generated enums unambiguous +- pvr: Add a const qualifier to inline pvr_csb_*() helpers +- pvr: Remove pvr_cmd_struct() macro +- pvr: Make pvr_cmd_pack() macro clearly internal +- pvr: Split out unioned structs from struct pvr_sub_cmd +- pvr: Implicitly assert that the correct sub-command type is present +- pvr: Split pvr_dev_addr_t into a separate header +- pvr: Add helper macros for creating pvr_dev_addr_t instances + +Matt Turner (3): + +- mesa: Drop dead #include "sparc/sparc.h" +- freedreno/ir3: Move the texture array coord fixup to nir +- freedreno/ir3: Pass 16-bit sampler coordinates when possible. + +Matti Hamalainen (7): + +- gallium/tools: fixes to option handling +- gallium/tools: add option for ignoring junk calls in trace dumper +- gallium/tools: reimplement tracediff completely in Python +- pytracediff: add per-line difference highlighting for blocks +- pytracediff: make -M ("method only") option print arguments for differing calls +- pytracediff: change how 'junk' calls are handled +- pytracediff: implement pager ('less') invocation internally + +Max Kellermann (2): + +- gallium/u_threaded: add missing reference counts for draw_multi slots +- gallium/u_threaded: fix offset calculation for draw_multi slots + +Michael Olbrich (1): + +- crocus: export GEM handle with RDWR access rights + +Michael Skorokhodov (2): + +- glsl: Fix ir_quadop_vector validation +- anv: Update line range + +Michel Zou (7): + +- gallium: fix unused symbols warnings +- meson: call run_command with check=true +- vulkan/wsi: fix missing unistd include +- zink: fix pointer size conversion warning +- lavapipe: allow null handleTypes +- meson: drop dladdr check on win32 +- util/xmlconfig: fix -Wmaybe-uninitialized warning + +Mihai Preda (28): + +- gallium: add opaque pointers shim for LLVM < 8.0 +- gallium: use LLVM opaque pointers in draw_llvm.c +- gallium/llvmpipe: use LLVM opaque pointers in lp_bld_interp.c +- gallium/llvmpipe: use LLVM opaque pointers in lp_bld_depth.c +- gallivm: use LLVM opaque pointers in lp_bld_arit.c +- gallivm: use LLVM opaque pointers in lp_bld_assert.c +- gallivm: use LLVM opaque pointers in lp_bld_format_aos.c +- gallivm: use LLVM opaque pointers in lp_bld_conv.c +- gallivm: use LLVM opaque pointers in lp_bld_coro.c +- gallivm: use LLVM opaque pointers in lp_bld_printf.c +- gallivm: use LLVM opaque pointers in lp_bld_flow.c +- gallivm: use LLVM opaque pointers in lp_bld_intr.c +- gallivm: use LLVM opaque pointers in lp_bld_format_s3tc.c +- gallivm: use LLVM opaque pointers in lp_bld_gather.c +- gallivm: use LLVM opaque pointers in lp_bld_nir_soa.c +- gallivm: use LLVM opaque pointers in lp_bld_sample.c +- gallivm: use LLVM opaque pointers in lp_bld_sample_soa.c +- gallivm: use LLVM opaque pointers in lp_bld_struct.c +- gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c +- gallivm: use LLVM opaque pointers in lp_bld_tgsi_aos.c +- gallivm: LLVM opaque pointers small changes +- gallium: refactor a channel loop in draw_llvm.c +- Revert "gallivm: use LLVM opaque pointers in lp_bld_tgsi_soa.c" +- gallium: LLVM-15 contexts use non-opaque pointers +- gallivm: fix a few llvm non-opaque pointers +- gallivm: refactor a bit the cache access in view of LLVM opaque pointers +- gallivm: add lp_build_struct_get() variants that take the LLVM type +- gallivm: push LLVM version guard into assert + +Mike Blumenkrantz (634): + +- nir/lower_tex: fix rect queries with lower_rect set +- mesa/st: set normalized coords for RECT samplers if rects are unsupported +- zink: unset PIPE_CAP_TEXRECT +- zink: radv baseline fix +- zink: nv ci update +- zink: clamp max shader images to 32 +- zink: remove tcs patch slot map +- zink: disable EXT_extended_dynamic_state2 on AMDPRO +- zink: add baseline for amdpro +- zink: reject resource creation if format features don't match attachment +- zink: use mixed zs renderpass for depth read/write +- zink: unconditionally set line width on rasterizer state change +- zink: support restart with PIPE_PRIM_LINES_ADJACENCY if ext is available +- zink: fix extended restart prim types without dynamic state2 +- zink: do not create fences at all if timeline semaphores are supported +- zink: require timeline semaphores +- zink: remove batch lock +- zink: rename zink_query::batch_id +- zink: remove driver-based max_fences throttling +- zink: hook up VK_KHR_create_renderpass2 +- zink: only use VK_DEPENDENCY_BY_REGION_BIT if sync2 is available +- zink: require renderpass2 +- zink: require KHR_imageless_framebuffer +- zink: delete all non-imageless framebuffer code +- zink: remove framebuffer indirection +- zink: rename imageless framebuffer functions +- zink: remove the worst part of basic framebuffer support +- zink: remove ZINK_NO_TIMELINES +- zink: make a kopper debug print into an error +- zink: rework zink_kopper_update() assert +- zink: always bind gfx pipeline at the top of draw +- zink: store min required stride values on the vertex state +- zink: selectively disable dynamic vertex stride +- nir/fold_16bit_sampler_conversions: add a mask for supported sampler dims +- lavapipe: run nir_fold_16bit_sampler_conversions +- zink: set VK_QUERY_RESULT_WAIT_BIT when copying to qbo +- zink: NV_linear_color_attachment +- zink: nv ci update +- zink: fix synchronization when drawing from streamout +- zink: fix xfb counter buffer barriers +- zink: remove xfb_barrier flag +- zink: handle device-local unsynchronized maps +- util/draw: fix map size of indirect buffer in util_draw_indirect_read +- util/draw: handle draw_count=0 when reading indirect parameters +- util/draw: fix indirect draw count readback +- gallium: add PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE +- zink: unset PIPE_CAP_MULTI_DRAW_INDIRECT_PARTIAL_STRIDE +- zink: move the kopper present fence to the displaytarget object +- wgl: support GL 4.6 +- zink: make unsupported blit formats an error +- zink: make update_framebuffer_state() public +- zink: rework clear_depth_stencil hook +- zink: add a stencil blit fallback path +- zink: fix tcs control barriers for use without vk memory model +- zink: fix semantics mask for compute control barriers +- zink: hook up VK_KHR_vulkan_memory_model +- zink: use vulkan memory model shader semantics for tcs barriers +- zink: always use vk mem model for shaders if available +- zink: fix memory_barrier intrinsic +- zink: fix stride=0 for dynamic state vertex stride checking +- zink: add synchronization for buffer clears +- mesa/st: clamp GL_RENDERBUFFER to GL_TEXTURE_2D for sparse queries +- aux/trace: make get_sparse_texture_virtual_page_size useful +- glsl/nir: set new_style_shadow for sparse tex ops as necessary +- zink: fix group memory barrier emission +- vulkan: bump layer api versions to current vk header version +- vk/cache: unbreak vk_pipeline_cache_create with flags +- kopper: add DISPLAY_TARGET bind for depth buffer +- zink: fix/improve swapchain surface info updating +- zink: fix up swapchain depth buffer geometry during fb update +- drisw: remove dead code +- kopper: always fetch and store drawable info +- kopper: move drawable geometry updating up in function +- kopper: store whether screen has dmabuf support +- kopper: copy a bunch of code for texture_from_pixmap +- zink: add supported present modes to kopper displaytarget +- zink: add a util function for finding swapchain resource +- zink: flag swapchains when updating fails +- zink: handle dead swapchains in acquire +- zink: unset image layout when killing swapchain +- zink: rework swapchain object check for acquires +- zink: fix return for zink_kopper_acquire +- zink: handle swapchain acquire failures more directly +- zink: flag swapchain resources as swapchains +- zink/kopper: add a mechanism for checking swapchain status +- zink: radv ci updates +- ac: remove gallivm header include +- kopper: add some error logging for pixmap->texture failure +- kopper: check for modifiers to use modifier functionality +- zink/kopper: add a function for determining if running on software +- kopper: check whether zink is using sw +- kopper: rename a confusing variable +- kopper: implement texture-from-pixmap for software (non-dmabuf) +- zink: fix dmabuf plane returns +- zink: export fd info for all 2d images +- zink: ignore KMS handle types +- zink: remove drm_fd +- llvmpipe: add a ci flake +- llvmpipe: disable statistic queries during draw if queries are disabled +- llvmpipe: remove misleading query comment +- llvmpipe: disable compute statistics queries if queries are disabled +- wgl: always set alpha on kopper windows +- zink: hook up VK_EXT_primitives_generated_query +- zink: pass query object to get_num_queries() +- zink: pass query object to get_num_query_pools() +- zink: pass query object to get_num_results() +- zink: pass screen param to convert_query_type() +- zink: add and use a function to detected emulated primgen queries +- zink: add a flag to zink_query to trigger rasterizer discard workaround +- zink: use VK_EXT_primitives_generated_query when available +- wgl: don't auto-load zink before software drivers +- lavapipe: lower quad_broadcast intrinsics +- zink: add a ci flake +- kopper: invalidate drawables when resizing textures in place +- kopper: pass the current context to dri_flush +- util/blitter: fix sampler restore with 0 saved samplers +- radv: emit fewer framebuffer registers +- zink: restore conditional ordering for query begin/end +- nir/gather_info: flag fbfetch on subpass image loads +- llvmpipe: never infer early zs tests when fbfetch is active +- zink: split renderpasses for TextureBarrierNV() usage +- zink: scale depth bias by factor of 2 +- zink: pass sparse backing page offset to binding function +- zink: pass sparse bind bo offset through for texture binds +- zink: set all usage flags when querying sparse features +- zink: fix multisample conditional in sparse image query +- zink: bump number of image binds that can be batched to 50 +- zink: semi-handle 1D sparse texture rewrites for drivers that don't support them +- zink: decouple descriptor templates from layouts +- zink: flatten push descriptor template into normal template array +- zink: implement indirect buffer indexing +- zink: delete nir_lower_dynamic_bo_access +- lavapipe: enqueue pipeline destruction +- zink: fix null buffer/surface formats +- zink: add more image usage for null surfaces +- zink: use descriptor surfaces for notemplates ref updating +- lavapipe: handle null samplerviews +- gallivm: fix ssbo oob reads +- lavapipe: fix CmdPushDescriptorSetWithTemplateKHR with refcounting +- st/draw_feedback: set constant buffer stride +- gallivm/draw: fix oob ubo reads +- llvmpipe: always set ssbo data pointers for draw +- gallivm: fix oob txf swizzling +- lavapipe: set robustBufferAccess2 and robustImageAccess2 +- lavapipe: ci updates +- docs: EXT_robustness2 for lavapipe +- zink: ci updates +- kopper: remove unused zink include +- zink: remove unused headers/struct members/linkage +- zink: outdent some code +- zink: add a flag to indicate whether a resource is exportable +- zink: break out resource bind-adding into separate function for reuse +- zink: introduce copy contexts +- zink: create a copy context for the screen on init +- Revert "zink: export fd info for all 2d images" +- zink: always check for fb rebinds when starting renderpass +- zink: unset pipe_resource::next pointers when creating resource copies +- zink: add a bind flag to indicate a resource is exported as a dmabuf +- zink: fix conditional for modifier usage +- zink: force memory exports if dmabuf bind is specified +- zink: add a LINEAR drm modifier if rebinding to add dmabuf export +- zink: rebind resources for export as needed +- zink: create images with modifiers any time there is an import handle +- zink: fix up sparse texture sampling for shadow samplers +- zink: clamp out min_lod operands for explicit lod ops +- zink: fix sparse binding for arrayed textures +- zink: set sparse flag in cubemap lowering +- zink: remove misleading sparse comment +- zink: fix sparse texture depth calcs for arrayed textures +- zink: remove implicit query resets +- zink: remove refs from shader images +- zink: reuse local res pointer in set_shader_images +- zink: simplify dumb update flagging in set_shader_images +- zink: read shader image r/w usage from incoming data struct +- zink: copy incoming shader image struct after doing updates +- zink: stop leaking shader image surfaces +- zink: fix 3d shader image miplevel calc for depth +- zink: directly reuse surface ivci when rebinding +- gallium/noop: implement pipe_screen::create_fence_win32 +- aux/trace: screen::create_fence_win32 +- zink: add win32 guards for device extensions with win32 in name +- lavapipe: ignore depth/stencil states if attachment is not present +- lavapipe: fix pipeline handling for dynamic render zs state with pipeline library +- spirv: fix barrier scope assert +- zink: manually validate shaders in debug builds +- zink: fix init with MESA_SHADER_CACHE_DISABLE enabled +- zink: use copy context to eliminate dependency on EXT_calibrated_timestamps +- zink: move get_timestamp to screen +- zink: add tu baseline for a630 +- zink: add an anv-tgl ci job +- zink: never create a sw context unless sw is requested +- zink: only reject non-blittable resource creation if it has no other binds +- zink: add separate error message for push descriptor set alloc fail +- zink: add extra validation for resource creation +- zink: remove input attachment usage when pruning surface usage +- zink: add extended usage and retry if resource creation fails +- zink: fix surface/bufferview cache comparisons +- zink: force render target usage for sampler-only resources +- zink: clamp 1D_ARRAY sparse textures to 2D as needed +- zink: handle PIPE_BUFFER sparse texture queries +- zink: fix non-dynamic vertex stride update flagging +- zink: add c++ guards for zink_screen.h +- zink: add ZINK_DEBUG=sync +- zink: update radv baseline +- zink: rename a fb surface variables +- zink: always use zink_batch_rp to init renderpass +- zink: use zink_batch_no_rp to end renderpasses +- zink: move renderpass handling to zink_render_pass.c +- zink: use global image rebind counter for dmabuf export +- zink: remove fb rebind check during renderpass begin +- zink: use local variable for getting rt barrier info +- zink: break out sample location updating to separate function +- zink: break up swapchain renderpass update to util function +- zink: remove renderpass state flag for swapchain +- zink: pass rt attrib to zink_render_pass_attachment_get_barrier_info() +- zink: fix indentation +- zink: break out color rt attrib analysis into util function +- zink: break out zs rt attrib analysis into util function +- zink: track whether there's any transient attachments bound +- zink: lift some renderpass mechanics up a level in the api +- zink: split out scissor flagging from framebuffer updating +- zink: use dynamic rendering (most of the time) +- turnip: fix assert for max xfb outputs +- zink: only warn once for SRGB fb without KHR_swapchain_mutable_format +- zink: clamp dynamic render layerCount to 1 +- zink: create all descriptor ref templates at once +- zink: add screen-based indexing to descriptor sets +- zink: use descriptor indices in compiler +- zink: use descriptor set index indirection in program init +- zink: use descriptor set index indirection in cache state updates +- zink: flag descriptor sets as changed before updating hashes +- zink: add a "compact" descriptor mode +- zink: use XOR for descriptor hash accumulation +- zink: radv ci updates +- zink: use a zink_render_pass_pipeline_state bit for fbfetch +- zink: add renderpass bits for color/depth r/w +- zink: update radv piglit fails +- vtn: assert that vector shuffle indices are in-bounds +- vtn: assert that composite members have the same bit size as the result +- zink: require draw params ext/cap in ntv if drawid is used +- zink: emit spirv cap for subgroup vote ops +- zink: use correct scope for subgroup vote ops +- zink: fix subgroup vote detection +- zink: lower subgroup width to 1 for unsupported subgroup vote stages +- zink: remove first_frame stalling +- zink: drop wideLines requirement +- zink: drop largePoints requirement +- st/pbo_compute: do pbo readback directly to the buffer object if it exists +- st/pbo_compute: use different calc for non-3d compute buffer sizing +- st/pbo_compute: make compute download conditional in shader slightly more readable +- st/pbo_compute: fix z coords for compute pbos +- zink: only infer modifiers if winsys handle has a stride +- zink: drop requirement for 6 descriptorsets to use bindless +- zink: store the real (non-compact) binding usage for programs +- zink: fix change flagging for compact descriptor cache +- zink: flag all cached descriptors as needing update on program change +- zink: add a #define for descriptor compaction +- zink: make swapchain kill message more descriptive +- st/texture: use base teximage for compute blits +- st/texture: allow compute blits for (some) non-finalized textures +- lavapipe: fix dynamic patchControlPoints +- mesa: throw a log warning any time a fallback texture is used +- zink: fix radv piglit fails +- zink: force queries to start/end out of renderpass +- zink: tu ci updates +- zink: reorder pipeline dynamic state1 substruct +- zink: improve packing for pipeline dynamic state1 info +- zink: break out dynamic state1 pipeline info to struct +- zink: use memcmp for comparing dynamic state1 info +- zink: use dynamic cull mode +- zink: break out pipeline dynamic state2 into struct +- zink: use dynamic patch vertices if available +- zink: expand patch_vertices pipeline key bitsize +- zink: fix framebuffer attachment usage asserts for dummy attachments +- aux/trace: fix sampler view dumping +- aux/trace: implement pipe_screen::is_compute_copy_faster +- zink: disable compute pbos on turnip +- egl/wayland: manually swap backbuffer when using zink +- egl/wayland: skip buffer creation on zink +- kopper: use get_drawable_info path for non-x11 drawables +- ci: disable unit tests +- zink: fix dmabuf plane layout struct scoping +- zink: represent plane offsets using offset from plane 0 vs size of plane +- zink: rename a variable +- zink: handle aux plane imports +- Revert "zink: remove drm_fd" +- zink: add back kms handling +- mesa: improve relocation problem message +- mesa/st: bump param reservation to 20 +- format_utils: properly parenthesize macro params +- d3d12: skip time-elapsed piglit tests in ci +- virgl: add some ci flakes +- mesa/st: bump param reservation to 28 +- mesa: add statevar for atomic counter offsets +- mesa: conditionally set constants dirty for atomic counter binds +- st/glsl_to_nir: call st_set_prog_affected_state_flags() as late as possible +- mesa: handle atomic counter lowering for drivers with big ssbo offset aligns +- zink: remove ANV depth clip control workaround +- zink: disable EXT_primitives_generated_query on turnip +- zink: track invalidation for image resources +- zink: flag renderpass for change if image resource changes valid state +- zink: rename renderpass attrib value +- zink: split out a dynamic render ternary +- zink: hook up surface invalidation to LOAD_OP_DONT_CARE +- zink: invalidate blit dsts if fully covered +- zink: remove buffer valid range tracking from blit +- mesa/streaming-memcpy: add memcpy fallback +- streaming-memcpy: move to src/util/ and compile unconditionally +- mesa/st: use util_streaming_load_memcpy for compute pbo copying +- zink: break out of zs mixed layout update loop when work is done +- zink: only update layout when doing mixed zs attachment renderpass check +- aux/trace: add pipe_context::link_shader +- aux/tc: handle link_shader +- zink: simplify some compiler bo code +- zink: fix buffer descriptor hashing +- zink: always create descriptor templates +- zink: flag optimization pass after inlining uniforms +- zink: break out nir alu typing in ntv +- zink: run nir_lower_alu_to_scalar in optimizer loop for 64bit lowering +- zink: always scalarize pack/unpack alu ops +- zink: don't lower pack/unpack intrinsics +- zink: run more int64 passes during optimization loop if int64 isn't available +- zink: fix ntv partial stores +- zink: flag all shader i/o variables as !is_xfb +- zink: fix variable locations in manual xfb emission +- zink: if a variable is flagged as being xfb, treat it as manual xfb output +- zink: improve/expand manual xfb emission +- zink: implement nir_op_unpack_64_2x32 in ntv +- zink: rewrite 64bit shader i/o as 32bit +- zink: scalarize when rewriting explicit 64bit io +- zink: avoid adding more 64bit alus during bo rewriting +- zink: fix 32bit bo rewriting +- zink: update tu a630 baseline +- zink: add new flakes +- zink: fix feature requirement +- zink: add feature req for border colors +- zink: more requirement docs +- zink: rewrite atomic ssbo intrinsics as atomic derefs +- zink: run copy_prop_vars during optimization +- zink: warn on missing customBorderColorWithoutFormat +- zink: more lavapipe glcts fails +- zink: cap driver inlining using ssa allocation limit +- zink: use separate pointer for swapchain presents +- zink: keep a count of async presents occuring for a given swapchain +- zink: defer old swapchain destruction +- zink: add implicit sync workaround for non-mesa drivers +- zink: unset res->swapchain upon killing a swapchain +- zink: fix up KILL to a more sensible log message +- lavapipe: VK_EXT_border_color_swizzle +- anv: VK_EXT_border_color_swizzle +- zink: fix dynamic stride conditional in pipeline creation +- zink: fix generated tcs update path +- zink: use the patch_vertices value from the shader key, not the context +- zink: unset generated tcs pointer from tes on deletion +- zink: fix generated tcs deletion +- zink: only add necessary binds during dmabuf export +- zink: avoid uninit values in renderpass state +- zink: don't print VK_EXT_shader_atomic_float warning for nir_intrinsic_image_deref_atomic_exchange +- zink: hook up VK_EXT_non_seamless_cube_map +- zink: use VK_EXT_non_seamless_cube_map when available +- zink: rename nonseamless struct members to emulate_nonseamless +- zink: skip some emulated cube code if the extension is present +- zink: fix cubemap lowering for array samplers +- zink: fix cubemap lowering bit size +- zink: clamp dynamic renderArea to framebuffer size +- zink: check for pending clears to determine write status of zs attachments +- zink: add a function for getting the minimum framebuffer layers +- zink: clamp dynamic fb layers better +- zink: clamp renderpass layers better +- zink: anv ci updates +- zink: use uint64_t for batch_id +- zink: remove thread_queue +- zink: use a dynarray for semaphore wait flags +- zink: add batch api for adding a wait semaphore +- zink: add semaphore handling for sparse binds +- zink: detect sparse-only binding queue +- zink: always bind on the sparse queue +- zink: only enable vtn debugging if validation is active +- zink: tu ci baseline update +- zink: radv baseline updates +- drisw: probe kms if fd is passed +- kmsro: add error message on drm ioctl failure +- zink: rename a variable +- zink: unify actual map calls in buffer mapping +- zink: use only the extents for buffer unmap flushes +- zink: fix buffer transfer flushes with offsets +- zink: store and reuse memory heap flags for buffer placement +- zink: move BAR allocation demotion up the stack +- zink: be a little selective about BAR fallback memory type +- zink: hook up VK_EXT_border_color_swizzle +- zink: use VK_EXT_border_color_swizzle +- zink: print runtime warning on missing EXT_border_color_swizzle +- mesa/st: add PIPE_CAP_DITHERING +- zink: unset PIPE_CAP_DITHERING +- mesa/st: increment num_uniforms for hw accel select shaders +- zink: move draw-time barrier generation down a little +- zink: track vertex buffer bind counts on resources +- zink: rework buffer barrier generation +- zink: track image binds +- zink: add a #define for vk shader bits +- zink: collect gfx stages for all bindings during barrier generation +- zink: don't short-circuit gfx stage finding during barrier generation +- zink: relax zink_resource_buffer_needs_barrier checks +- zink: ci updates +- glx/drisw: store the flush extension to the screen +- glx/drisw: invalidate drawables upon binding context if flush extension exists +- zink: fix dual_src_blend driconf workaround +- nir/lower_point_size: apply point size clamping +- mesa: break out PointSizeIsOne setting to util function +- mesa: rename PointSizeIsOne -> PointSizeIsSet +- mesa: skip pointsize exports if pointsize attenuation is enabled +- mesa: enforce pointsize exports if pointsize is being clamped +- mesa: explicitly disallow multiple pointsize exports from generating +- zink: ci updates +- zink: add flag to indicate if a resource is a dmabuf +- zink: flag dmabufs for foreign queue transition on flush_resource call +- lavapipe: zero out blend info if blend isn't enabled +- lavapipe: copy more pNexts for pipeline creation +- lavapipe: fix renderpass info handling during pipeline creation +- lavapipe: add a pipeline library assert +- lavapipe: skip post-copy pNext checking during pipeline creation for composites +- zink: use the bigger of the variable type and interface type for bo sizing +- zink: track ssbo bind counts +- zink: track overall samplerview bind counts +- zink: fix image bind counting +- zink: track gfx/compute descriptor barrier info +- zink: use tracked barrier info for generated barriers +- zink: remove another zink/tu fail +- zink: fix-ish depth clipping without VK_EXT_depth_clip_enable +- zink: add a turnip driver workaround for EXT_depth_clip_enable +- zink: remove swizzle from fbfetch lowering +- lavapipe: always set point_tri_clip +- zink: fix kopper_acquire return value +- zink: track whether current swapchain has data +- zink: move to AoS for storing swapchain struct data +- zink: move 'acquired' flag onto swapchain struct +- zink: use better determination for handling swapchain acquire submits +- zink: remove broken kopper assert +- zink: remove unnecessary code in kopper readback acquire +- zink: improve no-op acquire detection +- zink: flag batch as having work during frontbuffer flush +- zink: ensure pending present flushes are handled during frontbuffer flush +- zink: delete zink_resource_object::acquire +- zink: no-op pixmap frontbuffer flush calls +- zink: acquire swapchain images on image map +- zink: use kopper_displaytarget type directly +- zink: delete zink_resource_object::dt_has_data +- zink: set PIPE_RESOURCE_FLAG_DONT_MAP_DIRECTLY for device-local resources +- lavapipe: break out part of shader optimization into function +- lavapipe: streamline xfb shader setup +- nir/types: fix glsl_matrix_type_is_row_major() assert +- zink: store context flags +- zink: don't increment screen->num_contexts for copy context +- zink: disable turnip traces temporarily +- lavapipe: don't remove xfb outputs +- zink: enforce viewport depth clamping +- zink: size uniform_0 ubo accurately +- zink: further improve bo sizing +- zink: force unsized array in bo creation using the last block member type +- zink: move/refactor bo_vars struct creation in compiler +- zink: handle bitsizes in get_bo_vars() analysis +- zink: don't run nir_lower_uniforms_to_ubo repeatedly +- zink: pass zink_shader to optimize_nir during final optimize pass +- zink: add an optimizer pass to enforce bo bounds +- zink: use std430 packing by default +- zink: use a std430 interface for bo types +- mesa: remove incomplete texture warning +- mesa/st: make get_sampler_view_format() public +- mesa/st: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO +- zink: init driver workarounds earlier in screen creation +- zink: disable customBorderColorWithoutFormat on turnip +- zink: don't warn for missing customBorderColorWithoutFormat on turnip +- zink: support PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_FREEDRENO +- zink: flush pending clears for fb texture barriers +- mesa: fix SignalSemaphoreEXT behavior +- lavapipe: don't overwrite entire VkFormatProperties3 struct +- lavapipe: don't crash on null xfb buffer pointer +- zink: reject swizzled format blits +- lavapipe: support inlined shader spirv for compute +- zink: allow multiple tex components for depth tg4 +- zink: always use 32bit sample ops +- zink: add a compiler pass to match up tex op dest types +- zink: use sampler_bind_count to simplify some code +- zink: break out samplerview layout reset code +- zink: always update sampler descriptor layouts on fb surface unbind +- zink: always use 32bit floats for so output types +- zink: handle bare matrix types in xfb emission +- zink: split xfb block emission from array/matrix handling +- zink: fix xfb array inlining +- zink: use right glsl length getter for ntv partial stores +- zink: bitcast extracted streamout components to uint before creating uvec +- zink: add a compiler pass to split xfb block outputs +- zink: fix xfb emit check in compiler +- mesa: move is_wrap_gl_clamp() to samplerobj.h and deduplicate +- mesa: track which sampler wrap params use GL_CLAMP +- mesa: update GL_CLAMP emulation when binding/unbinding textures +- zink: remove bogus range tracking from texture clear +- zink: fix transient attachment rp assert +- zink: remove format check from clear texture +- zink: fix coverage check for texture clears +- zink: stop using u_blitter for texture clears +- zink: improve zink_clear_depth_stencil check for current attachment +- zink: massively simplify zink_clear_depth_stencil +- zink: remove non-renderpass clear path from zink_clear_texture +- zink: remove u_blitter usage from zink_clear_render_target +- zink: always use storeOp=STORE for depth renderpass +- zink: track a bitmask of fb attachments with mismatched layer counts +- zink: add explicit (awful) handling for fb layer mismatch clears +- zink: clamp color clear values based on format +- zink: lift clearing on fb state change up a level +- zink: remove out-of-renderpass clears +- zink: delete zink_fb_clear_util_unpack_clear_color +- zink: delete srgb tracking for clears +- zink: simplify zink_framebuffer_clear_data union +- zink: inject a 0,0,0,1 clear for RGBX formats +- zink: break out tc/trace context unwrapping +- zink: nv baseline update +- zink: tu a630 baseline update +- mesa/st: add implicit zeroing of clipdistance array +- zink: move descriptor_mode to be a global +- zink: delete ZINK_DESCRIPTOR_MODE_NOFALLBACK +- zink: add an explicit ZINK_DESCRIPTOR_MODE_CACHED +- zink: default to lazy/templated descriptors +- zink: handle max_vertices=0 in geometry shader +- zink: handle null samplerview in get_imageview_for_binding() +- zink: fix crashing on bo allocation failure +- zink: improve oom error message +- zink: call bind_last_vertex_stage() when binding vs +- zink: fix viewport count tracking +- zink: use gfx_barrier for synchronization when binding buffer descriptors +- vulkan/wsi: fix multiple acquires for sw without mit-shm +- llvmpipe: ignore multisample state for alpha_to_one +- zink: don't try to start a renderpass before CmdClearAttachments +- zink: don't flag renderpass change on depth write change as often +- zink: flag renderpass change if renderpass clears change +- zink: do dynamic render implicit clear updates on rp_changed +- zink: unflag ctx->rp_changed after dynamic render update +- zink: remove redundant renderpass update check for clears +- zink: add an assert to verify that deferred clears aren't added in renderpass +- zink: split out rp_changed to be more granular for dynamic render +- zink: break out renderpass attachment load op calc +- zink: break out zs renderpass layout calc +- zink: break out color rt layout calc +- zink: reuse rt layout functions to refactor zink_render_pass_attachment_get_barrier_info() +- zink: add a bits union for zink_render_pass_state comparisons +- zink: improve granularity of renderpass switching +- zink: allow no-op renderpass updates in zink_batch_rp() +- zink: move layout-setting for dynamic render attachments after no-op case +- zink: handle null samplerview/image rebinds more gracefully +- zink: scale number of shader cache-get threads +- zink: invoke descriptor_program_deinit for programs on context destroy +- vulkan/wsi: return VK_SUBOPTIMAL_KHR for sw/x11 on window resize +- zink: add env var to abort on device-lost if no reset callback is set +- util/vbuf: handle multidraws +- anv: disable dev.i915.perf_stream_paranoid=0 warning +- lavapipe: propagate xfb info for pipeline library rasterization stages +- zink: remove deqp fails for lavapipe +- zink: only flag fbfetch as rp update if fbfetch wasn't already configured +- zink: assert !rp_changed after renderpass has started +- zink: avoid crashing when bindless texture updates aren't used +- lavapipe: EXT_non_seamless_cube_map +- lavapipe: accurately set image/ssbo access based on shader usage +- lavapipe: implement extreme uniform inlining +- lavapipe: remove some skips +- zink: add a ci flake +- lavapipe: move vertex input dynamic states into vertex input block +- lavapipe: make binding/layout helper functions static inline +- lavapipe: use correct offset calculation for flagging ssbo/image write access +- lavapipe: use 64bit masks for shader access +- lavapipe: set image access on pipeline bind, not descriptor bind +- llvmpipe: remove weird clamping for format support with sample counts +- llvmpipe: allow resource_copy_region to perform replicate operations +- lavapipe: fix stencil clears +- llvmpipe: add a shortcut for blitter resolves +- vulkan: hook up VK_EXT_multisampled_render_to_single_sampled +- lavapipe: VK_EXT_multisampled_render_to_single_sampled +- lavapipe: advertise VK_EXT_multisampled_render_to_single_sampled +- util/blitter: respect PIPE_TEXTURE_RECT +- lavapipe: fix shader access stage conversion from mesa -> pipe +- zink: stop ORing unordered barrier access +- zink: convert ZINK_DEBUG to enum +- zink: add doc for ZINK_DEBUG=compact +- zink: add ZINK_DEBUG=noreorder +- zink: zink_resource::unordered_barrier -> unordered_exec +- zink: explicitly unset unordered_exec when resource can't be unordered +- zink: rework unordered_exec handling +- zink: add a second resource param to get_cmdbuf() for unordered promotion +- zink: promote image/buffer copy operations to unordered +- zink: move buffer/image functions down in file +- zink: expand unordered_exec +- zink: make get_cmdbuf() public +- zink: enable unordered blit/resolve ops +- virgl: more ci flakes +- d3dadapter: fix sw pipe loading +- nine: set view_mask=0 +- zink: fix gfx program cache pruning with generated tcs +- zink: don't flag lazy push constant set dirty on batch change +- zink: combine loops for lazy descriptor program deinit +- zink: handle mutable swapchain images with dmabuf +- zink: store VkFormatFeatureFlags on creation +- zink: use modifier feature flags during surface creation when necessary +- zink: init cache_put program fence on program creation +- zink: force flush clears on fb change if fb geometry changes +- zink: force a new framebuffer for clear_depth_stencil if the clear region is big +- zink: drop mode_changed check from linewidth/depthbias draw updates +- zink: handle unscaled depth bias from nine +- zink: handle !half_pixel_center +- zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE +- nine: check return on resource_get_handle +- zink: demote dmabuf tiling to linear if modifiers aren't supported +- zink: use modifier_aspect to check for modifier plane in zink_resource_get_param +- zink: don't add modifiers if EXT_image_drm_format_modifier isn't present +- zink: add all format modifiers when adding for dmabuf export +- zink: don't fixup sparse texops +- vulkan: Update the XML and headers to 1.3.224 +- radv: fix return type for meta resolve shaders +- nir/validate: clamp unsized tex dests to 32bit +- mesa: fix blending when using luminance/intensity emulation +- mesa: require render target bind for A/L/I in format selection +- vk/render_pass: don't deref null resolve attachments +- zink: handle nir_intrinsic_sparse_residency_code_and mechanics +- zink: ignore nir_texop_lod for tex dest matching +- tu: fix invalid free on alloc failure +- zink: don't call util_queue_fence_init in zink_screen_get_pipeline_cache() +- zink: don't emit entrypoints for function temp variables +- zink: check the variable mode before taking samplemask path in ntv +- zink: explicitly use unsigned types for bit shifts +- zink: only add srgb mutable for images with modifiers +- zink: don't emit illegal interpolation +- llvmpipe: don't assume pipe_context is always available in flush_frontbuffer +- zink: flag all assigned output slots as mapped +- zink: handle split acquire/present +- st_pbo/compute: fix 1D_ARRAY offsets +- st_pbo/compute: fix 1D coord dimension by pre-trimming vectors + +Mike Lothian (1): + +- util: Fix invalid usage of alignas in u_cpu_detect.c + +Mykhailo Skorokhodov (3): + +- anv: Use sampleLocationsEnable for sample locations +- iris: Move Wa_1806527549 and enable by default +- anv: Move Wa_1806527549 and enable by default + +Nagappa Koppad, Basanagouda (1): + +- iris:Duplicate DRM fd internally instead of reuse. + +Nanley Chery (11): + +- iris: Return a 64B stride for clear color plane +- intel/isl: Add a score for I915_FORMAT_MOD_4_TILED +- isl,iris: Add DG2 CCS modifier support for XeHP +- mesa: Simplify UNORM8 conversions for sRGB ASTC textures +- mesa: Follow ASTC decode mode extension for RGBA8 output +- iris: Don't leak surface states for compressed resources +- iris: Don't leak compressed resources in iris_create_surface +- iris: Use fill_surface_states for compressed resources +- anv: Make the D16 reg mode single-sampled +- iris: Make the D16 reg mode single-sampled +- iris: Dedent enum iris_depth_reg_mode + +Nicolas Caramelli (1): + +- egl: Fix EGL_EXT_platform_xcb name string to match the registry + +Panagiotis Apostolou (1): + +- util: Don't block SIGSEGV for new threads + +Patrick Lerda (1): + +- panfrost: Fix unwanted valgrind message related to restart_index + +Paulo Zanoni (4): + +- iris: fix race condition during busy tracking +- intel/compiler: compute int64_options based on devinfo->has_64bit_int +- intel/compiler: split handling of 64 bit floats and ints +- intel/compiler: call ordered_unit() only once at update_inst_scoreboard() + +Pavel Asyutchenko (7): + +- gallium: add PIPE_CAP_FBFETCH_ZS and expose extension +- glsl: add language support for GL_ARM_shader_framebuffer_fetch_depth_stencil +- nir_to_tgsi: Don't count ZS fbfetch vars as outputs +- llvmpipe: enable per-sample shading when FB fetch is used +- llvmpipe: simplify early/late zs tests selection +- llvmpipe: implement FB fetch for depth/stencil +- llvmpipe: enable PIPE_CAP_FBFETCH_ZS + +Pavel Ondračka (13): + +- r300: set PIPE_BIND_CONSTANT_BUFFER for const_uploader +- r300: skip draws instead of using a dummy vertex shader +- r300: fix vertex shader control flow in loops +- r300: deduplicate common NIR options +- r300: be less agresive with copy propagate in loops +- r300: don't try to use inline constants instead of constant swizzles +- r300: Update list of RV515 dEQP failures and add some flakes +- r300: merge simple movs with constant swizzles together +- r300: expose PIPE_CAP_LEGACY_MATH_RULES +- r300: only run merge_movs pass on R500 +- r300: Keep rc_rename_regs() from overflowing RC_REGISTER_MAX_INDEX +- r300: disable PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF +- r300: fix variables detection for paired ALU and TEX instructions in different branches + +Pierre-Eric Pelloux-Prayer (81): + +- radeonsi: make si_pc_emit_shaders non static +- radeonsi: add helpers to deal with spm counters +- radeonsi: add spm counters setup code +- radeonsi/sqtt: setup spm bo +- radeonsi/sqtt: enable spm counters on gfx10+ +- radeonsi: make sure profile_peak is used before using sqtt +- ac/spm: setup write broadcasting correctly +- ac/surface: adjust gfx9.pitch[*] based on surf->blk_w +- radeonsi: remove commutative_blend_add option +- radeonsi: drop assume_no_z_fights option +- drirc: remove i965 entry +- gallium: add PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER +- radeonsi: enable PIPE_CAP_ALLOW_DRAW_OUT_OF_ORDER +- radeonsi: fix VS kill_outputs handling +- radeonsi: don't always strip NIR debug infos +- radeonsi: generate unique shader name in si_get_nir_shader +- amd: update gfx10_format_table.py for gfx11 +- radeonsi/gfx11: add assert in legacy vs path +- radeonsi/gfx11: register changes +- radeonsi/gfx11: image descriptor changes +- radeonsi/tests: allow empty line and comments in csv files +- radeonsi/tests: add a heuristic to pick the baseline +- radeonsi/tests: reformat with black +- u_threaded: clear non-async debug callback correctly +- radeonsi: flush VGT streamout like PAL +- radeonsi: implement pipeline stats workaround +- gallium: add a union to access queries counters +- gallium: always return the correct pipeline stats index +- gallium: deduplicate target -> stats index code +- radeonsi: deduplicate query offsets +- radeonsi: store the pipeline stats index +- radeonsi: implement GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB in shaders +- radeonsi/test: update sienna_cichlid results +- radeonsi/query: use the qbo correct size +- radeonsi/ngg: reuse the pipeline stats buffer when using atomics +- radeonsi: fix gs_invocation query with NGG +- radeonsi: don't use wave32 for GE on gfx10 if culling is used +- amd: fix ac_build_mbcnt_add in wave32 mode +- util/u_queue: rework UTIL_QUEUE_INIT_SCALE_THREADS to scale faster +- radeonsi: remove unused params from si_shader_select_with_key +- radeonsi: scale the number of shader compiler threads +- radeonsi: store shader variants in an array +- glx: set max values for pbuffer width / height +- radeonsi: fix glTexBuffer max size handling +- radeonsi: wait for PS idle in si_set_framebuffer_state +- radeonsi/tests: add a --slow option +- radeonsi/tests: use a smaller tests-per-group value +- radeonsi: don't use sel->nir in si_check_blend_dst_sampler_noop +- ac: export LLVM c++ ResetCommandLineParser method +- ac: reset LLVM command line parser +- ac/llvm: add use_waterfall_for_divergent_tex_samplers option +- radeonsi: enable use_waterfall_for_divergent_tex_samplers +- st/mesa: use mutex in st_get_texture_sampler_view_from_stobj +- gallium: update winsys_handle documentation +- gallium: add size attribute to winsys_handle +- va/surface: set the correct size in vaExportSurfaceHandle +- radeonsi: set size in si_texture_get_handle +- tradeonsi: fix preamble state producing incorrect packets +- radeonsi: prevent recursion in si_decompress_dcc +- radeonsi: add helper to use si_screen::aux_context +- radeonsi: use helpers to access si_screen::aux_context +- Revert "winsys/amdgpu: use AMDGPU_IB_FLAG_PREAMBLE for the CS preamble on gfx10+" +- dri: store internal_format when known +- mesa/st: don't guess the internal format if it's known +- util/blob: use memcpy in read functions +- mesa: skip draws with invalid indices offset +- vulkan/wsi: define pWaitDstStageMask in the blit submission +- ac: use LLVMContextSetOpaquePointers if available +- radeonsi: use LLVMBuildLoad2 where possible +- ac: add per output is_16bit flag to ac_shader_abi +- ac/llvm: handle opaque pointers in visit_store_output +- ac/llvm: use LLVMBuildLoad2 in visit_load +- radeonsi: use LLVMBuildLoad2 in llvm PS +- radeonsi: use LLVMBuildLoad2 for inter-stage outputs loads +- radeonsi: check last_dirty_buf_counter and dirty_tex_counter +- amdgpu/bo: update uses_secure_bos when importing buffers +- nir: add a nir_opt_if_options enum +- radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot +- radeonsi: use nir_opt_large_constants earlier +- mesa: avoid reading back textures from VRAM +- radeonsi: invalidate L2 when using dcc stores + +Qiang Yu (63): + +- radeonsi: lower nir_intrinsic_sparse_residency_code_and +- mesa/program: fix nir output reg overflow +- mesa/vbo: remove unused vbo_context->binding +- nir/builder: add load/store array variable helper functions +- mesa: add hardware accelerated select constant +- mesa: add _mesa_bufferobj_get_subdata +- mesa: refine name stack code to prepare for hw select +- mesa: add hw select name stack code path +- mesa/vbo: enclose none-vertex functions with HW_SELECT_MODE +- mapi: add api setup header for hw select mode +- mesa: set CurrentServerDispatch too when glBegin/End +- mesa: add HWSelectModeBeginEnd dispatch table +- mesa: pass select result buffer offset as attribute/varying +- mesa/st: implement hardware accelerated GL_SELECT +- zink: reset PIPE_CAP_ACCELERATED when cpu soft rendering +- virgl: return -1 for PIPE_CAP_ACCELERATED +- mesa: enable HardwareAcceleratedSelect +- ac/llvm: get back nir_intrinsic_load_tess_rel_patch_id_amd +- nir: add nir_intrinsic_load_lshs_vertex_stride_amd +- ac/nir: use nir_intrinsic_load_lshs_vertex_stride_amd +- radeonsi: implement load_lshs_vertex_stride abi +- radeonsi: add tcs_vgpr_only_inputs parameter to si_get_nir_shader +- ac/nir: skip gl_Layer/gl_ViewportIndex write for LS +- radeonsi: replace llvm ls/hs interface lds ops with nir lowered ones +- gallium: disable hardware select for crocus +- ac/llvm: conditionally check wave id in gs sendmsg +- radv: no need to do gs_alloc_req for newer chips in ngg vs/tes +- radeonsi: deserialize nir binary in si_check_blend_dst_sampler_noop +- radeonsi: use si_shader as parameter in si_get_nir_shader +- radeonsi: support multi stage shader state creation in nir shaderlib +- radeonsi: add si_create_passthrough_tcs +- radeonsi: replace llvm based fixed tcs with nir +- radeonsi: implement nir_intrinsic_load_tcs_num_patches_amd +- ac/nir: remove unused parameter in tes input lower +- ac/nir: add nir_intrinsic_load_hs_out_patch_data_offset_amd +- ac/nir: use nir_intrinsic_load_hs_out_patch_data_offset_amd in tess lower +- ac/nir: add pass_tessfactors_by_reg param to hs output lower +- ac/nir: add wave_size parameter to ac_nir_lower_hs_outputs_to_mem +- radeonsi: preload tess offchip ring for tcs +- radeonsi: implement nir_intrinsic_load_ring_tess_offchip(_offset)_amd +- radeonsi: implement nir_intrinsic_load_tess_rel_patch_id_amd for both tcs and tes +- ac/llvm: handle write mask for nir_intrinsic_store_buffer_amd +- ac/nir: add no_input_lds_space param to hs output lower +- radeonsi: replace llvm tcs output with nir lower pass +- radeonsi: enable PIPE_CAP_GLSL_TESS_LEVELS_AS_INPUTS +- radeonsi: set uses_vmem_load_other for nir_intrinsic_load_buffer_amd +- radeonsi: replace llvm tes input load with nir lowering +- ac/llvm: fix code format alignment in visit_load_local_invocation_index +- radeonsi: implement two esgs ring nir intrinsic +- ac/nir: skip gl_ViewportIndex and gl_Layer write in ES +- radeonsi: set lds for gs/es to handle nir shared memory load/store +- ac/nir: remove unused param num_reserved_es_outputs from gs input lower +- ac/nir: change es output lower param to esgs_itemsize +- radeonsi: replace llvm es output with nir lowering +- ac/nir: add triangle_strip_adjacency_fix to gs input lower +- ac/llvm: get back nir_intrinsic_load_gs_vertex_offset_amd +- radeonsi: replace llvm gs input handle with nir lowering +- ac/nir/ngg: fix nogs culling scratch size +- ac/nir/ngg: add a barrier before prim id export +- ac/nir/ngg: Decouple primitive ID store and primitive export. +- nir/lower_gs_intrinsics: fix primitive count for points +- radeonsi: fix tcs_out_lds_offsets arg alignment +- winsys/amdgpu: fix non-page-aligned sparse buffer creation + +Rajnesh Kanwal (29): + +- pvr: Remove double error reporting in error path. +- pvr: Change phase_rate_change type to bool from uint32_t. +- pvr: Setup common nop shader. +- pvr: Add services winsys transfer context support. +- pvr: Add support to create transfer context and setup required shaders. +- pvr: Add vk_sync support and remove service winsys syncobjs interface. +- pvr: Fix argument comments in render job submission. +- pvr: Pass device info struct in pds generator. +- pvr: Add support for R32G32B32A32_UINT, R32_UINT and R8_UINT formats. +- pvr: Allow signal_sync pointer to be NULL in job submission. +- pvr: Remove vk_format_is_pure_integer and use common helper. +- pvr: Add transfer 3d parameter heap support. +- pvr: Fix ISP_RENDER_MODE_TYPE enum in rogue_cr.xml +- pvr: Restrict argument to dev_info where possible. +- pvr: Fix UTIL_ARCH_BIG_ENDIAN check in vk_format.h. +- pvr: Change ALIGN macro to ALIGN_ATTR. +- pvr: Add services winsys transfer cmd submit interface. +- pvr: Move transfer logic to pvr_job_transfer.[ch] +- pvr: Remove clang-format off comment from vk_format.h +- pvr: Implement vkCmdDraw API. +- pvr: Init and store default sampler in pvr_device. +- pvr: Use common vk_buffer implementation. +- pvr: Implement vkCreateBufferView and vkDestroyBufferView. +- pvr: Move binding related checks in common code. +- pvr: Add support for sampler descriptor. +- pvr: Add support for combined image and sampler descriptor. +- pvr: Add support for image descriptor. +- pvr: Add support for storage and uniform texel descriptor. +- pvr: Add support for input attachment descriptor. + +Renato Pereyra (4): + +- venus: add support for vk_xwayland_wait_ready +- venus: Add driconf option to enable implicit fencing +- Revert "wsi/x11: Don't leak xcb_get_geometry_reply_t." +- Revert "wsi/x11: Avoid using xcb_wait_for_special_event in FIFO modes" + +Rhys Perry (94): + +- aco: fix disassembly of SMEM with both SGPR and constant offset +- ac/nir: properly handle large global access constant offsets +- nir/gather_info: fix system_value_read for rt/mesh system values +- nir/builder: add nir_{ine,ibfe,ubfe}_imm() helper +- radv: remove unnecessary align_mul/offset initialization +- radv: initialize meta shader options earlier +- radv: use radv_meta_build_nir_vs_generate_vertices() more +- radv: use nir_op_imm helpers +- radv: use inot(a) instead of ine(a, true)/ieq(a, false) +- ac/nir: fix 64-bit NGG GS output stores +- radv: always use nir_lower_io_lower_64bit_to_32 +- nir/copy_prop_vars: fix non-vector shader call payloads +- radv: fix clearing of TRUNC_COORD with tg4 and immutable samplers +- radv: consider radix sort shaders to be internal/meta +- aco/ra: fix vgpr_limit +- aco: ensure that definitions fixed to operands have matching regclasses +- radv: add radv_has_shader_buffer_float_minmax +- radv: allow LBVH on GFX6+ +- radv: remove subtractions in address calculations +- radv: use SMEM for a few load_global +- aco: split load_sbt_amd result +- aco: skip s_barrier if TCS patches are within subgroup +- ac/nir: skip s_barrier if TCS patches are within subgroup +- aco: fix cmpswap global atomic definition on GFX6 +- nir: allow NIR_PASS(_, ) +- nir: print file when validation fails +- nir: call nir_metadata_preserve in nir_io_add_const_offset_to_base +- nir: call nir_metadata_preserve in nir_vectorize_tess_levels +- nir: call nir_metadata_preserve in nir_lower_memory_model +- radv: call nir_metadata_preserve in various lowering passes +- radv: add missing NIR_PASS() and switch from NIR_PASS_V() +- radv: validate shaders after linking passes +- aco: fix p_constaddr with a non-zero offset +- aco/tests: add test for p_constaddr with a non-zero offset +- aco: only add/subtract low bits of program addresses +- aco/ra: fix usage of invalidated iterator +- aco: use p_parallelcopy for uniform reduction with zero source +- aco: clarify a portion of do_pack_2x16 +- aco/tests: disable regalloc.subdword_alloc.reuse_16bit_operands on GFX11 +- aco/tests: fix gfx11 variants printed as gfx12 +- aco: disable sdwa on gfx11 +- aco: use v_perm_b32 to copy 0xff00/0x00ff/0xff/0x00 +- aco/tests: update for GFX11's removal of SDWA +- radv,aco: disable shader cache when ACO debug options are used +- aco: fix SMEM load_global_amd with non-zero offset +- aco: fix SMEM load_global with VGPR address and non-zero offset +- aco: consider fma with multiplication by power-of-two unfused +- ac: use ResetAllOptionOccurrences instead of ResetCommandLineParser +- gitlab: ask for useful titles +- gitlab: ask that reporters don't include long logs in descriptions +- nir/deref: stop assuming coherent accesses of different SSBOs may alias +- aco: update SMEM offset workaround for LLVM 15 +- aco: don't skip VS->TCS barrier if TCS output vertices doesn't match input +- aco: cleanup force-waitcnt output +- aco/ra: update register file when updating phi definition +- radv/ci: re-enable vega10 fossils testing +- aco: add and use constantValue16() +- aco: don't accept med3 opcodes in get_minmax_info() +- aco: include _e64 variants of 16-bit min/max in minmax optimizations +- aco: ignore precise flag when optimizing integer clamps +- aco: fix single-alignbyte do_pack_2x16() path with fp inline constants +- aco: fix propagate_constants_vop3p with integer vop3p and 16-bit constants +- aco: don't use opsel to fold constants into dot accumulation sources +- aco: fix redirect combine in propagate_constants_vop3p() with negatives +- aco: try sign-extending or shifting constants in propagate_constants_vop3p +- aco: don't use 32-bit fp inline constants for fp16 vop3p literals +- aco/tests: add vop3p constant combine tests +- nir: rename fsin_r600/fcos_r600 to fsin_amd/fcos_amd +- nir/algebraic: optimize bcsel(c, fsin/cos_amd(a), fsin/cos_amd(b)) +- nir: allow 16-bit fsin_amd/fcos_amd +- radv,aco,ac/llvm: use nir_op_f{sin,cos}_amd +- aco: make flat access latency match mtbuf/mubuf/mimg +- aco: include flat-like in vmem clause statistics +- aco: make FLAT_instruction::offset signed +- aco: improve support for scratch_* instructions +- aco: combine additions and constants into scratch load/store +- aco: handle subtractions in parse_base_offset +- aco: refactor VGPR spill/reload lowering +- aco: avoid WAW hazard with BVH MIMG and other VMEM +- aco: include scratch/global in VMEM WAW optimization +- aco: treat flat-like as vmem in some scheduling heuristics +- aco: initialize scratch base registers on GFX9-GFX10.3 +- aco: use scratch_* for scratch load/store on GFX9+ +- aco: use scratch_* for VGPR spill/reload on GFX9+ +- radv: fix vbo_bound_mask indexing +- radv: don't update misaligned_mask in CmdBindVertexBuffers2 +- aco: remove unnecessary exp_cnt increments +- aco: fix LdsBranchVmemWARHazard with 2+ branch chains +- aco: fix hash statistic +- radv: remove claimed support for sRGB vertex buffer formats +- radv: fix 16-bit support in radv_lower_vs_input +- aco: fix 16-bit VS inputs +- aco: don't expand vec3 VS input load to vec4 on GFX6 +- aco: add SCC clobber in build_cube_select + +Ricardo Garcia (1): + +- vulkan: allow null descriptor set layouts in pipeline layouts + +Riteo (1): + +- vulkan/device_select_wayland: fix a memory leak with DRM device handling + +Rob Clark (64): + +- freedreno: Misc indent fixes +- freedreno/drm/virtio: Appease valgrind +- freedreno/drm: Drop FD_PP_PGTABLE +- freedreno/drm: Close bo handle after bo->destroy() +- freedreno/drm: Move bo common init +- freedreno/drm/virtio: Support ring_idx +- freedreno/drm/virtio: Use userspace IOVA allocation +- freedreno/drm/virtio: Drop blocking in host +- freedreno/drm/virtio: Fix SHAREABLE+MAPPABLE +- freedreno/drm/virtio: Protocol updates +- freedreno/drm/virtio: Pass guest handles to execbuf +- freedreno/drm/virtio: Async ccmd batching +- freedreno/drm/virtio: Split up large uploads +- freedreno/a6xx: A couple more FD_BO_NOMAP +- freedreno: Split out helper for staging blit +- freedreno: Use staging transfer if mmap fails +- freedreno/drm: Fix bos_on_stack calculation +- drm-shim: Add GET_UNIQUE support +- freedreno/drm-shim: Update to latest uapi version +- freedreno/drm-shim: Robustify error handling +- drm-shim: Better mmap offsets +- drm-shim: Add error handling for bo_init() +- freedreno/drm-shim: Better iova handling +- drm-shim: Cleanup on device file close +- freedreno/drm: Use DEBUG_GET_ONCE_OPTION() +- freedreno/drm/virtio: Don't try to mmap imported bo's +- freedreno/devices: Add another SKU +- Revert "ci: freedreno farm is down" +- freedreno: Add pkt4 assert +- freedreno/registers: Small cleanup +- freedreno/a6xx: Split VFD_FETCH[] if needed +- freedreno/a6xx: Initialize VFD_FETCH[n].SIZE to zero +- freedreno/a6xx: assert valid vertex_flags reg +- turnip: assert valid vertex_flag reg +- freedreno/ir3: Don't lower_gs multiple times +- freedreno/ir3: Add copy_vars() helper +- freedreno/ir3: Copy vars if needed on EndPrimitive() +- freedreno/ir3: Add more tess varying slots +- freedreno/a6xx: Fix indentation +- freedreno/a6xx: Fix VS const packet size +- freedreno/ir3: Fix GS clip-plane lowering +- freedreno: rename ir3_emit_driver_params() +- freedreno/a6xx: Handle driver-params in GS/DS +- freedreno/a6xx: Do clip-plane lowering in backend +- freedreno/gmem: Reverse order of alternative tile rows +- util: Fix c++ enum casting pickiness +- libsync: Fix compiler warning +- freedreno/a6xx: Remove fd6_format.[ch] +- freedreno/a6xx: Fix enum tag +- freedreno: Extract common helper macros +- freedreno/ir3: Remove unneeded forward declaration +- freedreno/autotune: Make 'offset' macro "private" +- freedreno/drm: Combine upper and lower 32b of OR val +- freedreno/a6xx: De-open-code CACHE_FLUSH enum +- freedreno: Use enum for primtypes table +- freedreno/ci: Update unit test reference decodes +- freedreno/registers: Whitespace fix for gen_header.py +- freedreno/registers: Allow varset to be specified on enum +- freedreno/registers: Move varset to <enum> +- freedreno/registers/a6xx: Some reg64 conversion +- freedreno/drm: Fix potential bo cache vs export crash +- freedreno/gmem: Fix col0 calc +- llvmpipe: Add some missing locking +- freedreno: We really don't need aligned vbo's + +Roman Stratiienko (3): + +- Android.mk: Fix c11-related build failures +- v3dv: Enable sync_fd importing/exporting on Android +- v3dv: Limit API version to v1.0 for Android + +Ruijing Dong (6): + +- radeonsi/vcn: Add support of array_mode for gfx11 +- radeonsi/vcn: update av1 decoding to support vcn4 +- radeonsi/vcn: prepare for unified queue in vcn4 +- radeonsi/vcn: support unified queue in vcn4 +- frontends/va: do texture_map when needed +- frontends/va: WA for ffmpeg 10bit encoding crash + +Ryan Neph (5): + +- venus: update venus-protocol with VK_EXT_image_view_min_lod +- venus: enable VK_EXT_image_view_min_lod +- venus: support VK_KHR_copy_commands2 +- venus: refactor VkCommandBufferBeginInfo fixups to function +- venus: add support for VK_KHR_dynamic_rendering + +Sagar Ghuge (1): + +- anv: Disable storage image compression for possible atomic ops + +Sami Kyöstilä (2): + +- subprojects: Roll Perfetto to v27.1 +- util: Shut down Perfetto before driver unload + +Samuel Pitoiset (213): + +- radv: exclude PRIMITIVE_{COUNT,INDICES} from the per-vertex output mask +- radv: use shader_info::writes_memory +- radv: use shader_info::gs::active_stream_mask +- radv: use shader_info::inputs_read/outputs_written for FS IO +- radv: use shader_info::outputs_written/per_primitive_outputs for VS outputs +- radv: use shader_info::system_values_read +- radv: fix enabling adjust_frag_coord_z and apply per-pipeline +- radv: add support for independent descriptor set layouts +- radv: add support for VkShaderModuleCreateInfo as pNext +- radv: fix initializing pipeline_key::topology for GFX9 and older +- radv: mark all active stages earlier in the pipeline creation path +- radv: use the hardware primitive topology everywhere +- radv: initialize the vertex input interface state in only one place +- radv: do not support UNIFORM_TEXEL_BUFFER with SRGB +- radv: only apply enable_mrt_output_nan_fixup for 32-bit float MRTs +- radv/ci: update the flakes list for GFX9 chips +- aco: fix load_barycentric_at_{sample,offset} on GFX6-7 +- nir: fix marking XFB varyings as always active IO +- nir: mark XFB varyings as unmoveable to prevent them to be remapped +- radv: clarify why STAGE_2_CLEAR_BIT needs to wait for CP DMA to be idle +- radv: remove redundant VK_PIPELINE_STAGE_2_TRANSFER_BIT for CP DMA idle +- radv: add an SQTT workaround for chips with disabled RBs +- radv: fix handling divisor == 0 with dynamic vertex input state +- radv: allow to disable sinking of load inputs for FS via drirc +- radv: enable radv_disable_sinking_load_input_fs for Grid Autosport +- radv: use correct push constants range for internal operations +- radv/ci: update list of expected failures for Bonaire (GFX7) +- radv/ci: remove empty flakes lists for GFX9 +- radv: fix the number of generated primitive queries with NGG GS vs legacy +- radv/ci: stop skipping dEQP-VK.synchronization.* on Bonaire +- radv: use 3D views for 3D internal operations on GFX6-8 +- radv: re-emit dynamic line stipple state if the primitive topology changed +- radv/radix: handle intentional allocation failures properly +- ac,radeonsi: add has_sqtt_auto_flush_mode_bug +- radv/sqtt: fix configuring AUTO_FLUSH_MODE on GFX10.3 +- radv: only init acceleration structure if RT is enabled +- radv: disable instance packing to fix pipeline query stats +- radv: remove useless check against nir_texop_samples_identical +- radv/winsys: allow to use RADV_FORCE_FAMILY=gfx1100 +- radv: disable DCC for Senra Kagura Shinovi Versus +- aco: recognize GFX11 in few places +- aco: do not align VGPRS to 8 or 16 on GFX11 +- radv,aco: add support for packed threadID VGPRs on GFX11 +- radv,aco: use the new TCS WaveID SGPR to compute vs_rel_patch_id on GFX11 +- aco: export MRT0 instead of NULL on GFX11 +- aco: do not set DLC for loads on GFX11 +- aco: do not set GLC stores on GFX11 +- aco: update LDS allocation granularity for PS on GFX11 +- aco: update waitcnt on GFX11 +- aco: do not set RESOURCE_LEVEL for buffer descriptors on GFX11 +- aco: do not set COMPR for exports but use 0x3 channel mask on GFX11 +- radv,aco: export alpha-to-coverage via MRTZ on GFX11 +- aco: only retrieve the scratch offset when it's declared +- radv,aco: do not implicitly export the primitive ID for mesh shaders +- aco: remove unreachable code about viewport index/layer and mesh shaders +- radv,aco: add a workaround for binding 2D views of a 3D image on GFX9 +- radv: implement VK_EXT_image_2d_view_of_3d +- radv: advertise VK_EXT_image_2d_view_of_3d +- radv: do not declare the scratch offset on GFX11 +- aco: do not emit the primitive ID twice for NGG VS or TES with GS +- radv: do not align VGPRS to 8 or 16 on GFX11 +- radv: use the new format table on GFX11 +- radv: update scratch buffer registers on GFX11 +- radv: update sampler registers on GFX11 +- radv: update buffer descriptor registers on GFX11 +- radv: update image descriptor registers on GFX11 +- radv: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE on GFX11 +- radv: configure DB_RENDER_CONTROL on GFX11 +- radv: do not emit non-existent CP_COHER_START_DELAY on GFX11 +- radv: do not emit FLUSH_AND_INV_DB_META on GFX11 +- radv: update TF_RING_size to a per-SE size on GFX11 +- radv: update VRS registers on GFX11 +- radv: apply a workaround for CB perf counters on GFX11 +- radv: limit CP DMA to max 32KB sizes on GFX11 +- radv: more register changes on GFX11 +- radv: do not lower loading TESS/ESGS rings using the ABI for LLVM +- aco: use ac_is_llvm_processor_supported() for checking LLVM asm support +- radv: rename shader compile functions to spirv_to_nir/nir_to_asm +- radv: do not try to dump the NIR of the trap handler shader +- radv: add new pipeline helpers for NIR->ASM compilation +- radv: init states from pTessellationState at only one place +- radv: init states from pViewportState at only one place +- radv: init states from pRasterizationState at only one place +- aco: remove unnecessary intrinsics that are lowered at the ABI level +- radv: determine if shaders use NGG before linking +- radv: export implicit primitive ID in NIR for legacy VS or TES +- radv: constify radv_pipeline in more radv_pipeline_generate_XXX() helpers +- radv: rename radv_pipeline_generate_XXX() to radv_pipeline_emit_XXX() +- radv: remove couple of useless pCreateInfo pointers in the emit path +- radv: remove unused radv_get_conservative_raster_mode() helper +- radv: move gs_table_depth to the physical_device +- radv: cleanup using device/physical_device during pipeline creation +- radv: split radv_pipeline into radv_{graphics,compute,library}pipeline +- radv: introduce radv_graphics_pipeline_info and split existing info structs +- radv/ci: fix fails list for NAVI21 +- radv: move HS info and task_num_entries to the physical device +- radv: rename radv_cmd_state::pipeline to graphics_pipeline +- radv: unify radv_pipeline_has_XXX() helpers +- radv: move streamout_shader to radv_graphics_pipeline +- radv: move active_stages to radv_graphics_pipeline +- radv: rename VERT_GRP_SIZE to VERTS_PER_SUBGRP on GFX11 +- radv: do not emit more non-existent registers on GFX11 +- radv: update framebuffer registers on GFX11 +- radv: update VRS rates on GFX11 +- radv: use the fragment resolve path by default on GFX11 +- radv: report adjusted LDS size for fragment shaders on GFX11 +- radv: fix configuring COLOR_INVALID on GFX11 +- radv: fix VK_BLEND_FACTOR_CONSTANT_COLOR translation on GFX11 +- radv: configure DB_Z_INFO.NUM_SAMPLES correctly on GFX11 +- radv: disable VK_AMD_shader_fragment_mask on GFX11 +- radv: disable attachementFragmentShadingRate on GFX11 +- radv: use pipeline->slab_bo in more places +- radv: fix writing buffer markers with non-zero memory offset +- radv: copy viewport/scissor when initializing radv_viewport_info +- radv: init states from pMultisampleState at only one place +- radv: init states from pDepthStencilState at only one place +- radv: init states from VkPipelineRenderingCreateInfo at only one place +- radv: use AMD values for computing blend related state +- radv: init states from pColorBlendState at only one place +- radv: init states from VkAttachmentSampleCountInfo at only one place +- radv: init states from VkPipelineDiscardRectangleState at only one place +- radv: init states from VkPipelineFragmentShadingRateState at only one place +- radv: ignore DYNAMIC_STENCIL_OP if stencil test isn't enabled +- radv: reduce radv_tessellation_info::patch_control_points to 8-bit +- radv: reduce radv_input_assembly_info::primitive_topology to 8-bit +- radv: reduce radv_rasterization_info::polygon_mode to 8-bit +- radv: reduce radv_discard_rectangle_info::count to 8-bit +- radv: fix lowering GS intrinsics if NGG is disabled per pipeline +- radv/ci: update list of failures for Pitcairn +- radv: enable radv_zero_vram for Hammerting +- radv/winsys: add get_chip_name() to the null winsys +- radv: report the marketing name as part of the device name +- radv: rename ngg_gs_state to ngg_query_state +- radv: track if primitives generated query features are enabled +- radv: add primitives_generated_query to the graphics pipeline key +- radv: declare the NGG query argument for primitives generated query +- ac/nir/ngg: count the number of generated primitives for VS and TES +- radv: flush the NGG query state when the argument is declared +- radv: add few helpers related to streamout +- radv: implement VK_EXT_primitives_generated_query +- radv: advertise VK_EXT_primitives_generated_query +- radv/ci: rename deqp-XXX jobs to vkcts-XXX +- zink/ci: skip two KHR-GL46 tests that timeout most of the time with RADV +- zink/ci: update list of expected failures with RADV +- radv: remove redundant check when importing vertex input info +- radv: update radv_is_vrs_enabled() to use radv_graphics_pipeline_info +- radv: update the check to determine if dynamic discard rectangle is needed +- radv: update the check to determine if dynamic sample location is needed +- radv: update the check to determine if rasterization is enabled +- radv: remove remaining unused pCreateInfo pointers +- ci: bump the hang-detection tool in the test image for Vulkan +- ci: fix passing down MESA_LOADER_DRIVER_OVERRIDE for zink-anv-tgl +- zink/ci: change the surface type used for dEQP +- ci/valve: split .b2c-test into .b2c-test-vk and .b2c-test-gl +- radv/ci: add GLES/GLCTS testing with Zink on NAVI10 +- zink/ci: update list of failures again +- radv/ci: refactor test machines and dEQP rules to reduce copy&paste config +- ci: update vkd3d-proton builder/runner +- ci: print a message when vk3d-proton returns successfully +- ci: add VKD3D_PROTON_RESULTS to the list of variables to pass down for testing +- radv/ci: add vkd3d-proton testing +- radv: advertise VK_EXT_border_color_swizzle on GFX10+ +- ci: uprev vkd3d-proton to 39d07dea2cef34bfb3ed39741f026bc637e3eec4 +- radv/amdgpu: make sure to reset the number of BO when there is no ranges +- radv/amdgpu: do not add "global" BO to the virtual BO list +- aco: fix validation of SOP1 instructions without definitions +- radv,aco: rename radv_prolog_binary/radv_shader_prolog to shader_part +- radv/llvm: always emit a null export even if the FS doesn't discard +- radv: disable small primitive culling for user sample locations +- radv: use RADEON_FLAG_VA_UNCACHED for the trace BO +- radv: fix command line for dumping waves with UMR +- radv: dump UMR waves before UMR rings +- radv: do not emit SQTT user data packets on the SDMA queue +- radv/ci: add CI lists for LLVM on NAVI21 +- radv: use LOAD_CONTEXT_REG to load the opaque buffer size on GFX10+ +- radv: do not abort if SPM isn't supported for the current GPU +- radv: remove old workaround for HTILE layers with F1 2021 +- zink/ci: update list of failures for piglit with RADV +- radv: disable DCC for Melty Blood Actress Again Current Code +- radv: always enable VK_EXT_debug_utils +- zink/ci: update list of failures +- radv/ci: update list of failures against CTS 1.3.3.0 +- aco: fix load_barycentric_at_sample without MSAA +- radv: fix dumping VS prologs assembly +- radv/ci: enable fossils testing for GFX1100 +- radv: remove the radv_report_apu_as_dgpu workaround for Red Dead Redemption 2 +- radv: fix wide points/lines by configuring the guardband correctly +- zink/ci: update list of CTS flakes for RADV +- ci: uprev vkd3d-proton to 5b73139f182d86cd58a757e4b5f0d4cfad96d319 +- radv,aco: track if a fragment shader needs an epilog +- radv,aco: introduce {radv,aco}_ps_epilog_key +- radv: declare a new user SGPR arg in FS for the epilog PC +- radv: add a function that declares PS epilog shader arguments +- aco: add new pseudo instruction p_jump_to_epilog +- aco: emit p_jump_to_epilog if the main fragment shader has an epilog +- aco: do not abort if the FS doesn't export anything but has an epilog +- aco: prevent adding DONE/VM to the last export if the FS has an epilog +- aco: add aco_postprocess_shader() helper +- radv,aco: rename radv_aco_build_prolog to radv_aco_build_shader_part +- aco: refactor export_fs_mrt_color() for PS epilogs preparation +- aco: add support for compiling PS epilogs +- radv: implement PS epilogs +- radv: disable viewport depth clamping only when necessary +- aco: requires Exact for p_jump_to_epilog +- radv: only force 1x sample for Bresenham lines when pipeline draws lines +- vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT +- radv: implement VK_EXT_attachment_feedback_loop_layout +- radv: ignore out-of-order rasterization if stencil write mask is dynamic +- radv: fix gathering XFB info if there is dead outputs +- radv: fix cleaning the meta query state if an error occured +- radv: re-emit viewports if negative one to one or depth clamp mode changed +- radv: fix bogus assertion with RADV_FORCE_VRS +- radv: fix pipelineStageCreationFeedbackCount when it's 0 + +Sarah Walker (2): + +- pvr: Update for firmware 1.17@6256262 +- pvr: Move BRN 44079, 48492 and 66011 code into pvrsrvkm specific directory + +Sathishkumar S (2): + +- radeon/vcn: skip create and destroy message for jpeg +- radeon/vcn: engage all available jpeg engines + +Sebastian Keller (1): + +- egl/wayland: Don't try to access modifiers u_vector as dynarray + +Sergi Blanch Torne (1): + +- ci/crosvm: clean crosvm socket + +Sidney Just (19): + +- wgl: add a flag to determine if running on zink +- wgl: add zink to the list of auto-loaded drivers +- zink: support VK_KHR_win32_surface +- kopper: add win32 loader interface +- zink: support win32 wsi +- wgl: support kopper +- gallium: support for implementing EXT_external_objects_win32 +- mapi: added EXT_external_objects_win32 definitions +- gallium/noop: implement win32 device LUID and node mask getters +- gallium/trace: implement win32 device LUID and node mask getters +- mesa: implement GL_DEVICE_LUID_EXT and GL_DEVICE_NODE_MASK_EXT getters +- mesa: implement import win32 memory and semaphore +- mesa: pipe cap for EXT_memory_object_win32 and EXT_semaphore_win32 +- zink: codegen for Win32 external object extensions +- zink: implement win32 fence import +- zink: implement win32 memory handle import +- zink: implement device LUID and node mask +- zink: enable pipe cap for win32 external memory and fences +- docs: updated zink features and release notes + +Sil Vilerino (39): + +- gallium/vl: Add software winsys and offscreen winsys +- util/u_format: Drop assert that has valid/well-defined behavior +- util/vl_vlc: Support compiling in C++ +- st_vdpau: Pass format when opening resource from handle in st_vdpau_resource_from_description +- d3d12: Add .clang_format file +- gallium/va/radeonsi: Using private as a parameter name conflicts with C++ keywords +- gallium va: VaDeriveImage support stride/offset being different for NV12 planes +- gallium: Add values to pipe_video_cap for multi-slice and multi-reference encode +- gallium va: Handle new VA attributes with new pipe video caps +- gallium: Add multiple slice support to pipe_h264_enc_picture_desc +- gallium va: Add support for multiple slices encoding +- gallium radeon/r600/omx/va: Adds support for multiple reference encoding +- gallium: Add MinLumaBiPredSize8x8 to pipe_h264_sps, pic_init_qs_minus26 to pipe_h264_pps +- gallium omx: Fill out MinLumaBiPredSize8x8 and pic_init_qs_minus26 +- gallium vdpau: Fill out level_idc and MinLumaBiPredSize8x8 +- gallium va: VaHandlePictureParameterBufferH264 fill out new pipe params MinLumaBiPredSize8x8, pic_init_qs_minus26, chroma_format_idc, bit_depth_chroma, bit_depth_chroma_minus8 +- d3d12: Add d3d12_promote_to_permanent_residency +- ci: Update x86 debian build to pick up tag v1.602.0-r1 from DirectX-Headers +- d3d12: Add video decode implementation of pipe_video_codec +- d3d12: Add video encode implementation of pipe_video_codec +- d3d12: Add create_video_codec and create_video_buffer entrypoints +- d3d12: Add util video functions to d3d12_format +- d3d12: Improve planar resource support to handle video requirements +- d3d12: Add support for d3d12 video in d3d12_screen +- d3d12: Add GLSL singleton refcounting to screen for offscreen non-GL rendering +- meson: Support d3d12 as a video-supporting driver +- winsys: Do not use Display type when X11 is not present in build +- meson: Update DirectX-Headers dependency to 1.602.0 version with fallback to the wrap +- meson: Add build option for gallium-d3d12-video feature +- d3d12: Align output buffer offset access to D3D12_FEATURE_DATA_VIDEO_ENCODER_RESOURCE_REQUIREMENTS.CompressedBitstreamBufferAccessAlignment +- d3d12: Fix debug_printf format strings for cross platform builds +- d3d12: Video - Remove unused spCopyQueues from enc/dec objects +- pipe/video: Add PIPE_VIDEO_ENTRYPOINT_PROCESSING +- vl: Replace usage of entrypoint UNKNOWN with PROCESSING for VP +- va: Replace usage of entrypoint UNKNOWN with PROCESSING for VP +- gallium/video: Add video post processing interface +- va: Add support for VPP rotation, flip, alpha blend, crop, scaling +- d3d12: Add pipe_video_codec::process_frame implementation +- d3d12: Enable VPP rotation, flip, alpha blend, crop, scaling via pipe_video_codec::process_frame + +SoroushIMG (4): + +- zink: Fix BO size when it's not aligned to 16 bytes +- zink: Fix spirv stream 0 vertex emit for multistream shaders +- Zink: Fix clear being missed when using emulated draws in zink_blit +- zink: Fix incorrect emission of SPIR-V shift ops + +SureshGuttula (4): + +- radeonsi: Set display_remote for non-refernced frames +- radeonsi/vcn : update enc->dpb ref_use for index 0 +- Revert "radeonsi: Set display_remote for non-refernced frames" +- Revert "radeon: hardcode uvd/vce encoder not_referenced value to false" + +Sviatoslav Peleshko (7): + +- mesa: flush bitmap caches when changing scissors or window rects state +- anv: workaround apps that assume full subgroups without specifying it +- intel/blorp/gen6: Set BLEND_STATEChange only if emitting the blend state +- anv: Dirty all dynamic state bits when creating command buffer state +- intel/blorp: Dirty depth bounds dynamic state bits after blorp +- intel/nullhw: Use correct macro to fix build regression +- iris: Always initialize shader compilation queue ready fence + +TSnake41 (1): + +- zink: print result code string on vulkan failure + +Tapani Pälli (5): + +- isl: disable mcs (and mcs+ccs) for color msaa on DG2 +- anv: use anv_cmd_dirty_mask_t type for dynamic state +- intel/fs: setup SEND message descriptor from nir scope +- iris: implement Wa_14015264727 for DG2 +- anv: implement Wa_14015264727 for DG2 + +Tatsuyuki Ishi (9): + +- radv: Fix redundant subpass barriers due to erroneous comparison +- radv/ci: skip dEQP-VK.fragment_operations.transient_attachment_bit +- radv/ci: Move transient_attachment_bit from fail to skip list +- radv/ci: skip image.sample_texture.*_compressed_format* +- radv/ci: Move sample_texture.*_compressed_format_* to faillist for gfx<=9 +- radv: Fix vkCmdCopyQueryResults -> vkCmdResetPool hazard. +- amd: Revert gfx10 addrlib changes +- radv: Only set pstate for the first hw_ctx. +- radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs. + +Thomas Debesse (2): + +- gallium/clover: LLVM setLangDefaults moved from clangFrontend to clangBasic +- gallium/clover: pass -no-opaque-pointers to Clang + +Tiago Koji Castro Shibata (2): + +- d3d12: fallback to integrated adapter instead of arbitrary one +- d3d12: add more formats to supported conversions + +Timothy Arceri (71): + +- glsl/st: vectorise interfaces of SSO shader programs +- nir: fix setting varying from uniform as flat +- nir: fix sorting before assigning varying driver locations +- nir: always set the exact_trip_count_unknown loop terminator property +- nir: remove unreachable loop terminators +- svga: remove duplicate nir compile options +- nir/glsl: add glsl_record_compare() wrapper +- nir/glsl: wrapper contains_{double,interger}() +- nir/glsl: wrapper field_index() +- nir/glsl: wrap component_slots_aligned() +- glsl/mesa: move parse_program_resource_name() to common linker_util code +- glsl: add new build program resource helpers +- nir: add variable data fields required for NIR glsl varying linking +- nir: abort io info gathering if location is not set or is a temp value +- nir: skip lowering io to scalar for must_be_shader_input +- glsl: implement lower_packed_varyings() as a NIR pass +- glsl: add a NIR based varying linker +- glsl: implement opt_dead_builtin_varyings() as a NIR pass +- glsl: implement lower_xfb_varying() as a NIR pass +- glsl: enable the use of the nir based varying linker +- glsl: remove now unused GLSL IR varying linker code +- glsl: simplify finding cursor in varying packing code +- nir: add support for forced sampler indirect loop unrolling +- gallium/drivers: set force_indirect_unrolling_sampler for all required drivers +- glsl: move validation of sampler indirects to the nir linker +- i915g: use a valid setting for force_indirect_unrolling +- softpipe: switch to NIR loop unrolling +- asahi: switch to NIR loop unrolling +- d3d12: switch to NIR loop unrolling +- etnaviv: switch to NIR loop unrolling +- i915g: switch to NIR loop unrolling +- r300: disable GLSL IR loop unrolling +- r600: switch to NIR loop unrolling +- vc4: disable GLSL IR loop unrolling +- nir/i915g/r300/nv30: skip marking varyings as flat in some drivers +- etnaviv: assert if etna_shader_io_file reg overrun +- etnaviv: vectorise io +- nir/gcm: fix pushing instructions into if blocks +- ci: uprev piglit 2022-05-31 +- nir: add nir based version of the lower_const_arrays_to_uniforms pass +- glsl/nir: skip adding hidden uniforms to the remap tables +- glsl/nir: allow the nir linker to remove dead uniforms we created +- glsl: move common link time optimisation calls to linker code +- glsl: move gl_nir_link_opts() call out of the st code +- glsl: switch to NIR based implementation of lower_const_arrays_to_uniforms() +- glsl: remove now unused lower_const_arrays_to_uniforms() +- gallivm: disable GLSL IR loop unrolling in LLVMPIPE +- freedreno/ir3: tidy up duplication of common nir options +- freedreno: switch to NIR loop unrolling +- lima: lower all undefs to zero in vs +- lima: fixup nir indirect unroll options to match gallium CAP +- lima: switch to NIR loop unrolling +- nouveau/nv30: disable GLSL IR loop unrolling +- nouveau/nv50: disable GLSL IR loop unrolling +- nouveau/nvc0: disable GLSL IR loop unrolling +- svga: disable GLSL IR loop unrolling +- gallium: remove PIPE_SHADER_CAP_MAX_UNROLL_ITERATIONS_HINT CAP +- glsl: remove the now unused GLSL IR loop unrolling code +- glsl: drop extra optimise swizzles call +- glsl: tidy up link_varyings_and_uniforms() +- glsl: remove never true do_dead_code() parameter +- glsl: inline do_common_optimization() call +- util: add dri config option to disable GL_MAP_UNSYNCHRONIZED_BIT +- util: use force_gl_map_buffer_synchronized workaround with RAGE +- glsl: merge lower_buffer_access with lower_shared_reference +- glsl: drop non-nir path for atan in builtin functions +- st/glsl: fix broken vertex attrib mapping +- nir/loop_unroll: clean up after complex_unroll_single_terminator() +- glsl: correctly track cross slot component packing +- Revert "nir: Preserve offsets in lower_io_to_scalar_early" +- glsl: dont lower precision for textureGatherOffsets + +Timur Kristóf (119): + +- radv: Minor formatting fix in radv_device.c file. +- radv: Move queue submit sparse bindings to a separate function. +- radv: Move empty queue submit code path to a separate function. +- radv: Move normal (non-empty) queue submit to a separate function. +- radv: Move up early exit for transfer queues in radv_get_preamble_cs. +- radv: Simplify some boolean code in radv_get_preamble_cs. +- radv: Initialize BO pointers when creating preambles. +- radv: Refactor cache flush code for the initial preambles. +- radv: Don't create continue preamble when it's not needed. +- radv: Rename fill_geom_tess_rings to radv_fill_shader_rings. +- radv: Don't use pointers to pointers when updating the preambles. +- radv: Only emit what is appropriate to the queue family for preambles. +- ac/nir: Add ac_nir_load_arg helper for shader arguments. +- radv: Move radv_nggc_settings enum out of radv_cmd_buffer. +- nir: Add upper bound for AMD shader arg intrinsics. +- radv: Lower ABI in NIR for tess/ESGS/NGG shader arguments. +- ac/nir: Remove now-superfluous ac_nir_lower_tess_to_const. +- aco: Remove now-superfluous intrinsics. +- ac/llvm: Remove now-superfluous intrinsics and ABI callbacks. +- radv/amdgpu: Group queue submit info into a structure. +- radv/amdgpu: Initialize semaphore info with a designated initializer. +- radv/amdgpu: Pass new queue submit info structure to internal function. +- radv/amdgpu: Pass correct struct type instead of repeating the cast. +- radv/amdgpu: Use scheduled dependency for submitting to multiple queues. +- ac/nir: Add I/O lowering for task and mesh shaders. +- ac: Add task ring entry shader argument. +- radv: Add radv_pipeline_has_task helper. +- radv: Set user data register for task shaders. +- radv: Set wave size for task shaders. +- radv: Fill task shader info. +- radv: Add task shader arguments. +- radv: Add task ring entry argument for mesh shaders. +- radv: Implement task shader intrinsics in the ABI. +- radv: Enable nir_opt_offsets for task shaders. +- radv: Use I/O lowering for task and mesh shaders. +- radv: Postprocess task shader configuration. +- radv: Allow linking task shaders. +- radv: Lower shared and task_payload variables in task/mesh shaders. +- radv: Print task shader stage name before disasm. +- radv: Fix task shader info. +- radv: Fix loading task shader ring buffer addresses. +- aco: Fix scratch with task shaders. +- radv: Introduce ring info structure for queues, refactor preamble generation. +- radv: Use a ws variable during preamble creation. +- radv: Pass radv_device pointer to some functions. +- radv: Refactor queue state to separate structure. +- radv: Clarify emitting graphics shader pointers. +- nir: Add explicit task payload atomic intrinsics. +- radv: Implement conditional rendering for async compute queue. +- ac: Remove trailing white space. +- aco: Remove trailing whitespace. +- radv: Remove trailing whitespace. +- radv: Disable predication for supass clear and image clears. +- nir: Add new launch_mesh_workgroups intrinsic. +- nir: Add common task shader lowering to make the backend's job easier. +- radv: Add mesh and task stage names to pipeline executable properties. +- nir: Keep track of cross-invocation mesh shader output access. +- radv, ac/nir: Fix multiview layer export for mesh shaders. +- ac/nir/taskmesh: Use task shader lowering from common NIR code. +- ac/nir/taskmesh: Use 3 dimensional workgroup ID. +- ac/nir: Add remappability to tess and ESGS I/O lowering passes. +- ac: Add RDNA2 task+mesh shader draw packet opcodes. +- ac: Add task shader ring information. +- radv: Implement task shader draw and payload rings. +- nir: Introduce new intrinsics for AMD specific mesh shader task ring. +- ac/nir/ngg: Clean up mesh shader output LDS layout. +- ac/nir/ngg: Use mesh shader scratch ring when outputs don't fit LDS. +- ac/nir/ngg: Use variables for outputs without cross-invocation access. +- radv: Implement mesh shader scratch ring. +- radv: Enable NGG wave ID for mesh shader scratch ring. +- ac/nir/ngg: Add mesh shader workgroup index. +- radv: Lower mesh shader 3D workgroup ID to 1D index. +- ac/nir/taskmesh: Preserve workgroup ID Y and Z when applying firstTask. +- radv: Always use 3D block ID and grid size in task shaders. +- gallium/u_blitter: Fix depth. +- zink: Enable the VK_EXT_depth_clip_enable extension. +- zink: Always enable depth clamping, make depth clipping independent. +- nir/lower_task_shader: don't use base index for shared memory intrinsics +- ac/nir/ngg: Refactor LDS instructions in NGG GS vertex emit and export. +- ac/nir/ngg: Ignore driver location for mesh shader outputs. +- radv: Don't assign driver locations to mesh shader outputs. +- radv: Add CULL_PRIMITIVE to special output mask. +- radv: Use NIR optimization to move discards to the top. +- radv: Only initialize DGC state when DGC is enabled. +- radv: Remove trailing whitespace introduced by DGC commits. +- radv: Refactor predication for compute queues. +- radv: Refactor some CP DMA functions to work with radeon_cmdbuf. +- radv: Refactor radv_emit_userdata_address to work with radeon_cmdbuf. +- radv: Refactor radv_emit_descriptor_pointers to work with radeon_cmdbuf. +- radv: Refactor radv_emit_inline_push_consts to work with radeon_cmdbuf. +- radv: Refactor view index emit to use a per-stage function. +- radv: Move inline push constants to a new function. +- radv: Minor cleanup of radv_queue_submit_normal. +- radv: Allow reusing pipeline compute state emit functions. +- radv: Add dispatch_initiator_task field to radv_device. +- radv: Create internal cmdbuf when a graphics pipeline needs compute. +- radv: Flush descriptors and push constants for task shaders. +- radv: Introduce radv_before_taskmesh_draw. +- radv: Implement mesh shading draw calls with task shaders. +- radv: Synchronization for task shaders. +- radv: Support task shaders in secondary cmd buffers. +- radv: Workaround MEC taskmesh dispatch hang when count buffer has zero. +- radv: Submit internal compute cmdbuf. +- radv: Copy BO list to ACE internal CS. +- radv: Enable task shader feature for NV_mesh_shader. +- ac/nir/ngg: Move primitive ID workgroup barrier to proper place. +- aco: Remove hack for primitive ID export. +- ac/nir/ngg: Create output variable for primitive ID export. +- ac/llvm: Implement load_num_subgroups for NGG shaders. +- ac/llvm: Add LLVM bug workaround to ac_build_mbcnt_add. +- ac/llvm: Use gs_prim_id for NGG VS. +- ac/nir/ngg: Copy comment about LDS layout for NGG GS. +- ac/nir/ngg: Remember proper bit sizes of GS output variables. +- ac/llvm: Implement GDS atomic add NIR intrinsic. +- radv/llvm: Remove incorrect hardcoded workgroup size from NGG GS. +- radv: Properly set LDS size for LLVM NGG shaders. +- radv: Use NIR-based NGG lowering with LLVM. +- aco: Fix p_init_scratch for task shaders. +- nir/gather_info: Clear cross-invocation output mask. + +Tomeu Vizoso (4): + +- lima/ci: Disable lima-mali450-piglit-gpu:arm64 +- panvk/ci: Disable CI for a while +- ci: Disable jobs to the Collabora lab +- Revert "ci: Disable jobs to the Collabora lab" + +Vadym Shovkoplias (4): + +- anv: Fix geometry flickering issue when compute and 3D passes are combined +- driconf: Add a limit_trig_input_range option +- drirc: Set limit_trig_input_range option for the Wolfenstein Youngblood +- drirc: Set limit_trig_input_range option for glmark2 + +Vasily Khoruzhick (7): + +- lima: Implement texture_barrier +- lima: enable PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT +- lima: Lower nir_op_idiv with other scalar math operations. +- u_transfer_helper: flush temporary transfer first for MSAA +- lima: implement lima-specific blitter +- lima: wire up MSAA 4x support +- lima: fix buffer overallocation for index, vertex and constant buffers + +Victor Hermann Chiletto (1): + +- radv: always check entry count in descriptor pool when allocating + +Viktoriia Palianytsia (1): + +- anv: Add workaround for sample mask with multisampling + +Vinson Lee (14): + +- panfrost: Fix memory leaks on unit test failures. +- zink: Fix memory leak on error path. +- radeonsi: Move NULL check before dereference. +- dri2: Fix memory leak. +- d3d12: Initialize d3d12_video_encoder_bitstream member m_uiOffset. +- amd: Initialize Gfx11Lib members in constructor. +- d3d12: Remove unnecessary NULL check. +- clc: Fix build with llvm-15. +- microsoft/spirv_to_dxil: Fix missing-prototypes errors. +- radv: Remove unnecessary null check. +- microsoft/compiler: Fix assert. +- tu: Check dereferenced value of rop_reads_dst. +- aco: Initialize spill_ctx members in constructor. +- r600/sfn: Initialize TestShaderFromString member m_instr_factory. + +Vlad Zahorodnii (2): + +- winsys/amdgpu-radeon: Allow specifying context priority +- radeonsi: Add support for EGL_IMG_context_priority + +Yevhenii Kolesnikov (1): + +- nir: Remove single-source phis before opt_if_loop_last_continue + +Yiwei Zhang (51): + +- venus: override aspectMask for internal tiling modifier +- venus: use linear modifier for legacy common wsi path +- venus: fix view format for ahb image +- venus: refactor to add struct vn_env +- venus: add env perf options and introduce no_async_set_alloc +- venus: add VN_PERF option no_async_buffer_create +- venus: add VN_PERF option no_async_queue_submit +- venus: refactor vn_cmd_submit +- venus: flush when batched draw calls reach a threshold +- venus: disable VK_EXT_extended_dynamic_state2 +- venus: refactor android gralloc pieces +- venus: cache front_rendering_usage bit at gralloc init +- venus: update vn_GetSwapchainGrallocUsage2ANDROID for shared present +- venus: enable ANB shared presentable image prop +- venus: add VN_RELAX_BASE_SLEEP_US env var for tuning cpu utils +- venus: log performance envs upon instance creation with debug init +- venus: extend buffer cache to cover layering usage +- vulkan: remove the VkPhysicalDeviceMemoryProperties workaround +- radv: remove VkPhysicalDeviceMemoryProperties2 workaround +- venus: use common entry point for VkPhysicalDeviceMemoryProperties +- venus: resolve AHARDWAREBUFFER_FORMAT_Y8Cb8Cr8_420 to external format +- venus: fix a bug on buffer cache init failure path +- venus: fix the queue init failure path +- venus: fix a missing mtx_destroy in vn_device_init +- venus: adjust device resources init order +- venus: let device track queue families used for device creation +- venus: add vn_feedback_pool backed by coherent buffer (part 1) +- venus: add vn_feedback_pool backed by coherent buffer (part 2) +- venus: add feedback check/set/reset helpers +- venus: add event feedback +- venus: add NO_FENCE_FEEDBACK perf option and disable sparse resource +- venus: create per queue family command pool for feedback cmd +- venus: renderer to store allow_vk_wait_syncs capset +- venus: add fence feedback +- venus: resolve YV12 AHB to VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM +- venus: use narrow range to match up with mesa EGL +- venus: swizzle the chroma channels for YVU420 to match the VkFormat +- venus: GetPhysicalDeviceSparseImageFormatProperties to match sparse support +- venus: suballocate more for layering +- venus: add more tracepoints for perf analysis +- venus: filter out VK_EXT_physical_device_drm on the driver side +- venus: check dynamic state for raster enablement +- venus: reenable VK_EXT_extended_dynamic_state2 +- Revert "venus: suballocate more for layering" +- anv: enable VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM for modifier support +- venus: fix external memory ext filtering +- venus: avoid feedback for external fence +- venus: allow no external memory handle when renderers lacks support +- zink: fix zink_create_fence_fd to properly import +- zink: fix in-fence lifecycle +- venus: ignore pInheritanceInfo if not secondary command buffer + +Yogesh Mohan Marimuthu (12): + +- radeonsi/gfx11: make flat_scratch changes for graphics +- radeonsi/gfx11: make flat_scratch changes for compute +- radeonsi/gfx11: interp changes for 32bit +- radeonsi/gfx11: interp changes for 16bit +- radeonsi/gfx11: instruction cache line size is 128 bytes +- radeonsi/gfx11: program inst_pref_size for compute +- radeonsi/gfx11: program db render control register +- radeonsi/gfx11: export alpha through mrtz for alpha-to-coverage if mrtz is there +- ac,radeonsi/gfx11: swizzle MRT0/1 for dual source blending +- vulkan/wsi: fix extra free if buffer_blit_queue +- radeonsi: remove tabs from code +- radeonsi: no need to call si_pm4_clear_state() in si_pm4_free_state() + +Yogesh Mohanmarimuthu (1): + +- radeonsi/gfx11: use PIXEL_PIPE_STATE_DUMP event instead of ZPASS_DONE + +Yonggang Luo (119): + +- zink: Remove redundant framebuffer_mtx from zink_screen.h +- win32: Fixes 32 bits visual studio module definition files by add script gen_vs_module_defs.py +- win32: Do not use BUILD_GL32, we use def file to export win32 dll symbols. +- misc: Replace \`#ifdef\\t__cplusplus` with \`#ifdef\\s\\s__cplusplus` +- misc: replace \`#ifdef\\s\\s__cplusplus` with \`#ifdef\\s__cplusplus` +- zink: Trim spaces in zink.rst +- zink: Improve zink.rst with links +- st: Improve _mesa_error about GL_OUT_OF_MEMORY in st_cb_texture.c +- llvmpipe: Do not use _Atomic keyword that doesn't support by MSVC +- radv: fixes msvc compiling error +- radv: Fixes compiling error with msvc +- radv: Add losing member hw_ip_version_major and hw_ip_version_minor for win32 +- radv: Add macros for win32 that accessed +- nir: Fixes MSVC compiling error about unused variable \`_` +- egl/wgl: On win32, there is no support for EGL_EXT_device and EGL_EXT_platform_device +- util: Fixes test_util_get_process_exec_path on windows host with msys2/mingw +- util: Fixes typo in test_util_get_process_exec_path +- ci/vs2019: Clear CI_COMMIT_MESSAGE and CI_COMMIT_DESCRIPTION for please meson +- radv: Remove usage of \`cnd_monotonic.h` +- d3d12: Fixes d3d12 compiling errors in \`D3D12ResourceState.h` for mingw +- d3d12: Move shared code that include d3d12 headers into d3d12_common.h +- d3d12: Fixes compiling error by include \`<unknwn.h>` +- d3d12: \`#include <windows.h>` instead \`#include <Windows.h>` for building under linux with mingw +- d3d12/wgl/test: Fixes wgl_tests.cpp for d3d12 with mingw +- dzn: Fixes compiling error when build with msys2/mingw +- dzn: Fixes compiling error by include \`<unknwn.h>` +- microsoft/clc: Fixes compiling of microsoft clc with mingw +- docs: Fixes dispatch.rst with __THREAD_INITIAL_EXEC +- meson: Fixes timespec_get detect code +- c11/threads: Remove the detecting code for \`HAVE_TIMESPEC_GET` in threads_win32.h +- c11: Implement c11/time.h with c11/impl/time.c +- c11: Implement timespec_get on win32 properly when not available +- util: Use timespec_get directly, it's always present +- util: Remove usage of designated initializers in timespec_test.cpp +- util: always enable timespec_test.cpp +- util: Remove unused function timespec_passed +- util: Add unittest for timespec_get +- ci/vs2019: There is no meaning to set $env:ErrorActionPreference +- ci/vs2019: switch to powershell 7 +- ci/vs2019: Update editorconfig for ps1 files +- ci/vs2019: Give each option its own line in mesa_build.ps1 +- ci/vs2019: Rename to vsInstallPath +- ci/vs2019: Do not install subprojects +- ci/vs2019: Ignore cert install output, it's too long +- ci/vs2019: Ignore error when removing directory recursively +- ci/vs2019: Disable progress when install python +- ci/vs2019: Disable progess bar for pip3 install +- ci/vs2019: Use shared VULKAN_SDK_VERSION for install and build vulkan related packages +- ci/vs2019: Add a dedicated script for entering VS dev shell +- ci/vs2019: Split call to C:\\vs_buildtools.exe into multiple lines +- ci/vs2019: Show the total vs2019 install time +- ci/vs2019: Reduce the vs2019 build tools components to be installed +- ci/vs2019: ignore _build and _install directory by write * into .gitignore +- ci/vs2019: Install all build artifacts into C:\\mesa-deps +- ci/vs2019: Split choco install output +- ci/vs2019: Split install of vs2019 and choco into separate Docker image +- ci/vs2019: Remove comment for windows_test_vs2019 that not true +- ci/vs2019: Convert mesa_build.ps1 to using PowerShell 7 && operator +- microsoft/compiler: Include stddef.h before using size_t +- microsoft/compiler: Fix error from double extern +- dxil: Fixes compiling dxcapi.h with mingw. +- dxil: Sync dxil_spirv_shader_stage with gl_shader_stage +- dxil: Fixes warning about comparing different enum value and enum literal +- microsoft/spirv_to_dxil: Delete unused local variable +- microsoft/clc: Delete unused local variables +- microsoft/clc: Add missing void to no-parameter function signature +- microsoft/spirv_to_dxil: Fixes maybe-uninitialized compiling error in dxil_spirv_nir.c +- ci: Building spirv-to-dxil on both mingw and debian/x64 +- intel: using C++11 keyword thread_local +- glx: using C++11 keyword thread_local +- util: Define WIN32_LEAN_AND_MEAN before include of \`windows.h` in u_thread.h +- c11: Move the implementation of threads.h into c source code +- c11: Fixes memory leak of pack in thrd_create on win32 +- c11: Implement thread_local in c11/threads.h +- util/c11: Update function u_thread_create to be c11 conformance +- meson: Using get_argument_syntax as the \`--compiler_id` option for gen_vs_module_defs.py +- meson: Use cc.get_argument_syntax instead cc.get_id when possible. +- meson/rtti: Using get_argument_syntax to check msvc compatible compiler +- meson/clover: Getting clover can be built with no-rtti llvm in mingw +- ci/vs2019: Upgrade Windows 10 SDK version to 20348 +- docs: Update windows build requirement with Visual Studio +- d3d12: Use static_cast instead of dynamic_cast in d3d12_video_enc_h264.cpp +- d3d12: Convert #include <Windows.h> to #include <windows.h> for mingw on linux +- d3d12: Fixes compiling error in d3d12/wgl/d3d12_wgl_framebuffer.cpp with gcc +- microsoft/clc: Fixes narrowing error in clc_compiler_test.cpp with mingw/gcc +- microsoft/clc: Disable clc_compiler_test on non-windows platform +- dzn: Fixes incompatible pointer type error +- ci/x86_build: Getting pushd popd be paired, avoid using cd +- ci: Prepare the container for building all mesa components with mingw under linux +- ci: Trigger the new mingw/linux dockers to be build +- d3d12: Turn d3d12_format.h to include d3d12_common.h +- ci: Building all mesa functional with mingw on debian +- meson: Enable wgl tests on mingw +- CODEOWNERS: Update c11 code owners +- CODEOWNERS: evelikov renamed to xexaxo +- gallium: Move -DHAVE_PIPE_LOADER_DRI and -DHAVE_PIPE_LOADER_KMS to be pre_args +- gallium: Do not call to pipe_loader_sw_probe_kms when not present +- gallium: Rename macros HAVE_PIPE_LOADER* +- gallium: Disable dri2 interface on OSX +- glx: __glX_tls_Context should use __THREAD_INITIAL_EXEC +- util: thread_local doesn't support to be exported on OSX +- glx: Convert tab to space in applegl_glx.c +- dri: Trim trailing space in dri/dri_util.* +- ci: Building osmesa on OSX +- util: Disable tests/sparse_array_test.cpp on MacOS as it's too slow +- mapi: Trim trailing spaces in stub.c and u_current.c +- meson: Remove usage of use_elf_tls +- egl: Remove usage of USE_ELF_TLS macro +- mapi: Remove usage of USE_ELF_TLS +- loader: Remove usage of USE_ELF_TLS in loader.c +- glx: Remove usage of USE_ELF_TLS +- util: Remove usage of USE_ELF_TLS in u_thread.h +- meson: Remove pre_args += '-DUSE_ELF_TLS' +- glx: Remove usage of pipe_tsd_set and pipe_tsd_get +- auxiliary: Remove pipe_tsd +- d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library +- microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp +- util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func +- c11: #include <threads.h> when the os/platform provide it + +Yurii Kolesnykov (2): + +- Remove Travis CI config +- Init macOS GitHub Action + +Zack Rusin (4): + +- svga: Don't try to build x86/x64 assembly on different arch's +- svga: finish readbacks before mapping resources +- svga: Use direct maps when GB objects are present +- svga: Add support for SVGAv3 + +Zhang, Jianxun (1): + +- iris: Wa_14016820455 for GFX_VERx10 == 12.5 + +illiliti (1): + +- Use proper types for meson objects + +jheaff1 (1): + +- build(glx): Fix build by adding missing deps + +naveen (1): + +- github/ci: Set permissions for GitHub actions + +newbluemoon (1): + +- nine: replace ulimit with sysconf call + +prashanth (1): + +- vulkan/device_select: force default device if environment variable set + +sjfricke (2): + +- isl: fix bug where sb.MOCS is not being set +- anv: fix assert to build with shader cache disabled + +xperia64 (1): + +- wgl: Add driver_zink as a dependency of the wgl frontend + +İlhan Atahan (1): + +- Add Adreno 616 and 620 to use turnip on these GPU's . diff --git a/lib/mesa/docs/relnotes/22.2.1.rst b/lib/mesa/docs/relnotes/22.2.1.rst new file mode 100644 index 000000000..97afc7e9a --- /dev/null +++ b/lib/mesa/docs/relnotes/22.2.1.rst @@ -0,0 +1,227 @@ +Mesa 22.2.1 Release Notes / 2022-10-11 +====================================== + +Mesa 22.2.1 is a bug fix release which fixes bugs found since the 22.2.0 release. + +Mesa 22.2.1 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.2.1 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + 0079beac0a33f45e7e0aec59e6913eafbc4268a3f1e2e330017440494f91b13c mesa-22.2.1.tar.xz + + + +New features +------------ + +- None + + +Bug fixes +--------- + +- r600: Shadows incorrecly rendered on Counter-Strike: Global Offensive with the NIR backend +- radeonsi/radv: Discrepancy in handling wd_switch_on_eop +- Shadows rendering issue in Deathloop +- RADV: Regression with RPCS3 where nothing renders +- [RADV][bisected] commit 3714c89d0ebf0c541d8f7a7a807ce7c1bfe3cfea at least lead to GPU hang when I run the game "Quake II RTX" + + +Changes +------- + +Caio Oliveira (1): + +- nir/lower_task_shader: Don't fail adding a launch when last instruction is a jump + +Connor Abbott (3): + +- tu: Fix maxPerStageDescriptorUpdateAfterBindInputAttachments +- ir3/analyze_ubo_ranges: Account for reserved consts +- ir3/analyze_ubo_ranges: Account for reserved consts better + +Daniel Schürmann (1): + +- aco/optimizer: disallow can_eliminate_and_exec() with s_not + +Dave Airlie (5): + +- lavapipe: fix 3d depth stencil image clearing. +- lavapipe: take layers into account around render surface creation. +- lavapipe: handle view mask layer count properly for begin rendering. +- llvmpipe: memset empty images in the key +- llvmpipe: bind sampler views/images properly. + +Dylan Baker (15): + +- docs: add release notes for 22.2.0 +- docs: Add sha256 sum for 22.2.0 +- .pick_status.json: Update to 3730be9873d556918a20795b37399b7e2e9630b5 +- .pick_status.json: Update to c268ae9efe602129b848c800dca83411d55c3aab +- .pick_status.json: Mark 49efa73ba11c4cacaed0052b984e1fb884cf7600 as denominated +- .pick_status.json: Mark 5b7c2dffc43eb7ada61807a6e679cc2c1b3b2682 as denominated +- .pick_status.json: Mark 1aacd9492de42412c6c9ca79d48a40c4ffebcd79 as backported +- .pick_status.json: Mark cbad4adc133b16c803ec9445c8dd144bc5023a62 as denominated +- .pick_status.json: Mark 40a235c9a82027bcb92d34b7386ea6b1a2e1d587 as denominated +- .pick_status.json: Update to 3ed712f7483791aa0a256472d276ff5a83c643bc +- .pick_status.json: Update to e5631add0d1c97da4686f474735d45705facd308 +- .pick_status.json: Mark 5af6dad179dc4140964ab444a2a7f8cf8d16c561 as denominated +- .pick_status.json: Update to 68f3c38c8011e3ff304a67b6ffb33fd21ee65b0c +- .pick_status.json: Mark ff933485b7e328c5041050d5e41a2dd28a032a92 as denominated +- docs: remove new_features.txt + +Emma Anholt (2): + +- turnip: Fix the "written stencil is unmodified" check. +- turnip: Fix busy-waiting on syncobjs with OS_TIMEOUT_INFINITE. + +Erik Faye-Lund (4): + +- mesa/st: always use normalized coords for samplers +- mesa/st: always use normalized coords for samplers +- mesa/st: always use normalized coords for samplers +- docs: fixup broken rst syntax + +Feng Jiang (1): + +- virgl/vtest: fix memory overwrite problem in virgl_vtest_send_get_caps() + +Gert Wollny (2): + +- r600/sfn: Handle shifts on Cayman +- nir_lower_to_source_mods: Don't sneek in an abs modifier from parent + +James Park (1): + +- meson,amd: Remove Windows libelf wrap + +Jason Ekstrand (3): + +- iris: Use a larger alignment for buffer allocations +- iris: Fix more BO alignments +- iris: Handle resource offsets in buffer copies + +Jesse Natalie (1): + +- u_atomic: Add a helper for pointer compare-exchange + +Jonathan (2): + +- gallium/u_threaded: Prepare for buffer sharedness tracking +- gallium/u_threaded: buffer sharedness tracking + +Jonathan Gray (1): + +- iris: check i915 features after hw gen + +Lionel Landwerlin (12): + +- anv: clear descriptorsets if AllocateDescriptorSets fails +- anv: add missing wokraround for texture cache invalidate +- intel/nir/rt: fixup generate hit +- intel/nir/rt: spill/fill the entire ray query data +- intel/nir/rt: fix ray query proceed level +- intel/nir/rt: change scratch check validation +- intel/nir/rt: store ray query state in scratch +- intel/fs: fixup a64 messages +- intel/nir: fixup preserved metadata in rayquery lowering +- intel/nir: fix potential invalid function impl ptr usage +- intel/nir: disable assert on async stack id +- anv: remove HDC flush from invalidate bits + +Marcin Ślusarz (1): + +- intel/compiler: remove second shading rate lowering for mesh + +Marek Olšák (1): + +- radeonsi: don't flush asynchronously for fence_server_signal + +Martin Krastev (1): + +- meson: add svga gallium driver to 'auto' group on arm/aarch64 + +Mike Blumenkrantz (23): + +- zink: uncap ssbo size +- zink: fix atomic ssbo indexing with non-32bit values +- zink: split up get_clear_data() +- zink: make void clears more robust +- zink: fix/relax resolve geometry check +- zink: fix atomic ssbo fadd offsets +- st/bitmap: use normalized coords in samplers with lowered rects +- st/drawpixels: use normalized coords in samplers with lowered rects +- zink: handle culldistance xfb outputs like clipdistance +- lavapipe: set writemask for compute shader buffers +- lavapipe: propagate shader access info across libraries +- zink: free vertex_state_cache on shutdown +- zink: always unset var->data.explicit_xfb_buffer +- zink: avoid overflow when clamping bufferviews +- zink: always run optimize_nir after lower_64bit_vars +- zink: don't flatten 64bit arrays during rewrite +- mesa: set normalized_coords for bindless texture buffer samplers +- aux/trace: dump depth_clamp for rasterizer state +- zink: set layouts before possibly reordering image copies +- zink: add a draw barrier when unbinding a sampler-bound fb surface +- glthread: handle DeleteBuffers(n=-1) gracefully +- zink: fix/improve handling for multi-component bitfield ops +- pipe-loader: fix zink driinfo header path + +Pierre-Eric Pelloux-Prayer (3): + +- radeonsi/gfx11: don't set VERTS_PER_SUBGRP to 0 +- tc: do a GPU->CPU copy to initialize cpu_storage +- tc: don't use CPU storage for glBufferData + +Qiang Yu (1): + +- nir: fix nir_xfb_info buffer_to_stream length + +Rhys Perry (1): + +- aco: fix VMEMtoScalarWriteHazard s_waitcnt mitigation + +Samuel Pitoiset (1): + +- radv: make sure to initialize wd_switch_on_eop before checking its value + +SoroushIMG (1): + +- zink: fix nir_op_unpack_64_2x32 emission + +Tapani Pälli (1): + +- mesa/st: fix a set_sampler_views call parameter order + +Timur Kristóf (4): + +- aco/optimizer_postRA: Mark a register overwritten when predecessors disagree. +- aco/optimizer_postRA: Don't assume all operand registers were written by same instr. +- aco/optimizer_postRA: Fix logical control flow handling. +- radv: Use a fallback for marketing name when libdrm doesn't know it. + +Yiwei Zhang (1): + +- mesa/st: emit string marker based on PIPE_CAP_STRING_MARKER + +Yonggang Luo (3): + +- drm-shim: drop gnu99 override +- ci/docs: Stick to specific version of python packages +- util: Disable usage of __attribute__((__const__)) when the compiler is clang + +nyanmisaka (1): + +- gallium: Do not include offscreen winsys if glx is disabled diff --git a/lib/mesa/docs/relnotes/22.2.2.rst b/lib/mesa/docs/relnotes/22.2.2.rst new file mode 100644 index 000000000..3c91b53a0 --- /dev/null +++ b/lib/mesa/docs/relnotes/22.2.2.rst @@ -0,0 +1,145 @@ +Mesa 22.2.2 Release Notes / 2022-10-19 +====================================== + +Mesa 22.2.2 is a bug fix release which fixes bugs found since the 22.2.1 release. + +Mesa 22.2.2 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.2.2 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + 2de11fb74fc5cc671b818e49fe203cea0cd1d8b69756e97cdb06a2f4e78948f9 mesa-22.2.2.tar.xz + + + +New features +------------ + +- None + + +Bug fixes +--------- + +- radv: Crash in dEQP-VK.ray_query.misc.dynamic_indexing +- glthread: radeonsi: offset textures in game starsector with glthread enabled +- Crashing on Windows VM +- Exanima renders with the wrong colors. +- nouveau: tegra124: GL_OUT_OF_MEMORY error + + +Changes +------- + +Alejandro Tafalla (1): + +- freedreno: Fix graphic glitches on a4xx and a5xx + +Alyssa Rosenzweig (4): + +- nir/lower_system_values: Fix cs_local_index_to_id with variable workgroups +- pan/mdg: Lower PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK on Midgard +- pan/mdg: Fix 16-bit alignment with spiller +- nir: Fix nir_fmax_abs_vec_comp + +Andri Yngvason (1): + +- gallium/vl: Add opaque rgb pixel formats + +Daniel Schürmann (1): + +- aco/spill: Fix spilling of Phi operands + +Danylo Piliaiev (1): + +- tu: Reset whether there is DS resolve for dynamic subpass + +Dave Airlie (1): + +- gallivm: handle llvm coroutines for llvm > 15 + +Diogo Ivo (1): + +- nouveau: treat DRM_FORMAT_INVALID as implicit modifier + +Dylan Baker (7): + +- docs: Add sha256 sum for 22.2.1 +- .pick_status.json: Update to 243aa6b2ec0c2626b1333ba666a6d6d60ede8505 +- .pick_status.json: Update to c4482a3c1a973975eb27ac284a18bebca24f7876 +- .pick_status.json: Update to 3eed5931edf6e5f45378b013ca21f98f17af2b34 +- .pick_status.json: Update to b02e9ef35a0446019cda9473e4c355c7cc4bb24d +- .pick_status.json: Mark 4c7a44413a07d3fb314f786e047bb7212c082a6c as denominated +- .pick_status.json: Mark dbd022f2ab43ff0a9ecc05c61123467e25f109de as backported + +Emma Anholt (1): + +- turnip: Don't use the dynamic color write enable during non-dynamic. + +Erik Faye-Lund (1): + +- gallium/u_threaded_context: remove stale comment + +Filip Gawin (1): + +- r300: don't use smooth line if not requested + +Gert Wollny (3): + +- r600/sfn: Always start a new CF after a KILL instruction +- r600/sfn: don't propagate registers into conditional test +- virgl: Report CONSTANT_BUFFER_SIZE according to GL_MAX_UNIFORM_BLOCK_SIZE + +Lionel Landwerlin (3): + +- vulkan/runtime: don't lookup the pipeline disk cache if disabled +- anv: initialization pipeline layout to 0s +- anv: add missing tracepoint + +Liviu Prodea (1): + +- clc/clover: Link clang statically when shared-llvm is disabled + +Mike Blumenkrantz (3): + +- zink: clamp line_stipple_factor to 1 if stipple is disabled +- zink: unset rp_changed after initializing renderpass attachments +- zink: disable fbfetch when flushing clears + +Ryan Houdek (1): + +- vulkan/wsi: Add dep_libudev to idep dependencies + +Sil Vilerino (2): + +- gallium/va: vaDeriveImage to check PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP +- d3d12: Implement cap PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP + +SoroushIMG (3): + +- zink: fix invalid Offset set for variables which do not need an offset +- zink: stop enabling minmax filtering when not supported +- zink: fix isNan mismatch between NIR and SPIR-V + +Timothy Arceri (5): + +- util/conf: enable init to zero workaround for Exanima +- util/radeonsi: enable zerovram workaround for Exanima +- radv: add radv_zero_vram workarounds for OpenGL games +- glthread: fix matrix stack depth tracking +- glthread: leave dlist dispatch in place for Begin/End + +Yonggang Luo (1): + +- util: Turn -DWINDOWS_NO_FUTEX to be pre_args diff --git a/lib/mesa/docs/relnotes/22.3.0.rst b/lib/mesa/docs/relnotes/22.3.0.rst new file mode 100644 index 000000000..46997e6f7 --- /dev/null +++ b/lib/mesa/docs/relnotes/22.3.0.rst @@ -0,0 +1,5723 @@ +Mesa 22.3.0 Release Notes / 2022-12-02 +====================================== + +Mesa 22.3.0 is a new development release. People who are concerned +with stability and reliability should stick with a previous release or +wait for Mesa 22.3.1. + +Mesa 22.3.0 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.3.0 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + 644bf936584548c2b88762111ad58b4aa3e4688874200e5a4eb74e53ce301746 mesa-22.3.0.tar.xz + + +New features +------------ + +- GL_ARB_shader_clock on llvmpipe +- VK_KHR_shader_clock on lavapipe +- Mesa-DB, the new single file cache type +- VK_EXT_attachment_feedback_loop_layout on RADV, lavapipe +- VK_KHR_global_priority on RADV +- GL_KHR_blend_equation_advanced_coherent on zink +- VK_EXT_load_store_op_none on RADV +- VK_EXT_mutable_descriptor_type on RADV +- VK_EXT_shader_atomic_float on lvp +- VK_EXT_shader_atomic_float2 on lvp +- GL_NV_shader_atomic_float on llvmpipe +- VK_EXT_image_robustness on v3dv +- VK_EXT_extended_dynamic_state3 on lavapipe +- VK_EXT_extended_dynamic_state3 on RADV +- VK_EXT_pipeline_robustness on v3dv +- Mali T620 on panfrost +- Shader disk cache on Panfrost +- support for R8G8B8, B8G8R8, R16G16B16 and 64-bit vertex buffer formats on RADV +- initial GFX11/RDNA3 support on RADV +- various ray tracing optimizations on RADV +- extendedDynamicState2PatchControlPoints on RADV (VK_EXT_extended_dynamic_state2 feature) +- Radeon Raytracing Analyzer integration (using RADV_RRA_* environment variables) + + +Bug fixes +--------- + +- r600,regression: Wrong color on a Minecraft shaderpack on Cayman +- [iris/crocus] Native CS:GO OpenGL: fire produces black rendering artifacts +- VDPAU GL interop is broken, causing Black and White video +- panfrost: application crashes with 22.2.4/22.3.0-rc3 +- etnaviv: compiler assertion fail in glmark2 terrain +- anv: dEQP-VK.pipeline.monolithic.multisample.variable_rate.8_1_fb_1 failure +- DG2: Age of Empire IV incorrect rendering +- r600: Sky not rendered correctly with the NIR backend on Transport Fever 2 +- "intel/compiler: Signed integer range analysis for imul_32x16 generation" regresses several OpenCL tests +- aco: Missing 8-bit uadd_carry +- !16168 broke Xephyr -glamor fallback to swrast +- panfrost/rk3399: performance drop in emulationstation and in RetroArch menu [mesa >= 21.3.8, batocera.linux, bisected] +- d3d12/VA-API: [-wmissing-prototypes] with MSYS2 MinGW-W64 UCRT64 GCC +- clover: windows: library filename has \`-1` suffix and a \`lib` prefix when built with mingw +- freedreno: !19236 broke anitchamber trace +- radv, dxvk: Rendering errors in World of Tanks after "Switch to dynamic rendering only" +- gen9 gt3e/gt4e skus fail dEQP-VK.pipeline.multisample.sample_locations_ext.* +- Intel - GPU Hang when starting new game in Hellblade: Senua's Sacrifice +- v3d: Wrong colors (pink) in videos in Firefox (likely YUV->RGB shader issue) +- anv/zink: cts failures on ICL +- panfrost t860 glmark-es2 regression +- panfrost_drm.h build error +- radv: Artifacts in multiple games with yuzu (regression, bisected) +- Doom (1993) Ray-Traced has render glitches. +- radv: Flickering in Spider-Man Remastered (Regression) (Bisected) +- radv: Hitman 2 using Direct3D 12 has discolored squares on RDNA2 with DCC enabled +- panfrost/midgard - on Duckstation PSX emulator: segfault on GLES 3.0 and bad shader compilations on 3.3 +- nouveau: Updating to Mesa*22.2.0 makes videos have corrupted colors in Firefox +- [radeonsi] OpenGL rendering issue with Unreal Tournament 99 & OldUnreal 4.69 rc4 +- Replace the functionals in util/debug.h with util/u_debug.h +- gallium: nir_opt_access desynchronizes image access modifiers in NIR with Program->sh.ImageAccess +- [radv/navi21] No sprites being rendered in Quake 2 RTX +- Hacknet does not render on Panfrost +- !19259 broke some piglit arb_tessellation_shader tests with radeonsi if TCS passthrough shader is cached +- panfrost g52 issue +- [Intel] Doom Eternal crashes shortly after launching +- GravityMark segfault when using ray tracing ANV +- nir: Incorrect idiv lowering +- intel-clc fails with "Unsupported SPIR-V capability" and SEGFAULT. +- Navi 14 hangs running "rendercheck -t cacomposite -f a8r8g8b8" since !18170 +- glx=xlib can not be built under ubuntu 22.04 +- radv: incorrect query result for VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT +- crocus: EGL_ANDROID_native_fence_sync extension is missing +- freedreno/a6xx: spec@arb_tessellation_shader@execution@tess-instance-id fail +- r300: wrong primitive positions in CXBX-R emulator with nine and RADEON_DEBUG=notcl +- EGL re-initialization can fail silently +- Rusticl: fails to create program if source size is stated, but last char is a null terminator +- r300: register allocation sometimes generate invalid swizzles +- Mesa git Upgrade causes cursor detection issues in Wayland, issue was not in 22.3.0-git.20220920.8d66c45.fc36 +- Proposal: Remove rbug, graw, graw tests, xa, xvmc +- intrinsics.h:112:21: error: use of undeclared identifier 'intel_sub_group_shuffle' +- tu: broken since dynamic pcp +- radv: Crash in dEQP-VK.ray_query.misc.dynamic_indexing +- t860: mediapipe application failed +- glthread: radeonsi: offset textures in game starsector with glthread enabled +- Crashing on Windows VM +- zink: descriptor-handling regression +- hasvk: noperspective is broken on Haswell hardware (Intel(R) HD Graphics 4400) +- util/u_atomic.h can not be included in util/u_debug.h +- radv: dynamic patch control points broken +- Exanima renders with the wrong colors. +- Missing tile cache flushes causing corruptions +- turnip: compute shader hang in the Monster Hunter World +- anv_private.h:118:10: fatal error: anv_entrypoints.h: No such file or directory +- freedreno: mis-rendering on gardensofkadesh/homeworld +- piglit.spec.egl 1_4.egl-ext_egl_image_storage times out +- nouveau: tegra124: GL_OUT_OF_MEMORY error +- r600: Shadows incorrecly rendered on Counter-Strike: Global Offensive with the NIR backend +- r300: regalloc doesn't handle brk correctly +- Recent WSI commit breaks every Vulkan application on RDNA 2 GPUs +- 8775f08bf11 makes warcraft 3 crash with free(): double free detected in tcache 2 +- anv: probable ccs issue on icl +- iris: remap_tess_levels assertion failure in shader-db runs +- RADV: Mafia & Mafia III :Definitive Edition graphical issues with certain effects +- Cyberpunk 2077 ground reflectiveness issues using RADV +- [RADV] Grand Theft Auto V: Shadow rendering bug on Steam Deck +- rusticl: fails to build on non-Linux: cannot find value \`stderr` in this scope +- llvmpipe-piglit-rusticl fail at api/clenqueuefillimage +- Broken build with Clang-16 due to: aco_register_allocation.cpp:382:7: error: no matching function for call to 'printf' +- src/gallium/drivers/r600/sfn/sfn_instr.cpp:92:33: error: expected primary-expression before 'int' +- r600: Make NIR the default backend +- undefined reference to \`intel_ds_begin_xfb' +- data-race leads to crash in u_trace_context_init() +- radeonsi/radv: Discrepancy in handling wd_switch_on_eop +- Shadows rendering issue in Deathloop +- RADV: Regression with RPCS3 where nothing renders +- [RADV][bisected] commit 3714c89d0ebf0c541d8f7a7a807ce7c1bfe3cfea at least lead to GPU hang when I run the game "Quake II RTX" +- eglCreateContext doesn't generate EGL_BAD_MATCH when share_context is on a different EGLDisplay +- Intel - Homerun Clash misrendering issue +- r300: dEQP-GLES2.functional.shaders.indexing.tmp_array.float_dynamic_write_dynamic_loop_read_fragment regression +- turnip/a660: dEQP-VK.spirv_assembly.instruction.terminate_invocation.terminate.subgroup_ballot failure due to "movmsk" reordered with "kill" +- WSI reports 11 RGB bits is unsupported in X11, using zink/swrast +- [radeonsi] Graphical corruption in Valve Source-based games +- RADV: VK_EXT_load_store_op_none support? +- radv: extendedDynamicState2PatchControlPoints +- Vega 8 rendering GDM grey blank login screen after recent change +- r300: possibility of merging ADD and MUL into MAD when effectively only 2 sources are used +- MESA_DEBUG=incomplete_tex shows unnecessary validation during glClear +- anv: Performance drop with Vulkan on Wayland +- lima: crash during dEQP-EGL execution +- Regression from https://gitlab.freedesktop.org/mesa/mesa/-/commit/8fcb4aa0ebd7b9d0d8f80986fb817afea2fc4a87 +- radv: Enable instruction timing with RGP for RT +- Updates to 22.2 RCs cause artifacts on nouveau and blank screen on VirtIO +- nouveau: corruption on NVAC after switching to the NIR backend +- egl: Remove big-display-lock +- DG2: vulkancts tests spin forever +- [regression][bisected] v3dv: dEQP-VK.api.object_management.alloc_callback_fail_multiple.graphics_pipeline fails +- v3dv: evaluating reducing current value of DEFAULT_MAX_BO_CACHE_SIZE +- venus-lavapipe flakes +- amd: LPDDR4 and LPDDR5 hits an assertion failure in ac_memory_ops_per_clock +- zink: GL_TEXTURE_LOD_BIAS vs mipLodBias, and GTF-GL46.gtf21.GL3Tests.texture_lod_bias.texture_lod_bias_all failure on Turnip +- ci: radeonsi-raven-skqp also tests RADV but only runs on radeonsi changes +- Gallium OpenGL megadriver fails to build if LLVM is enabled but not required by any component built +- panfrost: crash in dEQP test +- bitpack_helpers.h:34:10: fatal error: valgrind.h: No such file or directory +- [gen9_iris] piglit causes crash on 4GB-memory machine when running spec/!opengl 1.1/max-texture-size +- Abort due to "Unknown intrinsic: intrinsic store_deref" (firefox, radeonsi) +- radv: missing support for some 3-component vertex formats in RDNA2 +- radv: radv_translate_buffer_dataformat overhead +- nir: Replacing GC with manual memory management +- turnip: dEQP-VK.subgroups.vote.graphics.subgroupallequal_*int regression on a635 +- regression from !17949 +- r600/sfn: xonotic shows a big performance regression when using the NIR backend with sb disabled +- r600/sfn: register allocation fails for a number of piglits and deqp tests +- DG2 corruption in gnome desktop from F36 +- d3d12: Delete custom OpenGLOn12 target, add option to override libgallium_wgl.dll name +- radeonsi: error allocating sparse buffers.. bug? +- RADV: vulkaninfo fails to report extended VkShaderStageFlagBits for VK_KHR_ray_tracing_pipeline +- [radeonsi] Transform feedback with array outputs fails +- [BISECTED/radeonsi] Flickering triangles with wayfire cube +- re-evaluate the linker for CI +- [RADV] (Vega/Navi) Halo: The Master Chief Collection: Texture issue on Halo 3 and Reach +- ../src/util/u_queue.c:218: util_queue_fence_destroy: Assertion \`fence->signalled' failed for iris when disabled UTIL_FUTEX_SUPPORTED +- Freedreno turnip mesa cause emulation close after 15-20min gameplay +- glsl: textureGatherOffset param broken +- turnip: broken after switching to userspace vma +- turnip: gfxbench aztec ruins crashes on zink +- !16863 disabled all meson tests in CI +- intel unittests failed a lot because recents commits Job Failed #27063524 +- Microsoft CLC runtime crash (at least on 32-bit) since !12510 +- Job Failed #26465985 +- v3dv: android: VTS big regression after 316728a55bc8fe398be1ac2796a22f8c91fb886c +- radeonsi: driver crashes with kwin-effects-cube +- [Turnip] Final Fantasy X: Incorrect hair rendering +- iris: Suboptimal usage of HiZ + CCS for mipmapped buffers +- Document state of Dozen Vulkan support in features.txt for MesaMatrix site.. +- turnip: gfxbench car chase crash on zink +- [r300] - osu! - Shader optimizations cause black and glitchy rendering +- [Bisected] Microsoft CLC: build failure with MSVC and Dshared-llvm=disabled with AMDGPU target available +- Android.mk: meson enables flags based on the libraries installed on the host system +- anv: invalid returns from vkCreateDescriptorPool +- [Bisected][RadeonSI] Blender crashes on startup +- lavapipe CI times out +- turnip: depth/stencil regressions +- radv regression: Hitman 2 rendering issue +- llvmpipe: assert in KHR-Single-GL45.enhanced_layouts.xfb_all_stages +- android: vulkan: Disable blocked KHR_performance_query extension (v3dv, turnip, anv) +- Poor performance/utilization with Splitgate +- [Bisected][RADV] Rendering issues and performance drop with GFX10 on RPCS3 +- freedreno_gmem.c crashes Firefox Nightly in Khronos +- freedreno / mesa 22.1.3 crashes Firefox Nightly +- !17693 broke piglit oes_egl_image_external_essl3 +- VK_PRESENT_MODE_IMMEDIATE_KHR rendering artifacts +- dzn: Build failure when it's the only driver built - 22.2-devel regression +- d3d12: Unresolved external symbols from Version.lib +- a618 vk_full timing out at 60 minutes instead of 120 + + +Changes +------- + +Adam Jackson (25): + +- egl: Remove some can't-happen supported API checks +- egl: Move an error check earlier in EGL_BUFFER_AGE_EXT query +- wsi/x11: Defer clearing image busy flag for non-shm upload +- egl/kopper: Don't add EGL_SWAP_BEHAVIOR_PRESERVED_BIT configs +- glx: Use XSaveContext, delete glxhash.c +- Revert "glx: Use XSaveContext, delete glxhash.c" +- wsi/x11: Fix the is-visual-supported check +- nouveau: const cleanup +- egl/dri2: Respect the arguments to dri2_set_blob_cache_funcs +- egl/dri2: Fix a typo in a comment +- egl/dri2: Fix some thinkos in old context release +- glx/dri*: Unify glx_context subclassing +- glx/dri: Avoid a weird indirection in driFetchDrawable +- glx: Remove some excess work from the GLX_FBCONFIG_ID fallback +- egl: Fix a bunch of maybe-uninitialized warnings +- egl: Promote _eglLock/UnlockDisplay to internal API +- egl: Untangle some spaghetti around _eglLookupDisplay +- egl: Factor out the eglTerminate write-lock pattern +- egl: Rewrite eglSetBlobCacheFuncsANDROID to use the standard macros +- mesa/fbo: Don't force both read/draw bindings to zero +- egl: Formatting fix +- egl: Factor some common terminate cleanup up to common code +- egl/dri2: Fix a weird conditional in dri2_make_current +- egl: Remove a bogus restriction from eglMakeCurrent +- rusticl: meson devenv support + +Adrián Larumbe (5): + +- panfrost: Handle Job VA cycles when decoding a dump file +- panfrost: Remove documentation reference to deprecated parameter +- panfrost: Add compiler args option for building usermode tools +- panfrost: Sync panfrost_drm.h from drm-misc-next +- panfrost: Add userspace crash dump decoder and analyser + +Alejandro Piñeiro (11): + +- v3dv/bo: reduce DEFAULT_MAX_BO_CACHE_SIZE to 64 +- v3dv/pipeline: use a array instead of individual pointer to stages +- v3dv/pipeline: keep qpu_insts around if we expect them to be used later +- v3d: replace all TODO around for FIXME +- broadcom/compiler: don't call nir_opt_load_store_vectorize on all v3d_optimize_nir calls +- broadcom/compiler: update how we compute return_words_of_texture_data on non-ssa +- broadcom/compiler: add more lowerings/optimizations on v3d_optimize_nir +- v3dv/pipeline: use v3d_optimize_nir +- broadcom/compiler: pass a strategy struct to vir_compile_init +- broadcom/compiler: call nir_opt_gcm with a custom strategy +- v3dv/bo: reset bo and then call gem close + +Alejandro Tafalla (1): + +- freedreno: Fix graphic glitches on a4xx and a5xx + +Alessandro Astone (3): + +- Android.mk: Disable lmsensors +- Android.mk: Filter out --build-id=md5 linker flag +- Android.mk: Provide a custom entry name to ensure meson checks succeed + +Alex Brachet (1): + +- nir: Fix qsort comparator function + +Alyssa Rosenzweig (414): + +- docs/asahi: Document varying interpolation +- agx: Fix packing of samplers in texture instrs +- agx: Fix ld_var cf packing +- asahi: Plumb through lower_clip_fs +- asahi: Extend counts in BIND packets +- asahi: Allow large uniform records +- asahi: Dump all textures&samplers +- asahi: Fix using multiple textures/samplers +- asahi: Use a single bind texture/sampler per pipeline +- asahi: Split vertex/fragment pipeline binds +- asahi: Correct bind fragment pipeline size +- asahi: Encode known bits of Linkage in the XML +- asahi: Fix varying XML +- asahi: Decode Interpolation packets +- asahi,agx: Rewrite varying linking +- agx: Add AGX_MESA_DEBUG=noopt option +- agx: Model perspective coefficient reg in the IR +- agx: Rename varying load instructions +- agx: Use split instead of extract for ldcf +- agx: Implement noperspective interpolation +- agx: Lower more ALU operations +- agx: Align 64-bit register pairs +- agx: Assert that registers are naturally aligned +- agx: Extract umul_high implementation +- agx: Implement nir_op_umul_high +- agx: Stop using broken idiv lowering +- agx: Handle type-changing splits +- agx: Remove p_extract +- agx: Only emit the used components of gl_FragCoord +- asahi: Don't advertise ARB_clip_control yet +- asahi: Identify depth clip mode bit +- pan/decode: Don't pass around memory handles +- pan/decode: Remove pandecode_msg +- pan/decode: Stop passing job index around +- pan/decode: Stop passing suffixes around +- pan/decode: Simplify pandecode_fbd +- pan/decode: Reorder MFBD decoding +- pan/decode: Unify SFBD/MFBD decoding +- pan/decode: Defeature disassembler stats +- pan/decode: Centrally declare pandecode entrypoints +- pan/decode: Clean up _bifrost_ decode routines +- pan/mdg: Remove disassembler stats +- panfrost: Don't segfault on unknown models +- pan/bi: Implement some extracts and inserts +- pan/bi: Fuse [US][8|16]_TO_F32 ops +- pan/bi: Test int8/16 -> float32 opts +- pan/bi: Optimize bitwise arithmetic of booleans +- pan/bi: Optimize pattern from nir_lower_idiv +- pan/bi: Don't use the broken idiv lowering +- pan/bi: Use nir_opt_idiv_const +- asahi: Fix warning building for macOS 12.0 +- pan/bi: Don't reorder image loads across stores +- pan/bi: Don't allow ATEST to take a temporary +- pan/bi: Handle info.fs.untyped_color_outputs on Valhall +- panfrost: Handle untyped_color_outputs on Bifrost +- panfrost: Set blit output variable types correctly +- panfrost: Don't saturate in Bifrost blend shaders +- panfrost: Promote blend shader outputs 8->16-bit +- panfrost: Avoid blend shader when not blending +- panfrost: Don't compile empty blend shaders +- panfrost: Invert no_colour to enabled +- panfrost: Simplify blitter blend shader creation +- panfrost: Include mask in replace blend shader name +- panfrost: Assert that blend shaders are nontrivial +- r600/sfn: Don't use broken idiv lowering +- gallium: Inline away util_make_fragment_tex_shader_writemask +- gallium: Inline away util_make_fragment_tex_shader interp_mode +- gallium: Remove util_make_fragment_tex_shader_xrbias +- pan/mdg: Replicate swizzles for scalar sources +- pan/mdg: Reexpress umul_high packing +- pan/mdg: Use correct idiv lowering +- pan/mdg: Always write return address to r1.w +- pan/mdg: Print 3 sources for CSEL +- panfrost/ci: Update G72 xflakes list +- gallium: Add u_default_set_debug_callback +- v3d: Use u_default_set_debug_callback +- vc4: Use u_default_set_debug_callback +- softpipe: Use u_default_set_debug_callback +- lima: Use u_default_set_debug_callback +- etnaviv: Use u_default_set_debug_callback +- r300: Use u_default_set_debug_callback +- panfrost: Route shader-db to debug, not stderr +- pan/bi: Implement general 8-bit vector construction +- pan/va: Handle 8-bit lane when lowering constants +- pan/bi: Implement nir_op_vec8 and nir_op_vec16 +- pan/bi: Assert that vectors are sufficiently small +- docs/asahi: Document image layouts +- asahi: Identify compressed render target fields +- asahi: Rename 'tiled 64x64' to 'twiddled' +- asahi: Add XML for multisampled textures +- asahi: Fix bind sizes +- asahi: Fix ASTC enums +- ail: Introduce image layout module +- ail: Rewrite tiled memcpy for correctness +- asahi: Handle 2-channel sRGB textures +- asahi: Fix is_format_supported returns +- asahi: Advertise ETC1 +- asahi: Advertise ASTC formats +- asahi: Rename our fake twiddled DRM modifier +- ail: Add unit tests for miptree layouts +- pan/bi: Fix dual texturing with uniforms +- pan/bi: Fix out-of-bounds write in va_lower_split_64bit +- pan/bi: Consider all dests in helper_block_update +- pan/bi: Model [IF]CMP_{OR,AND,MULTI} ops +- pan/va: Split out compare instructions +- pan/bi: Set I->nr_dests, I->nr_srcs +- pan/bi: Model 3rd source for ATEST +- pan/bi: Consider nr_srcs/dests in CSE +- pan/bi: Use nr_{srcs,dests} in bi_foreach_{src,dest} +- pan/bi: Add validation for nr_srcs/nr_dests +- pan/bi: Remove trivial dest[0] = bi_null() +- pan/bi: Clean up destination printing +- pan/bi: Improve register printing +- pan/bi: Register allocate BLEND dest on Valhall +- pan/bi: Don't remove dests in DCE +- pan/bi: Use bi_emit_split_i32 for trimming vecs +- pan/bi: Introduce TEXC_DUAL psuedoinstruction +- pan/bi: Assert destinations are non-NULL +- pan/bi: Assume destinations are non-NULL +- pan/bi: Don't read nonexistant destinations +- pan/bi: Don't read nonexistant sources +- pan/bi: Add variable dest/src support to builder +- pan/bi: Use variable src/dest for collect/split +- pan/bi: Use safe helpers to remove srcs/dests +- pan/bi: Rebuild some instructions when lowering +- pan/bi: Rebuild DISCARD when optimizing +- pan/va: Use builder in va_lower_isel +- pan/bi: Use builder for MUX -> CSEL opt +- pan/bi: Use builder for IADD -> IADDC opt +- pan/bi: Use builder for atomic lowering in scheduler +- pan/bi: Reduce memory of scheduler access array +- pan/bi: Dynamically allocate source/dests +- pan/bi: Don't write registers in optimizer tests +- pan/bi: Add bi_after_block_logical helper +- pan/bi: Lift bi_predecessor_index helper +- pan/bi: Add phi nodes +- pan/bi: Validate phi ordering +- pan/bi: Don't lower FAU for phis +- pan/bi: Remove bogus assert lowering branches +- pan/bi: Don't optimize if without else +- pan/bi: Don't reorder phis in pre-RA scheduling +- pan/bi: Validate before&after pre-RA sched +- pan/bi: Inline DCE into bi_lower_vector +- pan/bi: Translate phis from NIR +- pan/bi: Add SSA-based liveness pass +- pan/bi: Add brainless out-of-SSA pass +- pan/bi: Reduce some moves when going out-of-SSA +- pan/bi: Preserve SSA form from NIR +- pan/bi: Clean up after converting to SSA +- pan/bi: Clear reg in squeeze_index +- pan/bi: Don't use bi_temp_reg +- pan/bi: Assume SSA in copyprop +- pan/bi: Assume SSA in mod prop +- pan/bi: Assume SSA in CSE +- pan/bi: Assume non-null sources in constant folding +- pan/bi: Assume SSA in minor passes +- pan/bi: Assume SSA when scheduling for pressure +- pan/bi: Assume SSA when translating NIR +- pan/bi: Assume SSA for helper invocation analysis +- pan/bi: Move non-SSA liveness into RA +- pan/bi: Remove NIR registers from the IR +- pan/bi: Get rid of bi_get_node +- pan/bi: Inline node_to_index +- pan/bi: Remove bi_max_temp +- pan/bi: Simplify bi_get_index prototype +- pan/bi: Add and use bi_foreach_ssa_src macro +- pan/bi: Strengenth assert in the validator +- pan/bi: Remove assert(bi_is_ssa(dest)) +- pan/bi: Add and use bi_replace_src helper +- pan/bi: Add and use bi_num_successors helper +- agx: Fix tib access in internal shaders +- agx: Fix float copyprop of neg(neg) case +- agx: Add more unit tests for float copyprop +- agx: Don't hardcode uniform enums +- agx: Defeature indirect vbufs +- agx: Plumb through nir_op_txf +- asahi: Fix Indexed Draw command +- asahi: Split up unk 2b +- asahi: Note some more unknowns in the XML +- asahi: Make ctx->zs a pointer +- asahi: Export agx_blitter_save +- asahi: Handle empty fragment shaders +- asahi: Use util_blitter_clear +- asahi: Trim garbage at end of set shader +- asahi: Relax assert in decoder +- asahi: Label batch flush reasons +- asahi: Fix depth for cube maps +- asahi: Fix nonmipmapped array textures +- asahi: Fix "stride" for tiled textures +- asahi: Identify and use first level field of texture +- asahi: Correct SET_SHADER_EXTENDED disambig bit +- asahi: Add 1D and 1D Array enums +- asahi: Handle out-of-bounds clear_stencil +- asahi: Fix depth/stencil buffers +- asahi: Identify stencil test enable +- asahi: Split RASTERIZER into constituent words +- asahi: Identify object type field via PowerVR +- asahi: Don't use lower_wpos_pntc +- asahi: Match PPP data structures with PowerVR +- asahi: Hoist constant PPP state to start of batch +- asahi: Dirty track everything +- asahi: Make BO list growable +- asahi: Annotate VDM/CDM commands as per PVR +- asahi: Express VDM commands according to PowerVR +- asahi: Handle Stream Link VDM commands +- asahi: Allocate new cmdbufs if out of space +- docs/envvars: Document Asahi variables +- pan/decode: Fix job cycle detection +- nir/lower_blend: Avoid emitting unnecessary fsats +- panfrost: Respect buffer offset for OpenCL +- panfrost: Honour cso->req_local_mem +- nir: Add nir_intrinsic_texture_base_agx sysval +- agx,asahi: Implement nir_intrinsic_load_texture_base_agx +- agx: Make p_combine take a dynamic src count +- agx: Implement texture offsets and comparators +- agx: Implement txd +- agx: Implement load_global(_constant) +- agx: Lower txs to a descriptor crawl +- agx: Don't use nir_find_variable_with_driver_location +- asahi: Don't crash on <4 channel render targets +- asahi: Handle blending with MRT +- u_transfer_helper: Handle Z24X8 for drivers that don't use the interleaved transfer_map +- u_transfer_helper: Pack Z24S8 to Z24-in-Z32F and S8 +- asahi: Assert cache line alignment on Z/S buffers +- asahi: Identify ZLS Control word from PowerVR +- asahi: Identify CDM block types +- asahi: Decode CDM commands separate from VDM +- asahi: Shuffle IOGPU structs +- asahi: Identify IOGPU compute header +- asahi: Decode IOGPU compute header +- asahi: Assert that u_transfer_helper is well-behaved +- asahi: Use the internal format internally +- asahi: Identify spill buffer histogram +- asahi: Simplify IOGPU attachment packing +- asahi: Identify shared memory fields +- asahi: Consolidate magic numbers for USC controls +- asahi/genxml: Overflow up to words when packing +- asahi: Overhaul USC control packing +- asahi: Identify pixel stride +- asahi: Identify shared memory layouts +- panfrost: Evict the BO cache when allocation fails +- rusticl: Build Panfrost +- panfrost: Adapt emit_shared_memory for indirect dispatch +- agx: Convert and clamp array indices in NIR +- panfrost: Default pipe->clear_texture impl +- panfrost: Allow compiling MESA_SHADER_KERNEL +- panfrost: Upload default sampler for txf +- panfrost: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS +- panfrost: Advertise PIPE_CAP_INT64 +- panfrost: Honour flush-to-zero controls on Valhall +- panfrost: Assume launch_grid parameters always change +- pan/va: Fix missing swizzle on CLZ.v2u16 +- pan/bi: Unit test swizzle lowering +- pan/bi: Lower some 8-bit swizzles +- pan/bi: Test some 8-bit swizzle lowering +- pan/bi: Lower swizzles for 8-bit shifts +- pan/bi: Strip negate when lowering swizzles +- pan/va: Pack IADD.sat bit +- pan/va: Fix v4s8 form of R2 opcodes +- pan/bi: Handle uhadd, urhadd opcodes +- pan/va: Pack .rhadd bit +- pan/bi: Add HADD.v4s8.rhadd packing test cases +- pan/va: Add 8-bit integer max assembler case +- pan/bi: Lower 8-bit min/max to bcsel+comparison +- pan/bi: Lower f2i8, f2u8 +- pan/bi: Remove bogus assert for pack_32_2x16 +- pan/bi: Allow selecting from an 8-bit vec8 +- pan/bi: Handle swizzles in unpack_64_2x32_split_{x,y} +- pan/bi: Lower <32-bit bit_count +- pan/bi: Lower fisnormal +- pan/bi: Scalarize phis before the opt loop +- pan/bi: Call nir_lower_64bit_phis +- pan/bi: Use .auto for image stores +- pan/bi: Fix 1D array indexing on Valhall +- pan/bi: Stub out scoped_barrier +- pan/bi: Implement unpack_64_2x32 +- agx: Fix bfi_mask packing +- agx: Emit fewer combines for intrinsics +- agx: Use 16-bit immediates +- agx: Validate immediates +- agx: Test fsat+f2f16 together +- agx: Ensure that the optimizer sees legitimate SSA +- agx: Pass mask into ld/st_tile instructions +- agx: Account for mask when writing registers +- agx: Remove load_kernel_input path +- panfrost: Remove load_kernel_input path +- nir/lower_system_values: Fix cs_local_index_to_id with variable workgroups +- asahi: Eliminate "Pixel Format" type from GenXML +- asahi/genxml: Defeature uint/float +- panfrost,asahi: Remove unused prepare macro +- panfrost,asahi: Remove unused function +- asahi,panfrost: Remove exact attribute +- asahi: Use a header more like Intel's GenXML +- asahi: Remove no-direct-packing +- agx: Add iterators for phi/non-phi instructions +- agx: Set PIPE_SHADER_CAP_INDIRECT_CONST_ADDR +- agx: Emit smaller combines for nir_op_vec2/3 +- agx: Don't prefix pseudo-ops +- agx: Improve printing of immediate sources +- agx: Switch to dynamic allocation of srcs/dests +- agx: Dynamically size split instruction +- agx: Rename "combine" to "collect" +- agx: Emit trap at pack-time, not during isel +- agx: Print instructions as "dest = src" +- agx: Print agx_dim appropriately +- agx: Preload vertex/instance ID only at start +- agx: Write to r0l with a "nesting" instruction +- agx: Introduce ra_ctx data structure +- agx: Pass in max regs as a paramter to RA +- agx: Reserve live-in regs at the start of block +- panfrost/ci: Disable t720 jobs +- nir: Be smarter fusing ffma +- mesa,gallium: Revert "Make point coord origin a CAP" +- panfrost/ci: Remove stale fail +- pan/mdg: Lower PIPE_COMPUTE_CAP_MAX_THREADS_PER_BLOCK on Midgard +- pan/mdg: Limit work registers for large workgroups +- pan/mdg: Fix 16-bit alignment with spiller +- nir: Fix nir_fmax_abs_vec_comp +- nir: Usher nir_normalize_cubemap_coords into 2022 +- mesa: Fix multiple matrix pops in a row +- nir/lower_io: Set interpolated_input dest_type +- u_transfer_helper: Handle Z24S8 with z24_in_z32f but no interleaving +- asahi: Lower point sprites in driver +- gallium: Default to PIPE_CAP_MIXED_COLORBUFFER_FORMATS +- panfrost: Remove NIR_SERIALIZED support +- asahi: Hold a reference to BOs in a batch +- nir/opt_preamble: Move load_texture_base_agx +- agx: Remove command line compiler +- asahi: Remove AGX_FAKE_DEVICE environment variable +- asahi: Precompile for shader-db +- asahi: Route shader-db stats to debug callback +- asahi: Identify "Uniform high" USC word +- asahi: Identify more shader-related fields +- asahi: Remove "padding" field +- asahi: Make agx_varyings a union +- asahi: Use USC_UNIFORM_HIGH +- asahi: Rename LOD clamps tests to fit other packing +- asahi: Don't use unnecessary test fixture +- asahi: Add group tests +- asahi: Support preamble shaders +- agx: Be less sloppy about high uniforms +- agx: Check that we don't push too much +- agx: Make push_count public +- agx: Handle multiple functions +- agx: Align preamble shaders +- agx: Add uniform_store instruction +- agx: Allow larger indices for ld/st +- agx: Strengthen assert for packing ld/st instructions +- agx: Implement scalar load/store_preamble +- agx: Handle uniforms passed to COLLECT +- agx: Restrict copyprop of uniforms +- agx: Handle 64-bit moves +- agx: Avoid reading high uniforms from device_load +- agx: Report GPRs to the driver +- asahi: Set uniform counts accurately +- asahi: Set GPR count accurately +- agx: Add agx_nir_opt_preamble pass +- agx: Use agx_nir_opt_preamble +- panfrost: Don't use lower_wpos_pntc on Midgard +- panfrost: Use proper formats for pntc varying +- etnaviv: Use correct idiv lowering +- nir/lower_idiv: Remove imprecise_32bit_lowering +- nir/lower_idiv: Inline convert_instr_precise +- panfrost: Zero polygon list for fragment-only +- panfrost: Don't allow VS side effects on midgard +- pan/bi: Clean up sysval handling a bit +- pan/mdg: Fix 64-bit address arithmetic +- panfrost: Use compute-based XFB on Midgard +- panfrost: Avoid a XFB special case +- panfrost: Lower MAX_BLOCK_SIZE on Midgard +- pan/mdg: Don't schedule across memory barrier +- panfrost,asahi: Support ARB_buffer_storage +- gallium: Stub support for Asahi + DRM +- gallium: Only use Asahi's software path on macOS +- panfrost/ci: Disable trace-based testing +- panfrost: Enable rendering to 16-bit and 32-bit +- panfrost: Add lots of perf_debug annotations +- asahi: Identify counts for compute kernels +- softpipe: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT +- i915g: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT +- etnaviv: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT +- gallium: Default to PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT +- nir/opt_algebraic: Fuse c - a * b to FMA +- pan/bi: Don't lower outputs for compute +- panfrost: Don't use nir_variable in the compilers +- pan/mdg: Disassemble the .a32 bit +- pan/mdg: Don't pair ST_VARY.a32 with other instrs +- pan/mdg: Use .u32 for flat shading +- pan/mdg: Use bifrost_nir_lower_store_component +- panfrost: Don't allocate space for empty varyings +- panfrost: Don't use nir_variable to link varyings +- panfrost: Delete set_global_resources +- panfrost: Move small compute functions to pan_context.c +- panfrost: Remove bogus assert +- panfrost: Consolidate all shader compiling code +- panfrost: Merge pan_assemble.c into pan_shader.c +- panfrost: Remove unused req_input_mem copy +- panfrost: Rename structs to panfrost_(un)compiled_shader +- panfrost: Remove uncompiled_shader->active_variant +- panfrost: Use u_dynarray for variants +- panfrost: Make fixed_varying_mask a fragment-only key +- panfrost: Precompile transform feedback program +- panfrost: Precompile shaders +- panfrost: Implement a disk cache +- docs/new_features: Add Panfrost disk cache +- panfrost: Don't copy resources if replaced +- panfrost: Replace resource shadowing flush +- panfrost: Fix build with Perfetto (again) +- panfrost: Copy resources when necessary +- panfrost: Require 64-byte alignment on imports +- panfrost: Fix reference counting with batch->resources +- panfrost: Use PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY +- pan/mdg: Fix out-of-order execution +- panfrost: Revert "Require 64-byte alignment on imports" + +Andri Yngvason (1): + +- gallium/vl: Add opaque rgb pixel formats + +Anton Bambura (1): + +- panfrost: Enable Mali-T620 + +António Monteiro (2): + +- freedreno: get timestamp from os_time_get_nano instead of os_time_get +- gallium/util: remove network class + +Axel Davy (6): + +- frontend/nine: Skip invalid swvp calls +- frontend/nine: Fix buffer tracking out of bounds +- frontend/nine: Fix ATOC handling +- frontend/nine: Fix cso restore bug +- frontend/nine: Fix shader multi-use crash +- frontend/nine: Fix ff position_t fallback when w = 0 + +Bas Nieuwenhuizen (56): + +- vulkan/wsi: Take max extent into consideration for modifier selection. +- radv: Remove redundant radv_QueuePresentKHR. +- radv: Add binding code for 3d sparse images. +- radv: Add 3d tile shapes for sparse binding. +- radv: Expose 3d sparse images. +- amd/common: Don't rely on DCN support checks with modifiers. +- amd/common: Disable DCC retile modifiers on RDNA1 +- radv: Don't flatten bottom AS exit if statement. +- radv: Store top of stack in a register. +- Revert "radv/rt: use derefs for the traversal stack" +- radv: Use constant for ray traversal exit condition. +- radv: Use nested ifs for pushing child nodes in traversal loop. +- radv: Add scratch stack to reduce LDS stack in RT traversal. +- ac/surface: Fix some warnings. +- radv: Hardcode root node id. +- radv: Translate the BVH copy shader to glsl from nir_builder. +- radv: Remove aabb bounds from instance nodes. +- radv: Use proper matrices for instance nodes. +- radv: Use GLSL matrices for instance transforms in BVH. +- radv: Use deterministic order for dumping acceleration stuctures. +- radv: Properly initialize all memory in RRA dumps. +- radv: Do an early check of distance in triangle culling. +- vulkan/wsi/x11: Fix double free of modifiers. +- radv: Clean up unused fields in BVH IR. +- radv: Add a field for the offset of the bvh in the blas. +- radv: Simplify buffer copy address generation. +- radv: Start the BVH after the header. +- radv: Generate parent links in BVH. +- radv: Only emit parents from parents that actually end up in the tree. +- radv: Split global & local bvh node variable. +- radv: Add traversal backtracking with a short stack. +- radv: Properly annotate all the invalid node usage. +- radv: Implement pipeline-wide skiptriangles/skipaabbs. +- radv: Consider inactive internal nodes never in the tree. +- radv: Constify entry_size. +- radv: Use a VkPipelineCache handle for meta shaders. +- radv: Rework modification detection of the meta cache. +- radv: Mark dEQP-VK.ray_query.misc.dynamic_indexing as crashing in CI. +- aco: Pre-split result of bvh64_intersect_ray_amd. +- radv: Improve gfx11 clear colors for DCC. +- radv: Use correct primgrp size for gfx11. +- radv: Add BVH IR header. +- radv: Make the number of internal nodes be written on the GPU. +- radv: Add PLOC shader +- radv: Use PLOC for BVH building +- radv: Speculatively tune RT pipelines for GFX11. +- radv: Use correct types for loading the rings with LLVM. +- radv: Handle GSVS ring intrinsic correctly with LLVM. +- radv: Handle attribute ring intrinsic correctly with LLVM. +- radv: Fix compute scratch buffer emission. +- radv: Make the compute scratch waves per SE as well. +- radv: Fix custom build id with C90 compilation. +- radv: Use correct init order for ETC2 image views +- radv: Fix sampler types in ETC2 decode. +- vulkan: Remove asserts that weren't valid for RADV ETC2 emulation. +- radv: reserve space for the scissor in vkCmdBeginRendering. + +Benjamin Tissoires (1): + +- CI: convert to use the new S3 server instead of the legacy minio + +Boris Brezillon (1): + +- ci/panvk: Skip dEQP-VK.api.object_management.max_concurrent.query_pool + +Brian Paul (113): + +- st_pbo/compute: fix memset() warning +- lavapipe: s/u_foreach_bit/u_foreach_bit64/ in handle_pipeline_access() +- llvmpipe: further bump LP_MAX_TGSI_SHADER_IMAGES to 64 +- llvmpipe: always pass non-zero writemask to assign_reg() +- util: allow GALLIUM_LOG_FILE=stdout +- gallivm: move lp_build_nir_aos_context declaration, etc +- gallivm: change texture/sampler_index params to unsigned +- llvmpipe: asst. formatting, clean-ups in lp_state_fs.c +- gallivm: asst. clean-ups in lp_bld_sample_soa.c +- gallivm: fix nir AOS swizzling issues +- lavapipe: remove continue statements in emit_state() to be more consistent +- lavapipe: zero-init sampler objects +- cso: asst. clean-ups in cso_hash.[ch] +- cso: use util_bitcount +- cso: asst. clean-ups in cso_cache.[ch] +- cso: asst. clean-ups in cso_context.[ch] +- draw: asst. clean-ups in draw_context.[ch] +- draw: asst. clean-ups in draw_vs_llvm.c +- draw: asst. clean-ups in draw_vs_variant.c +- draw: asst. clean-ups in draw_vs.[ch] +- draw: remove goto in llvm_pipeline_generic() +- draw: remove unused draw_has_llvm() +- draw: asst. clean-ups in draw_pipe.h +- draw: asst. clean-ups in draw_pt_fetch.c +- draw: asst. clean-ups in draw_pt_post_vs.c +- draw: asst. clean-ups in draw_pipe_cull.c +- draw: asst. clean-ups in draw_pipe_clip.c +- draw: asst. clean-ups in draw_pipe_flatshade.c +- draw: asst. clean-ups in draw_pipe_offset.c +- draw: asst. clean-ups in draw_pipe_pstipple.c +- draw: asst. clean-ups in draw_pipe_twoside.c +- draw: asst. clean-ups in draw_pipe_unfilled.c +- draw: asst. clean-ups in draw_pipe_user_cull.c +- draw: asst. clean-ups in draw_pipe_util.c +- draw: asst. clean-ups in draw_pipe_wide_line.c +- draw: asst. clean-ups in draw_pipe_wide_point.c +- draw: asst. clean-ups in draw_vertex.[ch] +- draw: asst. clean-ups in draw_pt_so_emit.c +- draw: asst. clean-ups in draw_pipe_aapoint.c +- draw: asst. clean-ups in draw_llvm.c +- draw: asst. clean-ups in draw_cliptest_tmp.h +- draw: asst. clean-ups in draw_gs.[ch] +- draw: asst. clean-ups in draw_prim_assembler.[ch] +- draw: asst. clean-ups in draw_pipe.c +- draw: asst. clean-ups in draw_pt_vsplit_tmp.h +- draw: asst. clean-ups in draw_split_tmp.h +- draw: asst. clean-ups in draw_pt.c +- draw: clean-up count assignment in draw_pt_arrays() +- draw: asst. clean-ups in draw_private.h +- draw: asst. clean-ups in draw_pt_fetch_shade_pipeline_llvm.c +- draw: replace double conditional expression with if/else +- draw: make draw_geometry_shader_run() return void +- draw: fix some const-correctness in draw_vbo() +- draw: move vs_slot var in tgsi/llvm_fetch_gs_input() +- draw: s/start_or_maxelt/start/ in llvm VS code +- draw: fix vertex id offset bug +- draw: fix base vertex bug +- llvmpipe: asst. clean-ups in lp_rast.h +- llvmpipe: asst. clean-ups in lp_rast_debug.c +- llvmpipe: asst. clean-ups in lp_rast_priv.h +- llvmpipe: asst. clean-ups in lp_rast_rect.c +- llvmpipe: asst. clean-ups in lp_rast.c +- llvmpipe: asst. clean-ups in lp_rast_linear.c +- llvmpipe: asst. clean-ups in lp_rast_linear_fallback.c +- gallium: whitespace fixes in p_context.h, p_screen.h +- llvmpipe: asst. clean-ups in lp_context.[ch] +- llvmpipe: asst. clean-ups in lp_surface.c +- llvmpipe: asst. clean-ups in lp_screen.[ch] +- llvmpipe: asst. clean-ups in lp_query.c +- llvmpipe: asst. clean-ups in lp_setup.c +- llvmpipe: asst. clean-ups in lp_setup_rect.c +- llvmpipe: asst. clean-ups in lp_setup_vbuf.c +- llvmpipe: asst. clean-ups in lp_setup_line.c +- llvmpipe: asst. clean-ups in lp_setup_point.c +- llvmpipe: remove gotos in do_rect_ccw() +- llvmpipe: asst. clean-ups in lp_setup_tri.c +- llvmpipe: asst. clean-ups in lp_bld_alpha.c +- llvmpipe: asst. clean-ups in lp_bld_blend_aos.c +- llvmpipe: asst. clean-ups in lp_bld_depth.c +- llvmpipe: asst. clean-ups in lp_bld_interp.c +- llvmpipe: asst. clean-ups in lp_debug.h +- llvmpipe: asst. clean-ups in lp_draw_arrays.c +- llvmpipe: asst. clean-ups in lp_fence.c +- llvmpipe: asst. clean-ups in lp_flush.[ch] +- llvmpipe: asst. clean-ups in lp_linear_interp.c +- llvmpipe: asst. clean-ups in lp_linear_sampler.c +- llvmpipe: asst. clean-ups in lp_scene.c +- llvmpipe: asst. clean-ups in lp_scene.h +- llvmpipe: asst. clean-ups in lp_state_sampler.c +- llvmpipe: asst. clean-ups in lp_state_derived.c +- llvmpipe: asst. clean-ups in lp_state_clip.c +- llvmpipe: asst. clean-ups in lp_state_cs.c +- llvmpipe: asst. clean-ups in lp_state_gs.c +- llvmpipe: asst. clean-ups in lp_state_rasterizer.c +- llvmpipe: asst. clean-ups in lp_state_so.c +- llvmpipe: asst. clean-ups in lp_state_surface.c +- llvmpipe: asst. clean-ups in lp_state_tess.c +- llvmpipe: asst. clean-ups in lp_state_vertex.c +- llvmpipe: asst. clean-ups in lp_test_blend.c +- llvmpipe: asst. clean-ups in lp_test_conv.c +- llvmpipe: asst. clean-ups in lp_test_format.c +- llvmpipe: asst. clean-ups in lp_test_main.c +- llvmpipe: asst. clean-ups in lp_tex_sample.c +- llvmpipe: asst. clean-ups in lp_texture.[ch] +- llvmpipe: improve some if/switch code in llvmpipe_get_shader_param() +- frontend/dri: assorted clean-ups in dri-screen.c +- frontends/dri: clean-ups in dri_util.c +- glx: clean-ups in create_context.c +- glx: clean-ups in drisw_glx.c +- llvmpipe: s/uint/enum pipe_prim_type/ in lp_setup_context.h +- llvmpipe: fix comment typo +- llvmpipe: asst. clean-ups in lp_state_fs.c +- llvmpipe: check rectangle vertices have equal W components + +Caio Oliveira (38): + +- intel/compiler: Print more details when fs_visitor::validate() fails +- intel/compiler: Call get_mesh_urb_handle one level up in call-stack +- intel/compiler: Get URB handle in emit_task/mesh_intrinsic functions +- intel/compiler: Make component() work for FIXED_GRF/ARF +- intel/compiler: Remove INTEL_DEBUG=tcs8 +- intel/compiler: Rename 8_PATCH to MULTI_PATCH +- intel/compiler: Use fs_reg helpers for TCS icp_handle selection +- intel/compiler: Use fs_reg helpers for GS icp_handle selection +- intel/compiler: Create fs_visitor::emit_tcs_barrier() +- intel/compiler: Add helper for barrier message payload setup for gfx >= 125 +- intel/compiler: Use builder to allocate fs regs for TCS store output +- intel/compiler: Make a type for Thread Payload and FS variant +- intel/compiler: Use FS thread payload only for FS +- intel/compiler: Create struct for TCS thread payload +- intel/compiler: Store Patch URB output in TCS thread payload struct +- intel/compiler: Store Primitive ID in TCS thread payload struct +- intel/compiler: Store start of ICP handles in TCS thread payload struct +- intel/compiler: Create and use struct for TES thread payload +- intel/compiler: Create and use struct for VS thread payload +- intel/compiler: Create and use struct for TASK and MESH thread payloads +- intel/compiler: Create and use struct for GS thread payload +- intel/compiler: Store start of ICP handles in GS thread payload struct +- intel/compiler: Create and use struct for Bindless thread payload +- intel/compiler: Export brw_get_subgroup_id_param_index() +- intel/compiler: Create and use struct for CS thread payload +- intel/compiler: Make thread_payload struct abstract +- intel/compiler: Add a few more brw_ud* helpers +- intel/compiler: Use brw_ud* helpers in thread payload code +- nir/lower_task_shader: Don't fail adding a launch when last instruction is a jump +- spirv: Pass vtn_block into vtn_emit_branch() +- spirv: Add workaround to ignore OpReturn after OpEmitMeshTasksEXT +- spirv: Call vtn_emit_ret_store() only when handling return branch +- spirv: Make a helper function to check glslang/shaderc +- spirv: Enable OpEmitMeshTasksEXT workaround for Clay Shader Compiler +- intel/compiler: Explicitly include build-id when linking intel_clc +- nir: Don't reorder volatile intrinsics +- intel/compiler: Fix dynarray usage in intel_clc +- intel/compiler: Fix missing tie-breaker in brw_nir_analyze_ubo_ranges() ordering code + +Caleb Cornett (3): + +- wgl: Homogenize read framebuffer reference counting logic +- wgl: Always release local references in stw_make_current_by_handles +- wgl: Release the context before deleting it in wglDeleteContext + +Chad Versace (43): + +- venus: Remove cryptic pointer arithmetic on present attachments +- venus: Drop vn_present_src_attachment::acquire +- venus: Track subpass properties +- venus/pipeline: Refactor vn_fix_graphics_pipeline_create_info +- venus/pipeline: Add pipeline state bools for ignore rules +- venus/pipeline: Split ignore rules for pViewportState, pMultisampleState +- venus/pipeline: Fix ignore rule for pDepthStencilState +- venus/pipeline: Fix ignore rule for pColorBlendState +- venus/pipeline: Ignore pViewportState::pViewports +- venus/pipeline: Ignore pViewportState::pScissors +- venus/pipeline: Ignore basePipelineHandle +- venus/pipeline: Fix ignore rules for VK_KHR_dynamic_rendering +- venus/pipeline: Fix ignore rules for VK_EXT_extended_dynamic_state +- venus: Dedupe pipeline handle creation +- venus: Fix failure path on pipeline creation +- venus: Enable VK_EXT_pipeline_creation_cache_control +- venus: Sort passthrough extensions +- vulkan: Make vk_copy_struct_guts() public +- venus: Simplify vn_GetPhysicalDeviceFeatures2 +- venus: Simplify vn_GetPhysicalDeviceProperties2 +- venus: Enable VK_KHR_shader_integer_dot_product +- venus: Enable VK_KHR_shader_non_semantic_info +- venus: Enable VK_KHR_shader_terminate_invocation +- venus: Enable VK_KHR_zero_initialize_workgroup_memory +- venus: Enable VK_EXT_subgroup_size_control +- venus: Enable VK_EXT_texel_buffer_alignment +- venus: Enable VK_EXT_ycbcr_2plane_444_formats +- venus: Enable VK_KHR_format_feature_flags2 +- venus: Enable VK_EXT_private_data +- venus: Enable VK_EXT_tooling_info +- venus: Document VK_EXT_texture_compression_astc_hdr support +- venus: Enable VK_EXT_pipeline_creation_feedback +- venus: Rename some feature/property structs +- venus: Refactor VN_ADD_TO_PNEXT +- venus: Add macros VN_SET_CORE_* +- venus: Fix features/properties for unavailable extensions +- venus: Use VkPhysicalDeviceVulkan13{Features,Properties} +- venus/queue: Rename some local vars +- venus: Change signature of vn_get_fence_feedback_cmd +- venus: Add vn_queue_wait_idle_before_present() +- venus: Refactor vn_command_buffer_builder temp storage +- venus: Enable VK_KHR_synchronization2 +- venus: Enable Vulkan 1.3 + +Charmaine Lee (6): + +- svga: fix mksstats build +- svga: fix SetConstantBufferOffset command opcode +- svga: compare graphics shader stages against COMPUTE +- mesa/st: fix reference to nir->info after nir_to_tgsi +- svga: support TGSI_SEMANTIC_TEXCOORD in swtnl draw context +- svga: fix invalid component access of domain location + +Chia-I Wu (83): + +- vulkan: sort ALLOWED_ANDROID_VERSION by api levels +- vulkan: fix ALLOWED_ANDROID_VERSION for api level 26 +- vulkan: fix ALLOWED_ANDROID_VERSION up to api level 29 +- vulkan: extend ALLOWED_ANDROID_VERSION up to api level 31 +- turnip: add tu_common.h as the common header +- turnip: remove includes that are already in tu_common.h +- turnip: add tu_drm.h +- turnip: add tu_suballoc.h +- turnip: update tu_cs.h +- turnip: add tu_query.h +- turnip: add tu_image.h +- turnip: add tu_formats.h +- turnip: update tu_descriptor_set.h +- turnip: add tu_shader.h +- turnip: add tu_pipeline.h +- turnip: add tu_clear_blit.h +- turnip: add tu_dynamic_rendering.h +- turnip: add tu_lrz.h +- turnip: add tu_pass.h +- turnip: add tu_wsi.h +- turnip: update tu_autotune.h +- turnip: add tu_device.h +- turnip: add tu_cmd_buffer.h +- turnip: add tu_android.h +- turnip: update tu_util.h +- turnip: move away from tu_private.h +- turnip: remove tu_private.h +- turnip: remove headers from libtu_files +- turnip: use SPDX-License-Identifier +- vulkan: disallow VK_KHR_performance_query on android +- turnip: fix a use-after-free in autotune +- turnip: remove unused member in tu_submission_data +- turnip: handle fence wrapping in autotune +- freedreno/pps: improve interaction with turnip +- turnip: lower the queue priority to 1 +- turnip: pCreateInfo->pPoolSizes[i] is wordy +- turnip: trim down host size of tu_descriptor_pool +- turnip: trim down bo size of tu_descriptor_pool +- turnip: improve perfetto sync_timestamp +- turnip: fix a missing trace_end_gmem_clear +- turnip: move trace_start_gmem_store before cond exec +- turnip: fix gem_store tracepoint +- turnip: tidy up tracepoint header includes +- util/u_trace: include the generated header first +- util/u_trace: add PERFETTO HeaderScope +- turnip: add tu_clone_trace_range helper +- turnip: convert tu_perfetto_state to a stack +- turnip: clean up tu_perfetto.h +- turnip: rename some tracing stages +- turnip: add cmd_buffer tracepoint +- turnip: improve tracing of secondary cmd buffers +- ir3: fix predicate splitting in scheduler +- util/perf: move u_perfetto to here +- util/perf: add cpu_trace.h +- util/perf: fix multiple tracepoints in a scope +- util/perfetto: add a simple C wrapper for track events +- util/perf: support and prefer perfetto for cpu trace +- venus: use MESA_TRACE_* +- pps: enable track_event in system.cfg +- docs: update perfetto with the latest status +- vulkan/wsi: add cpu tracepoints to swapchain functions +- vulkan/wsi: add cpu tracepoints for internal waits +- turnip: add some cpu tracepionts +- freedreno: get cpu timestamp again after FD_TIMESTAMP +- util/perf: ignore ut->enabled for iterators +- util/perf: remove u_trace::enabled +- util/perf: add u_trace_instrument +- util/perf: simplify u_trace_instrument +- util/perf: use ALWAYS_INLINE for tracepoints +- turnip: reduce MAX_STORAGE_BUFFER_RANGE +- vulkan: update ALLOWED_ANDROID_VERSION for api level 33 +- subprojects: uprev perfetto to v29.0 +- turnip: fix error handling for tu_queue_init +- turnip: add tu_physical_device::submitqueue_priority_count +- turnip: add support for VK_KHR_global_priority +- turnip: use vk_buffer +- turnip: use vk_descriptor_set_layout +- vulkan: update comments to device enumeration callbacks +- turnip: fix kgsl tu_enumerate_devices return code +- turnip: advertise VkExternalFenceProperties correctly +- venus: fix VkImageFormatListCreateInfo for AHB +- freedreno/pps: use 64-bit reads when possible +- freedreno/pps: loop countables by references + +Christian Gmeiner (18): + +- lima/ci: There is no need to redefine HWCI_TEST_SCRIPT +- lima/ci: Make use of .lava-piglit +- isaspec: Move isa_decode(..) declaration +- ci: include etnaviv support in ARM64 container +- ci: switch to mkbootimg.py +- ci/bare-metal: introduce BM_MKBOOT_PARAMS +- ci/etnaviv: add GC7000 support +- etnaviv: Remove possibility to force MSAA +- etnaviv: rs: Color resolve works +- etnaviv: rs: Directly call etna_try_rs_blit(..) +- etnaviv: rs: msaa: Set cache mode for SMALL_MSAA +- etnaviv: Update headers from rnndb +- etnaviv: blt: Add color resolve support +- etnaviv: blt: Directly call etna_try_blt_blit(..) +- Revert "etnaviv: completely turn off MSAA" +- etnaviv: Support negative float inline immediates +- etnaviv: nir: lower extract byte/word +- etnaviv: Hide MSAA support behind debug flag + +Christophe (2): + +- zink: add profile documentation +- Zink: add Zink profiles file + +Connor Abbott (49): + +- tu: Decouple vertex input state from shader +- tu: Implement VK_EXT_vertex_input_dynamic_state +- tu: Fix sysmem depth attachment clear flushing +- ir3/spill: Fix extracting from a vector at the end of a block +- nir: Free instructions more often +- tu/lrz: Fix multiple subpass case with secondaries +- tu/lrz: Fix multiple depth attachment case with secondaries +- tu: Fix descriptor set size bounds +- tu: Don't preload variable-count descriptors +- tu: Initial implementation of VK_EXT_inline_uniform_block +- tu: Also set SP_FS_MRT_REG for unused attachments +- tu: Fix case where vertex input is set but not vertex buffers +- tu: Remove unused need_indirect_descriptor_sets +- tu: Group pipeline state based on VkPipeline*StateCreateInfo +- tu: Rename RB_MSAA_CNTL to RB_BLIT_GMEM_MSAA_CNTL +- tu: Always set RB_BLIT_GMEM_MSAA_CNTL when clearing GMEM +- tu: Make MSAA emission always dynamic +- tu: Split out primtype calculation for tess +- ir3: Don't use msaa key for a6xx+ +- tu: Nuke tu_pipeline_cache +- tu: Don't use layer_zero/view_zero in shader key +- tu, ir3: Move multi_pos_output to ir3_shader_variant +- tu: Don't use output state to compute render components +- tu, ir3: Determine dual-src blend from shader for FS outputs +- tu: Emit \*_OUTPUT_CNTL1 as part of blend state +- tu: Move no_earlyz computation to blend/msaa state +- tu: Split up prim order computation +- tu: Use vk_pipeline_shader_stage_to_nir +- tu: Abstract driver-specific const state +- tu: Initial implementation of VK_EXT_graphics_pipeline_library +- tu: Fix maxPerStageDescriptorUpdateAfterBindInputAttachments +- tu: Set textures_used for input attachments correctly +- ir3/analyze_ubo_ranges: Account for reserved consts +- Revert "freedreno,ir3: rename Z_CLAMP_ENABLE to Z_CLIP_DISABLE" +- freedreno/a6xx, tu: GRAS_CL_CNTL::UNK5 is Z_CLAMP_ENABLE +- tu: Fix setting RB_DEPTH_CNTL::Z_CLAMP_ENABLE +- ir3: Delete outputs from fixup_regfootprint() +- ir3: Move fixup_regfootprint() to ir3_collect_info() +- ir3/analyze_ubo_ranges: Account for reserved consts better +- ir3: Add missing cat5 encoding to asm parser +- tu: Fix param_stride placement +- tu: DS primitive stride does not use patch control points +- tu: Implement extendedDynamicState2PatchControlPoints +- freedreno: Add LABEL flush +- tu: Add compute shader instrlen workaround +- tu: tu: Clear patchControlPoints dirty state with static patchControlPoints +- tu: Use right enum for compute active_shader_stages +- tu: Fix binding NULL descriptor sets +- ir3: Don't save/restore disasm string pointer + +Constantine Shablya (3): + +- nir: add a pass to remove non-uniform access qualifier when the operands are uniform +- radv: use nir_opt_uniform_access +- anv: use nir_opt_uniform_access + +Corentin Noël (9): + +- virgl/ci: Update virglrenderer +- ci: disable the freedreno farm. +- virgl/ci: Update virglrenderer +- gallivm: avoid the use of an uninitialized value +- virgl: Add TEXTURE_SHADOW_LOD capability support +- virgl: Add nir_lower_tex pass +- mesa/main: do not copy the exact size of the string +- gallium/hud: use snprintf(..., "%s", ...) instead of strncpy +- ci: Remove Wno-error=stringop-truncation from debian-arm64-asan + +Damian Korczowski (1): + +- d3d12: Fix placed footprint of subresources + +Daniel Schürmann (20): + +- nir/opt_algebraic: a & ~a -> 0 +- aco: use std::vector::reserve() more often +- aco/live_var_analysis: implement faster merging of live_out sets for some cases +- aco/optimizer: disallow can_eliminate_and_exec() with s_not +- aco/optimizer: do can_eliminate_and_exec() optimization later +- aco/optimizer: optimize s_and(exec, s_and(x, y)) more aggressively +- aco/optimizer: change inverse_comparison in-place +- radv/rt: create separate radv_rt_pipeline struct +- radv/rt: inline radv_rt_pipeline_create_() helper into radv_rt_pipeline_create() +- aco: simplify operands_offset calculation in create_instruction() +- aco: implement custom memory resource +- aco: use monotonic_buffer_resource for instructions +- aco: implement allocator_traits for monotonic_allocator<T> +- aco/opt_value_numbering: use monotonic_allocator for unordered_map +- aco/spill: Fix spilling of Phi operands +- aco/ra: only rename fixed Operands if the copy-location matches +- aco: change thread_local memory resource to pointer +- nir: add AMD RT traversal intrinsics +- radv/rt: create traversal shader independent from main shader +- radv/rt: overwrite hit args with undef in case of a miss + +Daniel Stone (4): + +- ci: Use mold for x86-64 and AArch64 builds +- CI: Collabora farm down for maintenance +- CI: Re-enable Collabora devices +- Revert "panfrost/ci: Disable t720 jobs" + +Danilo Krummrich (10): + +- nv50/ir/nir: add conversion ops for bit width < 32 +- nv50/ir/nir: convert to 32 bit for all OP_SET opcodes +- nv50/ir/nir: avoid 8/16 bit dest regs for OP_MOV +- nv50/ir: add isUnsignedIntType() and isIntType() helpers +- nv50/ir/nir: convert 8/16 bit src to 32 bit for {i,u}2f64 +- nv50/ir/nir: always round towards zero for f2i/f2u +- nv50/ir: add intermediate conversion for f2{i,u}{8,16} +- nv50/ir: split and cvt 64bit integers for {i,u}2{i,u}{8,16} +- nv50/ir: handle S8/S16 integers converting to S64 +- nv50/ir: handle U8/U16 integers converting to U64 + +Danylo Piliaiev (26): + +- tu: Flush depth on depth img transition from undef layout +- tu: Disable LRZ write when alpha-to-coverage is enabled +- freedreno: Disable LRZ write when alpha-to-coverage is enabled +- tu: Assert that if draw state is enabled it has valid iova +- tu: Switch to userspace iova allocations if kernel supports it +- tu: Enable bufferDeviceAddressCaptureReplay +- tu: Disable userspace iova allocations +- tu: Implement VK_EXT_attachment_feedback_loop_layout +- freedreno: PC_SO_STREAM_CNTL_STREAM_ENABLE has per-stream enable bits +- tu: Fix streamout with tess_use_shared +- tu: Update HS_WAVE_INPUT_SIZE formula +- util/u_trace: Add locking for ctx_list +- freedreno: Name more \*_DBG_ECO_CNTL regs +- freedreno: Add all variable magic regs to device-info tables +- tu: Use newly obtained magic reg values +- tu: Trivially expose VK_EXT_rasterization_order_attachment_access +- ir3: Prevent reordering movmsk with kill +- turnip: implement VK_EXT_multi_draw +- turnip: Implement VK_EXT_mutable_descriptor_type +- tu: Expose Vulkan 1.3 +- tu: Do not DCE unused output vars used for transform feedback +- tu: Lazily init VSC to fix dynamic rendering in secondary cmdbufs +- tu: Reset whether there is DS resolve for dynamic subpass +- tu: Fix the size of patch control points state +- freedreno: Add 'replay' tool which allows to replay cmdstreams +- freedreno/fdl: Increase alignment for UBWC images + +Dave Airlie (85): + +- gallivm: fix printf hook for cached shaders. +- llvmpipe: add shader clock support +- lavapipe: enable shader clock +- draw: fix up jit type creation for gs/tcs/tes +- llvmpipe/gallivm/draw: introduce a buffer type. +- zink: add defines for the shader stage templates. +- virgl: abstract virgl shader stages from pipe shader stages. +- svga: compare shader type against compute not tess eval. +- gallium/tgsi: reorder pipe shader type defines. +- gallium/iris/crocus: collapse a bunch of conversion functions. +- gallium: fixup some inconsistent uses of enum pipe_shader_type. +- gallium: use gl shader types as the basis for the gallium ones +- gallivm/nir/st: lower image derefs in advance. +- lavapipe: lower images to non-derefs in vulkan side +- lavapipe: scan shader for info before lowering images. +- llvmpipe/ci: update ci results for clover. +- draw: don't touch info values that aren't valid. +- nir_to_tgsi_info: drop const_buffers_declared +- crocus: sync performance monitor code with iris. +- vulkan: update rest of the headers to v1.3.225 +- gallivm: drop unused parameter to lp_build_sample_aos +- gallivm/sample: remove unused base parameter from dynamic callbacks. +- gallivm: don't indirect image/sampler destroy. +- llvmpipe: finish rendering before flushing frontbuffer resources. +- radv: move to using common record_result. +- turnip: use common command record result. +- panvk: use common command buffer record result. +- dozen: move to use common command buffer result code. +- lavapipe: fix 3d depth stencil image clearing. +- gallivm: add atomic 32-bit float support +- llvmpipe: export GL_NV_shader_atomic_float +- lavapipe: export VK_KHR_shader_atomic_float +- docs: add new llvmpipe/lavapipe atomic float extensions +- lavapipe: use slow clear path if conditional rendering. +- llvmpipe: handle different blocksizes for sampler/image views +- lavapipe: take layers into account around render surface creation. +- lavapipe: handle view mask layer count properly for begin rendering. +- llvmpipe: memset empty images in the key +- llvmpipe: bind sampler views/images properly. +- gallivm: handle llvm coroutines for llvm > 15 +- gallivm: fix buffer_type for LLVM 15 +- lavapipe: add fmin/fmax to image lowering. +- gallivm/nir: fix fmin/fmax translation +- gallivm/nir: bitcast when non-float ptr type. +- llvmpipe/fs: start passing explicit context pointer type. +- llvmpipe/fs: convert thread data ptr to opaque ptr friendly apis +- llvmpipe/fs: convert linear context to opaque pointers friendly +- llvmpipe/cs: convert cs context to opaque friendly api +- llvmpipe/cs: convert cs thread data to opaque friendly api +- llvmpipe/fs: fix invocations access for opaque ptrs. +- llvmpipe/setup: move setup code to explicit pointers. +- llvmpipe/cs: move compute code to explicit pointer types +- gallivm/struct: add opaque ptr friendly pointer accessors. +- llvmpipe/fs: add mask_type for mask_store accesses. +- llvmpipe/fs: pass explicit mask_type into interp code. +- llvmpipe/fs: port depth code to opaque pointer api +- llvmpipe/fs: add sample position type to the interp interface +- llvmpipe/fs: pass mask type to alpha to coverage handler +- llvmpipe/fs: use explicit api in viewport code. +- llvmpipe/fs: handle explicit types around blending and c/zs bufs calcs +- llvmpipe/fs: cleanup some remaining mask handling and reuse types +- llvmpipe/fs: convert outputs handling to explicit api +- gallivm/nir: move to explicit pointer interfaces. +- gallivm/nir: fixup atomic path for explicit pointers. +- gallivm/sample: move border color load to explicit pointers +- gallivm/llvmpipe: hand sample position type in for loading. +- llvmpipe/fs/linear: port to explicit pointers. +- draw: fix gs/tes explicit pointer handling +- draw: handle primid for tess being a different pointer type +- gallivm/tgsi: port tgsi to explicit types. +- gallium/tgsi: handle temps/outputs array. +- vulkan: update beta and video headers to 1.3.230 +- llvmpipe/tests: port to new pointer interfaces. +- gallivm/llvmpipe: add opaque pointers support to sampler +- draw/llvmpipe: move texture/sampler/image member load to opaque. +- draw/llvmpipe: add way to return pointer types to generic code. +- gallivm/sample: use retrieved types to do opaque pointer loads. +- gallivm: remove legacy pointer_get apis +- gallivm/sample: move some first_level/last_level calcs out +- gallivm/nir: drop some unused struct members. +- gallivm/sample: refactor multisample offset calcs code. +- glsl/types: fix dword slots calc for float16 matricies. +- clover/llvm: add kernel arg info for opaque pointers +- nir/lower_bool_to_int32: add support for lowering functions. +- gallivm: add coro malloc hooks earlier and always. + +David Heidelberg (71): + +- ci: compress LAVA rootfs with zstd instead of gzip +- ci: replace gzip usage with zstd where posible +- ci/bin: ignore Gitlab GQL cache +- ci: Turn off the entire Lima farm (devices report out-of-space) +- ci: move DXVK instalation outside of x86_test-vk +- ci: make shellcheck happy on dxvk script +- ci: separate wine setup into own script +- ci: Windows runner is experiencing DNS issues; disable Microsoft farm +- ci: prepare piglit-traces for WINE and DXVK +- ci/lavapipe: implement traces +- docs/ci: show how to run traces locally +- ci/lavapipe: wine flakes, switch to manual for now +- ci: use apt-get in x86_test-vk instead of apt +- ci: do not install i386 sub-architecture in VK images +- ci: traces: update freedreno and broadcom to brotli compressed traces +- ci: GraphQL was already implemented in ci_run_n_monitor.py +- ci: implement stress testing jobs in ci_run_n_monitor.py +- ci: fix leftover tag in image-tags.yml +- ci: rename debian-gallium to debian-build-testing +- ci: move lava-test into debian-build-testing +- ci: make shellcheck happy about .gitlab-ci/container/ directory +- ci: use shellcheck for .gitlab-ci/container/ directory +- ci: drop last bits of wine32 +- ci: bump cross compilation to LLVM 13 where possible +- ci: uprev piglit 2022-08-30 +- ci: set venus on lavapipe to manual due to flakes +- ci: print env as other setups do +- ci/lava: print set-job-env-vars.sh as other setups do +- ci/traces: do not keep a cache, which will be wiped a few steps later +- ci/test: collapse printing env +- ci/lava: collapse printing env +- ci/freedreno: convert traces to new YAML format +- ci/virgl: convert traces to new YAML format +- ci/llvmpipe: convert traces to new YAML format +- ci/lavapipe: convert traces to new YAML format +- ci/broadcom: convert traces to new YAML format +- ci/panfrost: convert traces to new YAML format +- ci/radeonsi: convert traces to new YAML format +- ci/intel: convert traces to new YAML format +- ci/i915: convert traces to new YAML format +- ci/crocus: convert traces to new YAML format +- ci: check traces.yml files with yamllint +- ci: performance traces: make use of no-perf label +- ci: export whole results/ directory for all traces jobs +- ci: split ci_run_n_monitor into script and shared parts +- ci: introduce update_traces_checksum.py +- ci: uprev piglit 2022-09-08 +- ci: use xargs instead of find -exec +- ci: add jq utility +- ci/traces: remove first line with YAML version to prevent failure +- ci/radeonsi: drop glmark2 terrain trace +- ci/panfrost: drop glmark2 terrain trace +- ci/intel: drop glmark2 terrain trace +- ci/lava: add wine and apitrace into amd64 rootfs container +- ci/lava: prepare wineprefix for GL and DXVK +- ci/dxvk: fix DXVK 64-bit only wine setup +- ci: also handle default wine rootfs (for testing) +- ci/traces: prevent showing wine instalation dialog +- ci/radeonsi: add traces for radeonsi Zork +- ci/amd: move radeonsi in amd directory +- ci: uprev DXVK to 1.10.3 +- ci/traces: Blender demo (Cube Diorama) flakes on Intel APL +- ci/freedreno: add Guilherme Gallo into restricted traces access list +- ci/panfrost: deduplicate gitlab-ci.yml +- ci/panfrost: enable piglit-gl on g52 again and deparalelize +- ci/panfrost: Humus Portal trace got fixed, update checksum +- ci/panfrost: re-enable traces on t760 +- ci/linker: update mold to 1.6 and enable on s390x +- ci/zink: rename zink job to zink-lvp to better describe it +- ci/freedreno: disable antichambers trace +- ci/update_traces_checksum.py: check if checksum is in the array, not it's value + +David Riley (1): + +- drm-shim: Allow drm-shim to work with glibc fortify. + +Dawn Han (11): + +- Update venus-protocol to add extension \`VK_VALVE_mutable_descriptor_type` +- Increase enum type in vn_descriptor_set. +- Update bitset when init descriptor layout. +- Track bitset when create descriptor pool +- Enable .VALVE_mutable_descriptor_type in physical_device +- venus: support the promoted VK_EXT_mutable_descriptor_type +- venus: Update mutable descriptor struct type +- venus: extend lifetime of push descriptor set layout +- venus: extend VkPipelineLayout lifetime for batched VkCmdPushConstants() +- venus: implement vkCmdPushDescriptorSetWithTemplateKHR +- venus: enable VK_KHR_push_descriptor + +Diogo Ivo (1): + +- nouveau: treat DRM_FORMAT_INVALID as implicit modifier + +Dmitry Baryshkov (1): + +- freedreno/registers: update hdmi registers to add more 8x74 regs + +Dmitry Osipenko (11): + +- util/disk_cache: Add option to disable compression +- util/disk_cache: Enable testing of uncompressed caches +- util/disk_cache: Append to disk_cache_load_cache_index() with _foz +- util/disk_cache: Move struct cache_entry_file_data to the disk_cache_os.h +- util/disk_cache: Make disk_cache_os.h usable by C++ code +- util/disk_cache: Add new mesa-db cache type +- virgl: Fix unmapping of blob resources +- llvmpipe: Align persistent mappings to page size +- ci: Bump crosvm version +- ci: Update virgl-iris-traces checksums +- ci/virgl: Enable virgl-iris-traces + +Dylan Baker (47): + +- docs: reset new features +- docs: add release notes for 22.1.5 +- docs: Add sha256 sum for 22.1.5 +- docs: add release notes for 22.1.4 +- docs: Add sha256 sum for 22.1.4 +- docs: update calendar and link releases notes for 22.1.4 +- docs: update calendar and link releases notes for 22.1.5 +- docs: update calendar for 22.2.0-rc1 +- docs: add release notes for 22.2.0 +- docs: Add sha256 sum for 22.2.0 +- docs: update calendar and link releases notes for 22.2.0 +- docs: Add calendar entries for 22.2 release. +- intel/genxml: use a single pattern for res +- intel/genxml: use a set for lookups +- intel/genxml: don't use parens with python assert statement +- intel/genxml: add type annotations to gen_sort_tags.py +- intel/genxml: use \`a not in {x, y}` instead of \`a != x or a != y` +- intel/genxml: remove unused variable in gen_sort_tags.py +- intel/genxml: don't overwrite the input of gen_sort_tags in place +- intel/genxml: add 'nonzero' to gen_sort_tags +- intel/genxml: add a validation mode to gen_sort_tags +- intel/genxml: run gen_sort_tags on all of the xml +- meson: run genxml sort tests +- docs: add release notes for 22.2.1 +- docs: Add sha256 sum for 22.2.1 +- docs: update calendar and link releases notes for 22.2.1 +- intel/genxml: write a space before the '/>` closing tag +- intel/genxml: re-process with space before \`/>` +- intel/genxml: remove newlines between xml islands +- intel/genxml: re-process with extra whitespace removed +- intel/genxml: use ElementTree.write and .indent +- intel/genxml: reprocess xml with elementree output +- intel/genxml: split the processing code up +- intel/genxml: use a main() function +- intel/genxml: use Path.replace instead of unlink and rename +- docs: add release notes for 22.2.2 +- docs: Add sha256 sum for 22.2.2 +- docs: update calendar and link releases notes for 22.2.2 +- docs: Extend calendar entries for 22.2 by 2 releases. +- util/indicies: move common static functions to private header +- util/indicies: use itertools.product instead of deeply nested loops +- util/indicies: write a file with u_indicies_gen.py +- util/indicies: simplify some of u_indicies_gen.py +- util/indicies: use itertools.product in u_unfilled_gen.py +- util/indicies: convert u_unfilled_gen.py to write a file +- util/glsl2spirv: fix type error in argument handling +- util/glsl2spirv: fix appending extra flags + +Ella Stanforth (2): + +- util: fix missing fcntl.h on musl +- vulkan: wsi: fix musl build + +Emma Anholt (108): + +- ci/bare-metal: Re-open serial and everything after test phase timeout. +- zink: Take the tiling path for resources with modifiers. +- zink: Replace the "optimal_tiling" flag with a "linear" flag instead. +- ci/zink: Re-enable a630 zink-on-turnip traces job using gbm as the backend. +- ci/swrast: Add some flakes I've noticed in the IRC channel. +- ci: Add testing of the khr-single tests. +- ci: Upgrade deqp-runner to 0.15.0. +- zink: Make sure that we keep the existing ici pNext chain on inserts. +- tu: Move the vkCreateImage format list checks to helper functions. +- tu: Treat viewFormatCount==0 as no format list. +- tu: Use the format list to decide whether we can do UBWC with MUTABLE. +- ci/tu+zink: Re-enable the traces that were asserting about UBWC compatibility. +- ir3: Suppress disasm of internal shaders unless IR3_SHADER_DEBUG=internal. +- tu: Only emit as many bindless regs as we have seen descriptor sets. +- tu: Only emit as many VPC interp/repl regs as will be referenced. +- tu: Only emit FS output regs for as many MRTs as we have (but at least 1). +- tu: Emit only as many VBs as we've ever seen bound on the command buffer. +- ci/tu: Add another cwe_after_bind fail that appeared in the reshuffle. +- ci: Move 'never' rules includes above "on_success" rules includes. +- ci: disable the freedreno farm. +- freedreno/regs: Add a bit of documentation of what SKIP_IB2 does. +- turnip: Fix enabling of IB2 skipping. +- nir/lower_mediump: Lower FS outputs to 16-bit when the value was upconverted. +- nir_to_tgsi: Add support for TG4 with explicit offsets. +- nir_to_tgsi: Add support for 64-bit fsign/isign. +- nir_to_tgsi: Add support for demote, is_helper_invocation, and subgroup ops. +- nir: Make nir_lower_discard_if() handle demotes and terminates, too. +- nir_to_tgsi: Use nir_lower_discard_if for demote_if. +- nir_to_tgsi: Add support for bindless textures and images. +- turnip: Make sure bandwidth config per pixel starts from 0. +- freedreno/ir3: Move chip-specific nir compiler options to C code. +- freedreno/ir3: Switch to NIR for a3xx/a4xx's vertex id lowering. +- gallium,glsl: Delete PIPE_CAP_VERTEXID_NOBASE and lower_vertex_id. +- nir: Add a pass to lower mediump temps and shared mem. +- spirv: Mark phis as mediump instead of directly lowering them to 16 bit. +- turnip: Enable lowering of mediump temps/CS shared to 16-bit. +- turnip: Treating non-d/s-write pipelines as not having d/s feedback loops. +- mesa: Remove PIPE_CAP_CS_DERIVED_SYSTEM_VALUES_SUPPORTED/lower_cs_derived. +- zink: Don't upload shader immediate arrays through UBO 0. +- zink: Don't lower indirect derefs of temp arrays. +- Revert "ci: disable the freedreno farm." +- ci: Add support for Jetson TK1. +- nouveau: Fix compiler warnings about silly address checks in ir_print. +- ci/turnip: Add missing a618 full-run bypass fails. +- turnip: Only emit descriptor loads for active stages in the pipeline. +- freedreno/ir3: Consistently lower mediump inputs to 16-bit (when we can). +- nir/lower_mediump_vars: Don't lower mediump shared vars with atomic access. +- mesa: Lower mediump temps and CS shared when the driver supports FP16+INT16. +- turnip: Add some missing LRZ disable debug. +- turnip: Ignore dynamic color write enables past our number of attachments. +- turnip: Don't look at RB.Z_READ_ENABLE for setting LRZ.Z_TEST_ENABLE. +- turnip: Skip rather than invalidate LRZ on gl_FragDepth writes. +- turnip: Ignore pDescriptorCounts[] for non-variable-count layouts. +- turnip: Keep a host copy of push descriptor sets. +- ci/turnip: Use all 9 a618 runners at once for VK testing. +- ci/turnip: Bump the full-run a618 runner count to 3. +- ci/turnip: Re-enable the compressed cubemap tests. +- ci/turnip: Allow running spirv_ids_abuse in full VK runs. +- ci/turnip: Drop a couple of spillall skips. +- ci/turnip: Move some 15-second-ish test skips to pre-merge skips. +- ci/nouveau: Drop BM_POE_TIMEOUT. +- ci/nouveau: Rearrange job setup variables. +- ci/nouveau: fix up Jetson Nano +- ci/nouveau: Update gm20b xfails. +- ci: Update kernel to drm-next-2022-08-12-1 +- turnip: Fix leak of autotune fence BOs. +- turnip: Add TU_DEBUG=bos to print stats of BOs live at submit time. +- turnip: Fix the "written stencil is unmodified" check. +- turnip: Use the simplified stencil write flags for the LRZ-allowed check. +- ci/zink+turnip: Add a manual full run of the dEQP CTS. +- turnip: Fix busy-waiting on syncobjs with OS_TIMEOUT_INFINITE. +- util/dag: Move the callback function pointer to the state. +- util/dag: Add a validation function. +- freedreno/ir3: Validate our scheduling DAGs after construction. +- nir/nir_opt_copy_prop_vars: Don't leak dynarray memory during the pass. +- nir/vars_to_ssa: Always do OOB load/store removal. +- glsl: Remove lower_output_reads. +- glsl: Remove do_set_program_inouts. +- glsl: Remove opt_array_splitting. +- glsl: remove opt_structure_splitting. +- glsl: Remove lower_vec_index_to_swizzle. +- ci/iris: Update iris traces checksums. +- turnip: Don't use the dynamic color write enable during non-dynamic. +- ci/zink: Clear stale xfails for turnip. +- zink: Skip border color clamping for compressed formats. +- zink: Enable ASTC texture format translation. +- zink: Map ETC1 to ETC2 to avoid uncompressing in the frontend. +- ci/zink: Add some more flakes for turnip. +- zink: Fix RelaxedPrecision decoration of texture samples. +- zink: Decorate mediump outputs as RelaxedPrecision. +- turnip: Add a perf_debug for feedback-related performance traps. +- turnip: Move the ubwc_possible check before mutable formats. +- turnip: Add perf_debug for UBWC being disabled due to mutable formats. +- turnip: Add perf debug for more UBWC-disable cases that we could support. +- zink: Fix dummy CB path decision for VK_EXT_cwe presence. +- turnip: Be sure we blit depth, not stencil, for Z32FS8 -> Z32F resolves. +- nir/opt_phi_precision: Fix missing swizzles when narrowing phi srcs. +- turnip: Enable LRZ testing (not writing) in the presence of discards. +- ci/freedreno: Update CivV trace expectation. +- ci/freedreno: Update known flakes/timeouts. +- turnip: Fix reservation for indirect compute's IR3_DP_SUBGROUP_ID_SHIFT. +- zink: No need to use a 2-sample dummy image for bindless without null descs. +- zink: Lazily allocate the dummy surfaces. +- zink: Enable mesa/st frontend shader caching. +- rusticl: Fix the invalid memory migration flags check. +- iris: Disable GLSL lower_const_arrays_to_uniforms. +- gallium: update docs about PIPE_CAP_PREFER_IMM_ARRAYS_AS_CONSTBUF. +- ir3/ra: Make sure we don't pick a preferred reg overflowing the file. + +Eric Engestrom (56): + +- bin/gen_release_notes.py: bump advertised vulkan version to 1.3 +- pvr: fix memleak in error paths +- vk/device-select-layer: fix .sType of VkPhysicalDeviceGroupProperties +- docs/features: add missing extensions supported by anv +- docs/features: add missing extensions supported by lavapipe +- docs/features: add missing extensions supported by radv +- docs/features: add missing extensions supported by turnip +- docs/features: add missing extensions supported by venus +- docs/features: start tracking extensions supported by dozen +- docs/features: start tracking extensions supported by panvk +- docs/features: start tracking extensions supported by powervr +- docs/features: drop unsupported VK_KHR_display & VK_KHR_get_display_properties2 for lavapipe +- ci: rename GitHub CI workflow from \`ci` to \`macos` to be a bit more explicit about what it covers +- meson: drop duplicate \`lib` in lib name +- wsi/x11: fix memleak in wsi_x11_connection_create() +- aco: drop unused variable +- anv: convert assert into unreachable to avoid fallthrough error +- meson: replace manual compiler flags with meson arguments +- vc4: introduce VC4_DBG() macro to make VC4_DEBUG checks consistent +- v3d: introduce V3D_DBG() macro to make V3D_DEBUG checks consistent +- wayland: avoid writing interface names as free-form strings +- vc4: drop dead qir_POW() helper +- v3dv: implement VK_EXT_shader_module_identifier +- docs: move rusticl readme into docs/ +- broadcom: fix dependencies in static_library() calls +- docs: add release notes for 22.1.6 +- docs: update calendar and link releases notes for 22.1.6 +- docs: add release notes for 22.1.7 +- docs: update calendar and link releases notes for 22.1.7 +- ci: unexport local variable (and fix formatting) +- ci: bump mold to 1.5 +- docs/release-calendar: add 22.3 schedule +- v3dv: drop error overwrite +- vc4: don't lower uadd_carry +- vc4: don't use imprecise_32bit_lowering for idiv lowering +- vc4: pre-compile shaders to make up for the lack of draw calls in shader-db +- vc4: use PIPE_MASK_RGBA name instead of its value 0xf +- vc4: send shaderdb output through KHR_debug +- vc4: mark piglit copypixels-(draw-)sync as flaky +- VERSION: bump to 22.3.0-rc1 +- .pick_status.json: Update to 1174f376096ed6ceebb0fb2810456f1501a68df7 +- zink: add missing build_id linker args +- vk/runtime: drop incorrect UNUSED annotation +- v3dv: avoid freeing already-freed memory +- VERSION: bump for 22.3.0-rc2 +- .pick_status.json: Update to 9bd11f65238ce101bf846f5528f9088630e983f7 +- ci: avoid triggering vc4 & v3d tests on v3dv-only MRs +- VERSION: bump for 22.3.0-rc3 +- .pick_status.json: Update to f2e535e4fed5bdc13e11a443316a4b77cd5eb81a +- v3d(v): account for debug flags when using the cache +- .pick_status.json: Mark 56bd81ee21cb42e4a5afa2939514b570a1283754 as denominated +- VERSION: bump for 22.3.0-rc4 +- .pick_status.json: Update to 22be0d09a005b4b955a46b65a919cfd786d6814f +- .pick_status.json: Mark 470fbb35efe1935242b346427ec0fa22b40fff2b as denominated +- commit_in_branch.py: add support for checking staging branches +- docs: add release notes for 22.3.0 + +Erico Nunes (1): + +- ci: enable lima farm + +Erik Faye-Lund (226): + +- docs: fixup link to virgl docs +- zink: add have_D32_SFLOAT_S8_UINT boolean +- zink: do not use VK_FORMAT_D32_SFLOAT_S8_UINT without checking +- zink: remove needless check +- glsl/tests: do not perform eol-conversion on windows +- zink: type_main -> type_void_func +- zink: add spirv_builder_function_call +- zink: wrap discard in a function +- zink: fix conditions for draw-parameters +- zink: fixup indent in caps-check +- mesa/main: simplify implmementation of _mesa_is_format_etc2 +- mesa/main: simplify implementation of _mesa_is_format_astc_2d +- mesa/st: simplify st_compressed_format_fallback +- mesa/st: avoid double-mapping if both images are the same +- mesa/st: do not use memcpy when using compressed fallback +- mesa/st: add context-flag for bptc-support +- mesa: add format-helper for bptc +- mesa: add _mesa_unpack_bptc-function +- mesa/st: implement fallback for bptc +- mesa/st: enable bptc extension with fallback +- zink: allow X32 -> A32 +- mesa/main: remove always-true define +- mesa/main: add fp16-versions of bptc decompression functions +- mesa: fall-back to fp16 for bptc-textures +- gallium/u_transfer_helper: clean up boolean flags +- gallium/u_transfer_helper: add in-place interleave flag +- zink: use full transfer-helper api +- gallium/u_transfer_helper: hide deinterleave helpers +- mesa/st: also override mip filter for integer textures +- mesa/st: always force integer filtering to nearest +- mesa/st: force nearest-filtering for fp32 textures +- zink: request forcing of fp32-samplers to nearest when needed +- mesa/st: do not blit when using compressed fallback +- zink: clamp miplodbias when creating sampler +- zink: add ntv support for demote +- zink: add spirv_builder support for is_helper_invocation +- zink: add ntv support for is_helper_invocation +- zink: expose demote when supported +- zink: lower discard/demote depending on derivatives +- Revert "Revert "ci: set venus on lavapipe to manual due to flakes"" +- mesa/st: use _mesa_generate_mipmap for fallback-formats +- mesa/st: do not fallback to srgba formats for dxt +- util: allow unpacking less than a block from dxt +- mesa/st: add context-flags for s3tc +- mesa: add format-helpers for s3tc +- mesa/main: add _mesa_unpack_s3tc +- mesa/st: implement fallback for s3tc +- mesa/st: enable s3tc extensions with fallback +- v3d: do not pretend to fake rgtc-support +- vc4: respect z-offset in tiled blits +- vc4: do not attempt to do deep tiled blits +- docs/zink: remove bptc from required formats for gl4.2 +- docs/zink: document rgtc requirement +- mesa/st: add context-flag for rgtc +- mesa: add format-helper for rgtc +- util/format: fix broken indentation +- util/format: allow unpacking less than a block from rgtc +- util/format: implement rgtc -> r8 / r8g8 unpack +- mesa/main: add _mesa_unpack_rgtc +- mesa/st: implement fallback for rgtc +- mesa/st: do not fall back to uncompressed for rgtc +- mesa/st: enable rgtc extension with fallback +- panfrost: do not fake rgtc-support +- docs: update staus of mark GL_ARB_texture_compression_bptc +- lima: don't store width in resource-level +- lima: do not align width/height for non-shared resources +- mesa/st: add context-flag for latc +- mesa: add format-helper for latc +- mesa/main: add support for latc in _mesa_unpack_rgtc +- mesa/st: implement fallback for latc +- mesa/st: do not fall back to uncompressed for latc +- mesa/st: enable latc extensions with fallback +- freedreno: do not fake rgtc-support +- u_transfer_helper: rip out fake_rgtc code +- mesa/st: always use normalized coords for samplers +- mesa/st: always use normalized coords for samplers +- mesa/st: always use normalized coords for samplers +- zink: assert on unnormalized texcoords +- docs: add an amber article +- zink: emulate latc formats with rgtc +- zink: fix luminance clears +- zink: fixup indent +- zink: refactor swizzle-clamping code +- zink: fix luminance/luminance-alpha emulation +- zink: correct emulation of luminance-alpha formats +- ci/crocus: update ci results +- mesa/main: fix broken indent +- mesa/main: don't copy signed rgtc via float +- docs: fixup broken rst syntax +- gallium/u_threaded_context: remove stale comment +- gallium: normalized_coords -> unnormalized_coords +- zink: fix broken pool-alloc consolidation +- zink: fixup dynarray-type +- zink: use util_dynarray_clear +- mesa/st: rip out point-sprite cap +- mesa/main: remove driver-cap for ARB_point_sprite +- mesa/st: move static asserts out of translate_prim +- mesa/st: remove translate_mode +- docs: correct spelling of "implemented" +- docs: correct spelling of "attempts" +- docs: correct spelling of "bug fixes" +- docs: correct spelling of "specified" +- docs: correct spelling of "hierarchical" +- docs: correct spelling of "implementation" +- docs: correct spelling of "to" +- docs: correct spelling of "Khronos" +- docs: correct spelling of "choosing" +- docs: correct spelling of "subtly" +- docs: correct spelling of "Raspbian" +- docs: correct spelling of "shareable" +- docs: correct spelling of "simplification" +- docs: correct spelling of "synced" +- docs: correct spelling of "maximum" +- docs: correct spelling of "hierarchy" +- docs: correct spelling of "commutativity" +- docs: correct spelling of "compression" +- docs: correct spelling of "laid" +- docs: correct spelling of "average" +- docs: correct spelling of "addition" +- docs: correct spelling of "explicitly" +- docs: correct spelling of "rasterizer" +- docs: fractionary -> fractional +- docs: ie. -> i.e. +- docs: correct spelling of "correctly" +- docs: correct spelling of "traveling" +- docs: correct spelling of "freedreno" +- zink: rework primitive rasterization type logic +- zink: reuse rast_prim for line-rasterization check +- docs: correct spelling of "addition" +- docs: correct spelling of "passes" +- docs: correct spelling of "irrespective" +- docs: correct spelling of "semantic" +- docs: correct spelling of "specifies" +- docs: correct spelling of "Gouraud" +- docs: correct spelling of "Copr" +- docs: spell out "build directory" +- docs: spell out "SHA256 checksum" +- docs: spell out "environment variable" +- docs: spell out "distribution" +- docs: use the official spelling of SHA-1 +- docs: eg. -> e.g. +- docs: XOR'd -> XORed +- docs: consistently upper-case VC4, V3D etc +- docs: consistently spell DRI in all-caps +- docs: consistently spell ANV in allcaps +- docs: consistently spell URL in allcaps +- docs: consistently spell OK in allcaps +- docs: consistently use CMake +- docs: consistently spell API all-caps +- docs: consistently spell VirGL in camel-case +- docs: consistently spell Haswell in camel-case +- docs: use consistent casing for dEQP +- docs: ip -> IP +- docs: irc -> IRC +- docs: xorg -> X.Org +- docs: pci -> PCI +- docs: cpu -> CPU +- docs: gpu -> GPU +- docs: rgb -> RGB +- docs: perfetto -> Perfetto +- docs: lavapipe -> Lavapipe +- docs: freedreno -> Freedreno +- docs: adreno -> Adreno +- docs: radeon -> Radeon +- docs: nouveau -> Nouveau +- docs: llvmpipe -> LLVMpipe +- docs: vulkan -> Vulkan +- docs: lima -> Lima +- docs: ppir -> PPIR +- docs: autotools -> Autotools +- docs: ArchLinux -> Arch Linux +- docs: meson -> Meson +- docs: Gitlab -> GitLab +- docs: xyzw -> XYZW +- docs: hw -> HW +- docs: lava -> LAVA +- docs: skqp -> SkQP +- docs: servo -> Servo +- docs: Linux fbdev -> Linux Framebuffer +- docs: virtualisation -> virtualization +- docs: dependant -> dependent +- docs: healthcheck -> health check +- docs: vmware -> VMware +- docs: spell out "distribution" +- docs: spell out "transform and lighting" +- docs: de-duplicated -> deduplicated +- docs: de-nominate -> denominate +- docs: freedreno -> Freedreno +- docs: llvmpipe -> LLVMpipe +- docs: softpipe -> Softpipe +- docs: panfrost -> Panfrost +- docs: piglit -> Piglit +- docs: ini -> INI +- docs: swr -> OpenSWR +- docs: yml -> YAML +- docs: xml -> XML +- docs: Adreno aXXX -> Adreno XXX +- docs: remove non-existent directory +- docs: do not mention classic swrast +- docs: name correct driver +- docs: clean up labels +- docs: add missing link to mesa3d.org +- docs: add links to usenet groups +- docs: link to replacement forum +- docs: Android.mk -> ndk-build +- docs: gitlab -> GitLab +- docs: nVidia -> NVIDIA +- docs: spell ATI codenames in allcaps +- docs: nfs -> NFS +- docs: fixup rebase mistake +- zink: correct depth-bias enable condition +- zink: only set line-width if drawing lines +- zink: consider polygon-mode for rast_prim +- zink: do not read is_generated unless in tcs shader +- zink: put union fields into structs named by the shader-stages +- docs/zink: fix and cleanup rst syntax +- docs: fixup broken link syntax +- docs: do not mention EGL_MESA_drm_display +- docs: remove stale envvar-reference +- zink: fix json-errors in profile-file +- zink: remove needless requirements +- zink: fix incorrect requirements +- zink: update textureCompressionBC requirement +- docs/zink: add missing required device-feature +- mesa: treat unsupported queries as dummies +- d3d12: fix max-array-layers + +Fabian Vogt (1): + +- gallivm: Fix LLVM optimization with the new pass manager + +Feng Jiang (5): + +- virgl/vtest: fix memory overwrite problem in virgl_vtest_send_get_caps() +- gallium: add key size to the structure pipe_picture_desc +- vl: change vl_video_buffer_destroy() to non-static +- gallium/util: add a new helper function pipe_buffer_size() +- virgl: add support for hardware video acceleration + +Filip Gawin (5): + +- r300: add list of deqp gles2 r400 failures +- r300: add khr r400 failures +- gallium: avoid using float based conditions in loops +- r300: don't use smooth line if not requested +- r300: update r400 tests + +Francisco Jerez (4): + +- intel/fs: Fix horiz_offset() to handle FIXED_GRFs with non-trivial 2D regions. +- nir/lower_int64: Enable lowering of 64-bit float to 64-bit integer conversions. +- nir/lower_int64: Implement lowering of 64-bit integer to 64-bit float conversions. +- nir/lower_int64: Fix float16 to int64 conversions. + +Frank Binns (13): + +- pvr: get free list min size from the runtime info +- pvr: reset all command buffer state in pvr_cmd_buffer_reset() +- pvr: Implement vkResetCommandBuffer +- pvr: remove redundant TODO +- pvr: remove image pointer from image view struct +- pvr: set samplerAnisotropy to false +- pvr: cube map tex state packing fixes +- pvr: don't setup attachment tex state for images with input attachment bit unset +- pvr: finish render job sample count setup +- pvr: add required pixel formats +- pvr: remove implicit sync support +- pvr: setup buffer and image format feature bits +- CODEOWNERS: remove rajnesh-kanwal as an Imagination maintainer + +Friedrich Vock (28): + +- amd/common: move ac_memory_ops_per_clock into ac_gpu_info.h +- amd/common: Remove redundant code for determining memory ops per clock +- radv/winsys: Return VK_ERROR_INVALID_EXTERNAL_HANDLE if buffer imports fail +- radv: Generalize instance count in BVH states to leaf node count +- radv: Add metadata to acceleration structures +- radv: Add Radeon Raytracing Analyzer trace dumping utilities +- radv: Add Radeon Raytracing Analyzer capturing layer +- radv: Enable Radeon Raytracing Analyzer traces +- docs: Add documentation about RADV RRA tracing +- radv: Use radv_CmdUpdateBuffer instead of radv_update_buffer_cp for geometry info writes +- radv: Free geometry infos in radv_CmdBuildAccelerationStructuresKHR +- radv: Correct accel struct header size +- radv: Rename internal node shader to lbvh_internal +- radv: Split CmdBuildAccelerationStructuresKHR into different functions +- radv: Add BVH IR types +- radv: Add conversion shader for internal nodes +- radv: Add conversion shader for leaf nodes +- radv: Create pipelines for conversion meta shaders +- radv: Build acceleration structures using BVH IR +- radv: Rename emulated float helpers +- radv: Use a struct for AABBs +- radv: Add radv_indirect_unaligned_dispatch +- radv/rt: Fix internal converter synchronization +- radv/rt: Dispatch internal converter indirectly +- radv/rt: Track number of inactive leaf nodes +- radv: Add global sync utilities +- radv: Add REF as a typename macro to .clang-format +- radv: Use spirv1.5 instead of vulkan1.2 + +Georg Lehmann (40): + +- aco: Check that we don't override exec_val operands during branching sequence optimization. +- aco/assembler: Fix v_cmpx with SDWA. +- aco: Fix optimizing branching sequence with s_and_saveexec. +- aco/assembler: Fix v_cmpx pre GFX10. +- aco: Use v_cmpx pre GFX10. +- aco: Force tex operand to have the correct sub dword size before packing. +- nir: Add nir_ssa_scalar_is_undef. +- nir/fold_16bit_tex_image: Add an option to fold image sources. +- aco: Implement storage image A16. +- aco: Combine 16bit undef and constants instead of using s_pack. +- radv: Fold 16bit image sources. +- nir: Print selection control for nir_if. +- nir/opt_algebraic: Optimize check for single bit. +- nir/opt_algebraic: Optimize d3d9 pow with fmulz. +- aco: Fix image instructions with lod when 2d_view_of_3d is enabled on GFX9. +- ac/llvm: Fix image instructions with lod for 2d on GFX9. +- aco: Use plain VOPC for vcmpx when possible. +- nir/opt_algebraic: Optimize unpacking of upcasts to 64bit integers. +- radv: Enable VK_EXT_load_store_op_none. +- aco: Use v_fmaak/v_fmamk if two operands are the same literal. +- aco: Unswizzle v_pk_fma_f16 literals to produce more v_pk_fmac_f16. +- radv: Fix GLSL BDA struct alignment and use pointer arithmetic SIZEOF. +- nir/opt_algebraic: Optimize more (a cmp b ? a : b) to min/max. +- aco: Use s_pack_ll for s_bfe operand on GFX9+. +- nir: Print nir_selection_control_divergent_always_taken. +- nir/opt_algebraic: Mirror optimizations for find_msb_rev. +- aco: Implement [ui]find_msb_rev. +- ac/llvm: Implement [ui]find_msb_rev. +- nir/opt_algebraic: Add an option to lower uclz. +- radv,aco: Lower uclz in NIR. +- nir/opt_algebraic: Optimize various find_msb_rev patterns. +- nir: Fix ifind_msb_rev constant folding. +- aco: fmaak/fmamk can't use SDWA. +- aco: Don't use opsel for p_insert. +- ac/llvm: Implement signed idot on GFX11. +- aco: Implement signed idot instructions on GFX11. +- radv: Use available dot product instructions on gfx11. +- radeonsi: Use available dot product instructions on gfx11. +- aco: Use opsel for the third operand. +- aco: Use s_pack_ll_b32_b16 for scalar zero extend. + +Gert Wollny (95): + +- radeonsi-raven/ci: Move spec@arb_timer_query@timestamp-get to flakes +- llvmpipe: Double number of SSBOs +- r600/sfn: Schedule shift instruction on R600 in t-slot +- r600/sfn: Add GS thread fix just like the TGSI code path +- r600/sfn: Sort FS inputs to make interpolated values come first +- r600/sfn: Fix color outputs when color0 writes all +- r600/sfn: Initialize out buffer when printing op +- r600: Force NOPs when loading AR on R600 class hardware +- r600/sfn: Handle R600 scratch read +- r600: Don't use SB with R600 style scratch reads +- r600: Fix SCRATCH OP de-assembly +- nir_lower_atomics_to_ssbo: Initialize deref struct +- r600/sfn: Handle color0 writes all on R700 like on EG +- r600/sfn: Don't tag mem-ring and stream instructions as exports +- r600/sfn: Don't schedule GDS instructions early +- r600/sfn: Don't scan the whole block for ready instructions +- r600/sfn: Use a heuristic to keep SSBO setup and store close +- r600: Fix reporting TGSI IR support +- r600/sfn: Use a low number for unused target register +- virgl: Fix buffer overflow warning: +- virgl: Fix ubsan warnings: +- virgl: when reading back wait first, then do the transfer +- virgl/ci: remove some flakes +- r600/sfn: override register ID when it doesn't matter +- r600/sfn: ACK all image stores, also at end of loop +- r600/sfn: more peephole optimization with conditionals +- r600/sfn: Override VPM if access in helpers is requested +- r600/sfn: Don't scalarize fdd instructions +- virgl: Add some formats that the CTS uses +- mesa/glsl: Add support for NV_shader_noperspective_interpolation +- r600: set nir option lower_cs_local_index_to_id +- virgl: Report CONSTANT_BUFFER_SIZE according to GL_MAX_UNIFORM_BLOCK_SIZE +- r600/sfn: Lower tex,txl,txb and txf to backend +- r600/sfn: lower txd to backend in nir +- r600/sfn: lower txf_ms in nir +- r600/sfn: copy-propagate single source texture values +- r600/sfn: print tex prepare instructions +- r600/sfn: Don't allocate un-used components in texture ops +- r600/sfn: Add an easy access to get an instruction as ALU +- r600/sfn: Don't assert when setting one value +- r600/sfn: Copy propagate into TEX source +- r600/sfn: drop some unused code +- r600/sfn: copy propagate register load chains +- r600/sfn: VS inputs are effectively SSA +- r600/sfn: Add a free-channel mask when testing whether a register can switch channel +- r600/sfn: fix some channel pinning +- r600/sfn: Handle shifts on Cayman +- r600/sfn: only use 3 channels on Cayman for trans ops +- r600/sfn: Make sure texture lowering is done in the right order +- r600/sfn: lower tg4 to backend in NIR +- r600: Add slot to ALU disassambly +- r600: Make NIR the default shader IR +- nir_lower_to_source_mods: Don't sneek in an abs modifier from parent +- r600/sfn: Only run 64 bit ops lowering passes when really needed +- r600/sfn: Always start a new CF after a KILL instruction +- r600/sfn: don't propagate registers into conditional test +- r600/sfn: Add peephole optimization for kill instructions +- r600/sfn: Delete final lowered nir shader early +- r600/sfn: Fix typo +- r600/sfn: Use the correct allocator for loop lists +- r600/sfn:explicitly initialize the memory pool +- r600/sfn: assert on use of abs modifier in op3 +- r600/sfn: Unroll loops after doing some optimizations +- r600/sfn: Always enforce LDS operation order +- r600/sfn: Make sure all components are usable when lowering TF inputs +- r600/sfn: run cleanup passes after late algebraic opt +- nir: move fusing csel and comparisons to opt_late_algebraic +- r600/sfn: Unify the handling of resource IDs in instruction +- r600/sfn: Handle nir_op_seq and nir_op_sne +- r600: Fix printing t-channel in diss-assambly +- r600/sfn: Add test for channel changes in TEX source from opt +- r600/sfn: Allow copy-prop of group dest into origin +- r600/sfn: Trigger TEX CF based on max TEX CF size +- r600/sfn: improve scheduling of tex sources +- r600/sfn: Increase scheduling priority on uniform reads and non-ssa writes +- r600/sfn: elimiate dead registers too +- r600: Account for color and clipvertex when evaluating LDS space +- r600/sfn: evaluate LDS location for color and clip-vertex too +- r600/sfn: Add .clang-format file and apply style +- r600: declare for counter locally and fix signed/unsigned warning +- r600: Only count ALU registers that are not clause local +- r600: Print MOVA_INT dest on Cayman +- r600/sfn: Handle load_workgroup_size +- r600/sfn: lower uniforms to UBOs +- r600/sfn: remove load_uniform handling +- etnaviv: Create MSAA surfaces with PE compatible tiling +- nir/algeraic_opt: use double options too for lowering ftrunc@64 +- r600/sfn: Fix source modifiers for ffract64 +- r600/sfn: Fix f2u32 and remove backend lowring of f2u64 and f2i64 +- r600/sfn: Honor shader key w.r.t. atomic counter layout +- r600/sfn: Fix location for reading cube array image dimensions +- r600/sfn: sort FS color outputs before all other outputs +- virgl: don't a use staging when a resources created with the shared flag +- virgl: Fix injection of double from const mov instruction +- r600/sfn: always use four slots for Cayman trans ops + +Giancarlo Devich (4): + +- mesa: Fix stack corruption for PIPE_QUERY_TIMESTAMP +- d3d12: Add cache managers for separate buffer usage cases +- gallium/pipe: Align allocation size in \`pb_cache_manager_create_buffer` +- d3d12: Don't align already-aligned size in \`d3d12_bufmgr_create_buffer` + +Glenn Kennard (1): + +- nv30: Fix non-scissored clears after a scissor has been set + +Guilherme Gallo (29): + +- ci/lava: Customise sections timeouts via envvars +- ci/lava: Add timeout header info for LAVA GL Sections +- ci/bin: Add utility to find jobs dependencies +- ci/bin: Sort imports from ci_run_n_monitor.py +- ci/bin: Make ci_run_n_monitor finds dependencies automatically +- ci/bin: Add script to expand jobs manifest +- ci/bin: Cache GQL queries +- ci/bin: Print job needs DAG in ci_run_n_monitor +- ci/skqp: Add an option to run all tests +- ci/skqp: Show reports on crashes +- ci/skqp: Fix paths in skqp-runner +- ci/skqp: Fix Nima-Cpp fetching error +- ci/skqp: Build list_gpu_unit_tests and list_gms +- ci/skqp: Add support for commenting tests files +- ci/skqp: Use SKQP_BIN_DIR instead of hardcoded /skqp dir +- ci/skqp: Put generated tests files in artifacts +- ci/skqp: Supress irrelevant shellcheck warnings +- ci/skqp: Remove .baremetal-skqp-test in favor of .skqp-test +- ci/skqp: Add gitlab sections for uncluttering +- ci/freedreno: skqp: run with new tests files +- ci/radeonsi: skqp: Add fail test files for raven +- ci/radeonsi: Add zork jobs and rules +- ci/bin: Fix requirements.txt +- ci/bin: Remove whitespace from token files +- ci: Fix kernel+rootfs.* jobs +- ci: Update piglit with s3 support +- ci: Update piglit-traces tests expectations +- ci: Update ci-fairy in CI, rootfs and containers +- freedreno/ci: Skip civilization-v/CivilizationV-trim trace + +Hans-Kristian Arntzen (3): + +- vulkan: Update to 1.3.228 headers. +- radv: Implement VK_EXT_mutable_descriptor_type. +- vk/runtime: Fix narrowing of timeline signal and wait value to u32. + +Iago Toral Quiroga (87): + +- broadcom/compiler: don't use imprecise_32bit_lowering for idiv lowering +- broadcom/compiler: use nir_opt_idiv_const +- broadcom/compiler: simplify code emitted for centroid coordinates +- nir/lower_alu: drop unnecessary iand on uadd_carry result +- broadcom/compiler: track number of TMU operations in prog data +- v3dv: move check_needs_load/store helpers to unversioned code +- v3dv: track if a job is compatible with double-buffer mode +- v3dv: fix tile state allocation +- v3dv: add a v3dv_job_allocate_tile_state helper +- v3dv: postpone tile state allocation for render pass jobs +- v3dv: tell job_compute_frame_tiling whether we want to use double-buffer mode +- v3dv: add a heuristic for double-buffer mode +- v3dv: vkCmdClearAttachments no longer generates its own RCL +- v3d,v3dv: lower texel buffer aligment requirements +- v3dv: implement VK_EXT_texel_buffer_alignment +- v3dv: add a helper to destoy pipeline layouts +- v3dv: ref/unref pipeline layout objects +- v3dv: set maxBufferSize property +- v3dv: add a v3dv_image_init helper +- v3dv: add a get_image_memory_requirements helper +- v3dv: implement vkGetDeviceImageMemoryRequirementsKHR +- v3dv: add a get_buffer_memory_requirements helper +- v3dv: add a buffer_init helper +- v3dv: implement vkGetDeviceBufferMemoryRequirementsKHR +- v3dv: implement vkGetDeviceImageSparseMemoryRequirementsKHR +- v3dv: expose VK_KHR_maintenance4 +- nir/lower_variable_initializers: implement non-scoped barrier path +- v3dv: implement VK_KHR_zero_initialize_workgroup_memory +- v3dv: also check GS stage on image access for binning barriers +- v3dv: implement vkCmdPipelineBarrier2 +- v3dv: port remaining entry points to KHR_synchronization2 +- v3dv: use VkPipelineStageFlagBits2 when checking semaphore wait stage +- v3dv: expose VK_KHR_synchronization2 +- v3dv: implement VK_KHR_workgroup_memory_explicit_layout +- v3dv: switch to common code for command buffer lifecycles +- v3dv: expose VK_EXT_tooling_info +- v3dv: fix swap_rb and channel_reverse flags in image views +- v3dv: expose VK_EXT_border_color_swizzle +- v3dv: ref pipeline layout earlier during pipeline init +- v3dv: implement VK_EXT_depth_clip_control +- v3dv: expose VK_EXT_attachment_feeback_loop_layout +- v3dv: fix variable type +- v3dv: limit heap size to 4GB +- broadcom/simulator: add a helper to get the amount of free heap memory +- v3dv: implement VK_EXT_memory_budget +- v3dv: expose VK_EXT_primitive_topology_list_restart +- v3dv: don't load an attachment for unaligned render area if we are not storing +- v3dv: expose VK_EXT_load_store_op_none +- v3dv: don't return incompatible driver if GPU is not present +- v3dv: fix program id for binning shaders +- broadcom/compiler: detect unifa write from signal +- broadcom/compiler: check signal writes to magic regs when updating scoreboard +- v3dv: optimize ldunif load into unifa write +- v3dv: fix VK_EXT_texel_buffer_alignment +- broadcom/compiler: increase V3D_MAX_BUFFE_RANGE to 2^30 bytes +- broadcom/compiler: fix robust buffer access +- broadcom/compiler: handle shared stores with robust buffer access +- broadcom/compiler: don't apply robust buffer access to shared variables +- broadcom/compiler: rename v3d_nir_lower_robust_buffer_access.c +- broadcom/compiler: rename static helpers involved with robust buffer access +- broadcom/compiler: add a lowering for robust image access +- v3dv: expose VK_EXT_image_robustness +- v3dv: use NIR_PASS with v3d_nir_lower_robust_image_access +- v3dv: use enabled features from vk_device +- broadcom/compiler: trivial code clean-up +- vulkan/runtime: include robustness info when hashing a shader stage +- v3dv: implement VK_EXT_pipeline_robustness +- v3dv: drop layout refs for all allocated sets from a pool on destroy / reset +- broadcom/compiler: handle vec2 load/store index +- v3dv: refactor events +- v3dv: re-enable sync_fd import/export +- v3dv: return out of host memory if we fail to create event pipelines +- v3dv: split event implementation to a separate file +- v3dv: remove unnecessary check for NULL +- v3dv: do a better job at cleaning up the device on init failure +- v3dv: fix incorrect return type +- v3dv: handle allocation failure during pipeline initialization +- v3dv: do better cleanup on failure during pipeline cache operation +- v3dv: increase limit for active event objects +- v3dv: always check VK_ACCESS_2_MEMORY_READ_BIT for read accesses +- v3dv: make the helper to emit pipeline barriers public to other files +- v3dv: fix event synchronization +- v3dv: vkCmdWaitEvents2 takes an array of VkDependencyInfo +- v3dv: use vk_alloc instead of malloc +- broadcom/compiler: avoid using ldvary sequence to hide latency of branching +- v3dv: fix debug dump on BO free +- v3dv: ignore imported BOs when tracking BO memory usage + +Ian Romanick (8): + +- nir: spirv: Allow 32-bit version of nir_intrinsic_is_sparse_texels_resident +- radeonsi: r600: d3d12: st: Use NIR lowering for tg4 offset arrays instead of GLSL lowering +- glsl: Remove lower_offset_arrays pass +- nir/comparison_pre: See through an inot to apply the optimization +- intel/fs: Fix constant propagation into 32x16 integer multiplication +- nir/range_analysis: Set higher default maximum for max_workgroup_count +- nir/loop_analyze: Fix get_iteration for nir_op_ine +- nir/loop_analyze: Fix get_iteration for nir_op_fneu + +Igor Torrente (3): + +- vulkan: Add vk_clock_gettime and vk_time_max_deviation +- radv: anv: Use the new vk_clock_gettime and vk_time_max_deviation functions +- venus: Fix dEQP-VK.pipeline.timestamp.calibrated.host_domain_test failure + +Ikshwaku Chauhan (1): + +- Revert "radeon: add EFC support to only VCN2.0 devices" + +Illia Abernikhin (2): + +- utils: Move functions from debug.* to u_debug.* +- utils: Merge util/debug.* into util/u_debug.* and remove util/debug.* + +Illia Polishchuk (2): + +- mesa: skip extra state updates for clear calls +- driconf/Intel: Add lower_depth_range_rate option workaround for Homerun Clash misrendering issue + +Isaac Bosompem (1): + +- virgl: Set use_staging in resource_from_handle + +Italo Nicola (6): + +- etnaviv: skip disk cache initialization on standalone compiler +- etnaviv: fix standalone compiler to work without a tgsi backend +- clc: add 32-bit target +- rusticl: use 32-bit address format for 32-bit devices +- rusticl: correctly check global argument size +- rusticl: fix MemConstant invalid arg size check + +Iván Briano (13): + +- anv: emit scissors when the pipeline changes +- anv: pipelineStageCreationFeedbackCount is allowed to be 0 +- anv: populate rt shader groups if they were found in the cache +- anv: Set meshShaderQueries for mesh shader ext features +- intel/utrace: create the callback events for xfb trace points +- anv/grl: Add a GRL file parser +- anv: set READ/WRITE_WITHOUT_FORMAT for buffer views +- vulkan_hasvk: set READ/WRITE_WITHOUT_FORMAT for buffer views +- anv: compile_upload_rt_shader expects a valid pointer +- anv: use DX rules for point rasterization +- hasvk: use DX rules for point rasterization +- anv: support VK_PIPELINE_CREATE_RAY_TRACING_SKIP_* +- anv: enable rayTraversalPrimitiveCulling feature + +James Park (2): + +- vulkan: Augment _WIN32 stub comparison +- meson,amd: Remove Windows libelf wrap + +James Zhu (4): + +- amd/common: some ASICs with gfx9 use compute rings for render +- util/format: add util format y8_400_unorm +- frontends/va: add support for yuv400 and yuv444 +- radeonsi/vcn: enable jpeg decode of yuv444 and yuv400 + +Jami Kettunen (1): + +- freedreno/ir3: Switch to NIR for a5xx's vertex id lowering. + +Jan Beich (1): + +- util: unify FreeBSD futex_wait signature with Linux/OpenBSD/Windows + +Jason Ekstrand (115): + +- nir: Clean up and improve nir_dedup_inline_samplers +- intel/fs_reg_allocate: Improve compressed instruction self-interference +- intel/rt: Handle halts in any-hit shaders properly +- intel/rt: Handle multiple exits in lower_shader_returns +- genxml: Add BVH data structures +- anv: Don't require 32-bit addresses for scratch on Gen12.5+ +- vulkan,anv,dozen: Use VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT +- radv: Use both aspects for depth/stencil blit destinations +- radv: Use VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT for render loops +- radv/meta: Respect render area for MSAA resolves +- radv/meta: Fix a stack use-after-free with sample locations +- radv/meta: Drop subpass_att from radv_fast_clear_color +- radv/meta: Use Begin/EndRendering for resolves +- radv/meta: s/dest/dst/ in a few functions +- radv/meta: Use Begin/EndRendering for clears +- radv: Drop set/restore_subpass +- radv/meta: Drop redundant depth_view_can_fast_clear +- radv: Add a image_transition helper for separate depth/stencil layouts +- radv: Don't assume layout == stencilLayout for stencil-only +- util,intel: Pull the bit packing helpers from genxml to a common header +- broadcom: Use util/bitpack_helpers.h +- util: Add util_bitpack_[su]fixed_clamp helpers +- asahi: Use util/bitpack_helpers.h +- panfrost: Use util/bitpack_helpers.h +- anv: Use nir_intrinsic_load_layer_id for ViewId in fragment shaders +- anv: Decouple primitive replication check from the fragment shader +- anv: Stop looking at the pipeline in multiview lowering +- nir: Track per-view outputs in shader_info +- intel/compiler: Store the number of position slots in the VUE map +- anv: Drop anv_pipeline::use_primitive_replication +- pvr: Only destroy the physcial device if it exists +- vulkan/cmd_buffer: add record_state to the common command buffer. +- vulkan/cmd_queue: Drop vk_cmd_queue::error +- vulkan: Add a concept of recycling an object +- vulkan: Re-order arguments to vk_command_pool_init +- vulkan: Re-order arguments to vk_command_buffer_init +- vulkan: Add a vk_command_buffer_ops struct +- vulkan: Add a common vkResetCommandBuffer() implementation +- vulkan: Add a common vkAllocateCommandBuffers() implementation +- vulkan: Recycle command buffers in vk_command_pool +- radv: Move to the common command pool framework +- tu: Move to the common command pool framework +- lvp: Move to the common command pool framework +- vulkan,docs: Add documentation Vulkan command pools +- vulkan: Dirty VP_VIEWPORTS/SCISSORS when copying viewports/scissors +- radv: Set the window scissor to the render area, not framebuffer +- radv: Only copy the render area from VRS to HTILE +- radv: Leave image layouts alone when doing HW MSAA resolves +- radv: Switch to dynamic rendering only +- rusticl: Call nir_scale_fdiv +- vulkan/runtime: Add a comon vk_descriptor_update_template +- anv: Switch to the common descriptor update template struct +- vulkan/runtime: Compact descriptor update templates +- panvk: Fix buffer views +- spirv: Don't use libclc for wait_group_events +- nir: Add a helper for finding a function by name +- nir/load_libclc: Don't add generic variants that already exist +- vulkan: Record more enabled robustness features +- vulkan: Add a helper for gathering pipeline robustness +- iris: Use a larger alignment for buffer allocations +- iris: Fix more BO alignments +- iris: Handle resource offsets in buffer copies +- iris: Stop looking at textures_used for samplers +- iris: Split max #defines for textures/samplers/images +- iris: Support up to 64 images +- iris: Support up to 128 textures +- intel/fs: Always use integer types for indirect MOVs +- intel/fs: SEL_EXEC uses the integer pipe for 64-bit stuff +- intel/mi_builder: add a way to reserve a register +- intel/mi_builder: Add a helper for incrementing reference counts +- anv/formats: Advertise ACCELERATION_STRUCTURE_VERTEX_BUFFER_BIT +- anv: Add an anv_address_map helper +- anv: Implement VK_KHR_pipeline_library +- anv: Set up the memory-backed FIFO buffer +- anv: Add extern "C" guards +- anv: Add support for compiling OpenCL-style kernels +- anv: Add support for OpenCL-style kernel dispatch +- anv: Import GRL +- anv/grl: Build OpenCL kernels +- anv/grl: Add a helper for dispatching our pre-built kernels +- intel/grl: Parse GRL files and generate C +- anv: Build BVHs on the GPU with GRL +- anv: Advertise ray-tracing on DG2 +- intel/devinfo: DG2 supports ray-tracing +- vulkan/wsi: Support configuring swapchain images as part of swapchain init +- vulkan/wsi/x11: Configure images via params passed to wsi_swapchain_init() +- vulkan/wsi/wayland: Configure images via params passed to wsi_swapchain_init() +- vulkan/wsi/display: Configure images via params passed to wsi_swapchain_init() +- vulkan/wsi/win32: Configure images via params passed to wsi_swapchain_init() +- vulkan/wsi: Always configure images in wsi_swapchain_init() +- vulkan/wsi: Don't expose as many image configure helpers +- vulkan/wsi: Add a typedef for memory type select callbacks +- vulkan/wsi: Add a supports_scanout flag +- vulkan: Use altlen when available for array lengths +- vulkan: Rename viewport_state::negative_one_to_one +- vulkan: Add a dynamic state for vp.depth_clip_negative_one_to_one +- vulkan: Add more dynamic rasterizer state +- vulkan: Add dynamic state for tessellation domain origin +- vulkan: Add more dynamic multisample states +- vulkan: Add more dynamic color blend states +- vulkan,lavapipe: Use a tri-state enum for depth clip enable +- iris: Set SamplerCount in shader packets +- nir: Reformat a comment +- nir: Use nir_shader_instructions_pass in nir_lower_readonly_images_to_tex +- nir: Misc. style fixes to nir_lower_readonly_images_to_tex +- nir: Unconditionally call nir_trim_vector in nir_lower_readonly_images_to_tex +- nir/types: Add some asserts to glsl_get_struct_field() +- nouveau/codegen: Support bindless texture queries +- intel/lower_mem_access_bit_sizes: Compute alignments automatically +- intel: Don't cross DWORD boundaries with byte scratch load/store +- vulkan: Handle VK_SUBPASS_EXTERNAL at the end of a subpass +- vulkan: Unconditionally add barriers for missing external subpass deps +- vulkan: Add state for extraPrimitiveOverestimationSize +- vulkan: Add a common implementation of CmdSetSampleLocationsEnableEXT +- vulkan: Add a dummy vk_common_CmdSetColorBlendAdvancedEXT() + +Jeremy Rand (1): + +- llvmpipe: bump LP_MAX_THREADS to 32 + +Jesse Natalie (82): + +- ci/windows: Re-enable Windows runners +- dzn: Disable extensions that require multi-view +- dzn: Use D3D12 constants instead of naked numbers for device limits +- ci/windows: Disable Windows builds while the runner is offline again +- ci/windows: Windows runner is back online +- ci/windows: Install Graphics-Tools package on the base image +- egl/wgl: Don't require a native display to support window surfaces +- egl/wgl: Implement QuerySurface +- wgl: Split pbuffer creation into two functions +- egl/wgl: Implement PBuffer surfaces +- egl/wgl: Set YInvert attribute to always true +- egl/wgl: Implement bind/release texture buffer +- wgl: Allow per-framebuffer swap interval overrides +- egl/wgl: Support eglSwapInterval +- egl/wgl: Support eglWaitClient +- egl/wgl: Support eglWaitNative +- egl,wgl: Support eglCreateImageKHR +- wgl: Pass smapi explicitly to context creation +- wgl: Pass smapi explicitly to framebuffer creation +- egl/wgl: Use a per-display st_manager +- egl/wgl: Hook up image validate/get in smapi +- egl/wgl: Support EGL_MESA_query_driver +- egl/wgl: Support sync objects +- wgl: Use pfi instead of iPixelFormat more often +- egl/wgl: Delete unused variables/code +- egl/wgl: Fix some awkward sizeof formatting +- meson: Don't include glsl compiler if flex/bison aren't found +- microsoft/compiler: Discard shouldn't be marked readnone +- dxil_nir_lower_int_cubemaps: When not lowering samplers, don't touch sampler types +- microsoft/compiler: Fix PSV struct when numthreads is 0 +- microsoft/clc: Handle a null clc logger +- gallium/windows: Delete OpenGLOn12.dll target +- meson: Add an option to specify the WGL gallium megadriver filename +- mesa: Expose GL_NV_ES1_1_compatibility +- microsoft/compiler: Always emit a shader at the max-supported shader model +- microsoft/compiler: Support up to shader model 6.5 +- microsoft/compiler: Add struct and function defs for SM6.6 handle funcs +- microsoft/compiler: Add dynamic create handle helper +- microsoft/compiler: Add getters for res bind/props structs +- microsoft/compiler: Delete double-assignment of sampler metadata field +- microsoft/compiler: Pass lower_bound, upper_bound, space to createhandle +- microsoft/compiler: Handle SM6.6 handles +- microsoft/compiler: SM6.6 is supported +- microsoft/compiler: Support SM6.7 +- d3d12: Get max supported shader model +- dzn: Get max supported shader model +- u_atomic: Add a helper for pointer compare-exchange +- d3d12: Change displayable format logic +- ci/windows: Update build container image with all deps so wraps can be disabled +- deps: Update DirectX-Headers req to 1.606.4 +- d3d12: Move some things from screen late-init to early-init +- d3d12: Store the d3d12 module reference on the screen +- d3d12: Support device factories in addition to global device creation +- d3d12: Attempt to use the Agility SDK +- d3d12: Use ID3D12DeviceConfiguration for root signature serialization +- d3d12: Release sharing contract in flush_frontbuffer +- dzn: Store the d3d12 module reference on the instance +- dzn: Support device factories in addition to global device creation +- dzn: Use architecture props to return correct device type +- dzn: Attempt to use the Agility SDK +- dzn: Use ID3D12DeviceConfiguration for root signature serialization +- ci/windows: Update vk-gl-cts to pick up new zlib dependency +- d3d12: Don't put permanently-resident resources in the residency bo list +- d3d12: Don't multiply cube array sizes by 6 +- d3d12: Set PIPE_CAP_MAX_TEXTURE_UPLOAD_MEMORY_BUDGET +- dzn: Set factory flags before creating device +- d3d12: Advertise PIPE_CAP_BUFFER_MAP_PERSISTENT_COHERENT +- include: Add WGL version of GL interop extensions +- mesa, dri2: Move interop helpers from dri2 to state tracker +- include: Add a flush interop method +- mesa: Make fence_sync not static +- mesa/st: Split interop object lookup from export +- mesa/st: Implement the new flush method +- dri: Implement the new flush method +- glx: Implement the new flush method +- egl/dri2: Implement the new flush method +- wgl: Implement interop methods +- egl/wgl: Implement interop methods +- include: Add driver data to v2 of mesa_glinterop_device_info +- gallium: Add screen methods for driver-private data on interop interface +- mesa/st: Query driver for interop data +- d3d12: Hook up driver-private interop data + +Jessica Clarke (1): + +- panfrost/blend: Fix invalid const values leading to NIR validation errors + +Joan Bruguera (1): + +- nv50/ir/nir: ignore sampler for TXF/TXQ ops. + +John Brooks (5): + +- radv: Add get_addrlib function to radv_radeon_winsys +- vulkan: Introduce vk_format_is_block_compressed function +- radv: Only apply mipmap view adjustments to block compressed images +- radv: Fix corrupted mipmap copies on GFX9+ +- radv: Fix mipmap views on GFX10+ + +Jonathan (3): + +- gallium/u_threaded: buffer subdata merging (v2) +- gallium/u_threaded: Prepare for buffer sharedness tracking +- gallium/u_threaded: buffer sharedness tracking + +Jonathan Gray (2): + +- iris: check i915 features after hw gen +- util: include sys/time.h for timespec functions + +Jordan Justen (15): + +- iris: Drop extra file-descriptor dup in iris_drm_screen_create() +- intel/pci_ids: Update ADL-S strings +- intel/pci_ids: Add 0x468b ADL-S PCI-id +- intel/pci_ids: Update ATS-M device names +- intel/pci_ids: Update DG2 device names +- intel/pci_ids: Add dg2 0x5698 pci-id +- intel/pci_ids: Drop non-upstream dg2 pci-ids +- intel/compiler: Use builder to allocate fs regs for gs control data bits +- intel/devinfo: Add MTL platforms enums and intel_device_info_is_mtl() +- intel/l3: Use L3 full-way allocation setting for gfx12.5 (DG2, MTL) +- anv/meson: Use anv_flags and anv_cpp_flags in genX compiles +- meson: Deprecate vulkan-rt-drivers intel +- meson: Define with_intel_vk_rt based on with_intel_clc +- intel/compiler: Broadcast lower code should check 64-bit int support +- ci/d3d12: Update quick_shader results with 24 fixes from !19128 + +Jose Maria Casanova Crespo (4): + +- CI: Igalia farm is down +- Revert "CI: Igalia farm is down" +- vc4: consolidate shader-db output +- v3d: Minor fixes on sand8 blit based on sand30 modifications + +Joshua Ashton (2): + +- radv: Remove workarounds for D3D9 feedback loops +- freedreno: Disable 8bpp_ubwc on a6xx gen2 + +José Fonseca (1): + +- util: Add macro util_get_cpu_caps in u_cpu_detect.c for avoid accidentally call it in u_cpu_detect.c + +José Roberto de Souza (34): + +- intel: Simply intel_gem_create_context_engines() +- anv: Remove duplicated memset() in physical device creation +- anv: Remove anv_app_info +- anv: Free vmas in case device creation fails in pthread_mutex_init(&device->mutex +- anv: Do not duplicate intel_device_info memory in each logical device +- anv: Only wait for queue sync if execbuf was properly executed +- anv: Do not copy garbage to batch_bo +- anv: Nuke anv_execbuf_init() +- anv: Group all context operations during device creation +- anv: Nuke dead code +- anv: Return earlier in anv_gem_get_tiling() when not supported +- intel/compiler/fs: Fix compilation of shaders with SHADER_OPCODE_SHUFFLE of float64 type +- intel/compiler/fs: Use DF to load constants when has_64bit_int is not supported +- vulkan_hasvk: Nuke dead code around I915_ENGINE_CLASS_COMPUTE +- anv: Nuke cmd_parser_version +- intel/dev: Adjust prefetch_size values for MTL engines +- intel: Share code to read render timestamp +- intel: Make engine related functions and types not i915 dependent +- intel/dev: Split i915 specific parts of intel_get_device_info_from_fd() +- anv: Move fetch of i915 physical device parameters +- anv: Split i915 specific parts of anv_queue_submit_simple_batch() +- anv: Use Vulkan types for priority as much as possible +- anv: Split the debug part of anv_queue_exec_locked() +- intel: Convert missing i915 engine types to intel +- intel: Convert i915 engine type to intel in tools/ common/ and ds/ +- intel: Add and use intel_engines_class_to_string() +- hasvk: Nuke code around local memory +- hasvk: Fix build around intel_measure_state_changed() call +- iris: Fix enablement of protected contexts +- intel/perf: Use intel_device_info functions to compute subslice and eu totals +- iris: Nuke pci_id from iris_screen +- iris: Drop duplicated errno handling in iris_bo_wait() +- iris: Do not export iris_bo_wait() +- iris: Set priority for replaced engine context + +Juan A. Suarez Romero (3): + +- vc4/simulator: use i915/amd ioctls for BO +- vc4: properly restore vc4 debug option +- vc4: store tex sampler in proper register + +Julia Tatz (1): + +- zink: Advertise PIPE_CAP_NATIVE_FENCE_FD + +Juston Li (6): + +- venus: Enable VK_EXT_texture_compression_astc_hdr +- venus: sync venus protocol headers for VK_EXT_primitive_topology_list_restart +- venus: add support for VK_EXT_primitive_topology_list_restart +- venus: sync to latest venus protocol headers +- venus: add support for VK_EXT_multi_draw +- venus: use buffer cache for vkGetDeviceBufferMemoryRequirements + +Kai Wasserbäch (22): + +- chore(deps): clover: raise the minimum LLVM version to 11.0.0 +- fix(FTBFS): meson: raise C++ standard to C++17 +- chore(docs): rusticl: improve list of build dependencies +- fix: util/format: unused variable ‘desc’ [-Wunused-variable] +- fix: nir: unused variable ‘else_block’ [-Wunused-variable] +- fix: ac/llvm: unused variable ‘offset’ [-Wunused-variable] +- fix: mesa: unused variable ‘ret’ [-Wunused-variable] +- fix: mesa/st: variable ‘texobj’ set but not used [-Wunused-but-set-variable] +- fix: mesa/st: unused variable ‘src’ [-Wunused-variable] +- fix: gallivm: variable ‘type_kind’ set but not used [-Wunused-but-set-variable] +- fix: r600/sb: unused variable ‘repdep2’ [-Wunused-variable] +- fix: r600/sb: unused variable ‘fop’ [-Wunused-variable] +- fix: r600/sb: unused variable ‘r’ [-Wunused-variable] +- fix: r600/sfn: variable ‘opinfo’ set but not used [-Wunused-but-set-variable] +- fix: r600/sb: warning: unused variable ‘{b,nl}’ [-Wunused-variable] +- fix: r600/sfn: unused variable ‘splitpos’ [-Wunused-variable] +- fix: r600/sfn: unused variable ‘{splitpos,param}’ [-Wunused-variable] +- fix: r600/sfn: variable ‘fail’ set but not used [-Wunused-but-set-variable] +- fix: r600/sfn: unused variable ‘spi_sid’ [-Wunused-variable] +- fix: iris: unused variable ‘devinfo’ [-Wunused-variable] +- fix: zink: unused variable ‘intr’ [-Wunused-variable] +- fix: frontends/va: unused variable ‘max_pipe_hevc_slices’ [-Wunused-variable] + +Karmjit Mahil (30): + +- pvr: Fix bo mapping on alloc with PVR_BO_ALLOC_FLAG_CPU_ACCESS. +- pvr: Compete pvr_calc_fscommon_size_and_tiles_in_flight(). +- pvr: Fix calculation in rogue_max_compute_shared_registers(). +- pvr: Change indentation from tab to spaces for xml files. +- pvr: Set descriptor dirty flag based on other flags. +- pvr: Finish setting up job resolve info. +- pvr: Remove unimplemented push descriptor code. +- pvr: Add depth_bias_array handling on dbenable. +- pvr: Add clear program in pvr_device. +- pvr: Add static clear control stream templates in pvr_device. +- pvr: Add static clear VDM state in pvr_device. +- pvr: Add graphics pipeline barrier handling. +- Revert "pvr: Make pvr_cmd_pack() macro clearly internal" +- pvr: Implement clear ppp state emission from template. +- pvr: Add mid fragment pipeline barrier if needed. +- pvr: Handle VK_CULL_MODE_FRONT_AND_BACK. +- pvr: Emit cs words for load op on vkCmdBeginRenderPass(). +- pvr: Add basic skeleton for event sub cmd. +- pvr: Remove struct pvr_emit_state and emit header directly. +- pvr: Add csb helpers macros to write into raw buffer. +- pvr: Make control stream word writing stricter. +- pvr: Complete pvr_emit_ppp_state(). +- pvr: Add assert for texturestate being 0 in when emitting ppp state. +- pvr: Add EMIT_MASK in pvr_emit_ppp_state(). +- pvr: Fix possible seg fault on csb copy. +- pvr: Fix overflow before widen warning for rgn headers size calculation. +- pvr: Handle pipeline barrier vk_sync. +- pvr: Remove outdated comments. +- pvr: Add PBE accum format size in bytes. +- pvr: Change pvr_get_hw_clear_color() to pack based on accum formats. + +Karol Herbst (189): + +- printf: extract clovers printf impl +- printf: add some unit tests +- nir: serialize printf metadata for CL kernels +- nir/lower_images: extract from clover +- nir: extract the clc inline sampler dedup pass from clc +- microsoft/clc: drop inline sampler reordering +- clc: undefine spirv defs to work around LLVMs headers +- vtn: silence warning about linkage +- gallium: mark the input data as const in pipe_grid_info +- util/memstream: insert null terminator on windows +- ci: update fails list +- ci: bumping all tags to make sure we don't have hidden fails +- nvc0: limit max global and alloc size +- ci: update CI to reflect clovers LLVM version bump +- nv50: fix code heap after pipe_shader_enum change +- nouveau: use the contexts pushbuf and client where possible +- nouveau/mm: make code thread safe +- nouveau/buffer: simplify uses of nouveau_fence_work +- nouveau/fence: rework nouveau_fence_emit so we can call it on emitted fences +- nouveau: move nouveau_context initialization to common code +- nouveau: wrap nouveau_bo_map +- nouveau: wrap nouveau_bo_wait +- nouveau: wrap all nouveau_pushbuf_space calls +- nouveau: wrap nouveau_pushbuf_validate +- nouveau: wrap nouveau_pushbuf_refn +- nouveau: use PUSH_KICK instead of nouveau_pushbuf_kick +- nouveau/fence: make the fence struct part of the fence API +- nv50: remove nouveau_fence_signalled prior nouveau_fence_wait +- nv50: remove nv50_bufctx_fence call in vbo_kick_notify +- nouveau/fence: per context fence +- nouveau: per context client and pushbuf +- nouveau: make fencing race free +- nvc0: make state handling race free +- nv50: race free state tracking +- nv50/ir: fix OP_UNION resolving when used for vector values +- nv50: properly flush the TSC cache on 3D +- rusticl: added +- rusticl: finish implementing clBuildProgram +- rusticl/util: add static_assert macro +- rusticl: implement clFinish and clFlush +- rusticl: implement clEnqueueReadBuffer +- rusticl/mem: support ops on subbuffers +- rusticl/kernel: basic implementation +- rusticl: nir bindings +- rusticl: translate spirv to nir and first steps to kernel arg handling +- rusticl/kernel: more clGetKernelWorkGroupInfo props +- rusticl/kernel: clGetKernelWorkGroupInfo allows a NULL device +- rusticl/mem: use helper context for COPY_HOST_PTR buffers +- rusticl/event: wrong but non crashing impl of clWaitForEvents +- rusticl: hack for CL 3.0 +- rusticl/mesa: add fencing support +- rusticl/event: proper eventing support +- rusticl/kernel: implement clCreateKernelsInProgram +- rusticl/kernel: implement clEnqueueTask +- rusticl/program: undefine __IMAGE_SUPPORT__ if images are unsupported +- rusticl/queue: fix clReleaseCommandQueue +- rusticl/mem: implement clCopyBuffer +- rusticl/mem: implement clFillBuffer +- rusticl/event: implement marker and barrier +- rusticl/mem: add clEnqueueMigrateMemObjects stub +- rusticl/program: support compiling libraries +- rusticl: implement clUnloadPlatformCompiler +- rusticl/api: add param to query which contains application provided values +- rusticl/program: implement clCreateProgramWithBinary +- rusticl/event: add fake impl of clGetEventProfilingInfo +- rusticl/kernel: implement CL_KERNEL_COMPILE_WORK_GROUP_SIZE +- rusticl/mem: finish clEnqueueMigrateMemObjects +- rusticl/kernel: run driver requested lowering passes +- rusticl/kernel: add support for offsets +- rusticl/kernel: add missing preprocessor definitions +- rusticl: advertize CL 1.1 and CL 1.2 extensions +- rusticl/context: implement clSetContextDestructorCallback +- rusticl/kernel: implement clCloneKernel +- rusticl/device: set required double fp config values +- rusticl/device: advertize atomic caps +- rusticl: claim support for generic pointers +- rusticl/program: add il stubs +- rusticl: add svm func stubs +- rusticl/icd: add more func pointers +- rusticl/kernel: report SIMD width as work group sizes +- rusticl/kernel: sweep nir to reduce peak memory usage +- rusticl/device: expose cles_khr_int64 for compatible embedded devices +- rusticl/mem implement the memory \*WithProperties API +- rusticl/pipe: add clGetPipeInfo stub end return CL_INVALID_MEM_OBJECT +- rusticl/program: add stubs for program ctors and dtors +- rusticl/kernel: add stub for clGetKernelSubGroupInfo +- rusticl/event: timeout wait on condvar to abort on reaped worker threads +- rusticl: add support for printf +- rusticl/spirv: add print method +- rusticl/mesa: support mapping textures +- rusticl/mem: support read/write/copy ops for images +- rusticl/memory: rework mapping tracking +- rusticl/mem: implement maping images +- rusticl/mem: implement fill image +- rusticl/mem: implement copy image +- rusticl/mem: implement clCreateImage2D and 3D +- rusticl/mem: implement copies between buffers and images +- rusticl/mesa/context: un Arc the PipeContext +- rusticl/mem: implement clCreateSamplerWithProperties +- rusticl/kernel: support for images +- rusticl/kernel: inline samplers +- rusticl/kernel: lower memcpy +- rusticl/device: add retain/release callbacks for luxmark v4 +- rusticl/device: implement cl_khr_3d_image_writes +- rusticl/kernel: set CL_DEVICE_PREFERRED_WORK_GROUP_SIZE_MULTIPLE +- rusticl/kernel: run some more opt passes +- rusticl: get rid of a few clones +- rusticl/memory: don't map more than necessary for copy operations +- rusticl/device: fix api consistency_device_and_host_timer test +- rusticl/queue: implement missing CL 3.0 bits +- rusticl/device: fix compiler features_macro +- rusticl/mem: add user_ptr fallback shadow buffer +- rusticl/mesa: make resource_from_user_memory an optinal feature +- rusticl/kernel: implement image_format and image_order +- rusticl/mem: fix IMAGE1D_BUFFER +- rusticl/mem: only write pitch when required +- rusticl: port to Rust 2018 +- rusticl: kernel caching +- rusticl/kernel: prepare for nir caching +- rusticl/kernel: cache the nir as well +- rusticl: fix compiler features_macro +- rusticl/kernel: optimize local size +- rusticl: the CTS is a piece of shit +- rusticl/kernel: implement CL_KERNEL_ATTRIBUTES +- rusticl/program: some boilerplate code for SPIR-V support +- rusticl/memory: kernel read_write images prep work +- rusticl: disable fp64 support +- rusticl/device: allow overwriting the device_type via env +- rusticl/program: parse quoted paths in args +- rusticl/kernel: fix local buffers +- rusticl/device: report mesas version for CL_DRIVER_VERSION +- rusticl/icd: implement clGetExtensionFunctionAddressForPlatform +- rusticl/icd: add some way of debugging CL function calls +- rusticl/kernel: use real references for arguments +- rusticl: proper PIPE_MAP flags for internal maps +- rusticl: call glsl_type_singleton_init_or_ref +- rusticl/program: some more API validation +- ci: rusticl with llvmpipe +- rusticl: add README file +- rusticl/device: print error when libclc fails to load +- rusticl/mem: return errors for OOB accesses +- rusticl/device: fix custom device detection +- rusticl/mem: fix image OOB checks +- rusticl/mem: align fill pattern buffer to 4 bytes +- rusticl: limit global mem to 2GB +- rusticl/kernel: assign locations before passing the nir to drivers +- ac/llvm: support non 32 bit sized workgroup ids +- ac/llvm: fix load_const for vectors with more than 4 elements +- ac/llvm: fix load/store_shared for vectors with more than 4 elements +- ac/llvm: fix load_ubo for vectors with more than 4 elements +- radeonsi: stop vectorizing unpack_32_2x16_split +- iris: use images_used instead of num_images +- iris: speed up walking global bindings +- iris: bump IRIS_MAX_GLOBAL_BINDINGS to 128 +- rusticl/kernel: preserve fp16 denorms to fix vload/vstore_half +- rusticl: add support for coherent resources +- rusticl/mesa: add bx() method to PipeTransfer +- rusticl: rework resource mappings a little +- rusticl: add helper ctx wrapper for coherent and direct mapping +- rusticl/mem: rewrite the (un)mapping code +- lp: claim being UMA +- rusticl/mem: propper CL_MEM_ALLOC_HOST_PTR support +- radeonsi: fail creating textures from user memory +- nir/lower_cl_images: set binding +- radeonsi: lower hadd and fisnormal +- rusticl: remove blit code +- rusticl: add RUSTICL_ENABLE to have a global way of enabling devices +- docs: Add documentation for Rusticl's env variables +- ci/llvmpipe: update to new RUSTICL_ENABLE var +- zink: handle more opcodes for CL +- zink: handle f2f16_rtz +- zink: fix spirv_builder_spec_const_uint +- aux/trace: add set_global_binding +- rusticl: force BIND_LINEAR on staging resources +- rusticl/mem: can only map staging textures directly +- rusticl: add small bitset impl +- rusticl/nir: add reads_sysval wrapper +- rusticl/kernel: lower system values before gathering info +- rusticl/kernel: add work_dim lowering +- zink: remove leftover work_dim lowering code +- rusticl/kernel: fix more 32 bit problems +- nir/algebraic: add 8 and 64 bit urol and uror lowering +- nir/algebraic: support CL vector accessors +- nir/algebraic: generalize vector_cmp lowering +- nir/algebraic: add vec8/16 cmp lowering +- iris: invalidate sysvals if grid dimension changes +- glsl: fix buffer texture type +- rusticl/device: put space at the end of CL_DEVICE_VERSION +- nir/lower_int64: fix shift lowering + +Kenneth Graunke (53): + +- intel/compiler: Use named NIR intrinsic const index accessors +- iris: Pass devinfo to iris_resource_level_has_hiz() +- iris: Enable HiZ for non-8x4 aligned miplevels on Icelake and later +- iris: Always retain ISL_AUX_USAGE_HIZ_CCS_WT in texture aux usage +- iris: Clean up iris_sample_with_depth_aux() +- intel/compiler: Drop variable group size lowering +- iris: Fix PIPE_CAP_UMA +- iris: Use linear for exported resources if we can't convey tiling +- intel/compiler: Change dg2_plus check to devinfo->verx10 >= 125 +- iris: Delete unused iris_screen::aperture_bytes field +- iris: Don't print out XXX messages if resource creation fails +- iris: Fall back if iris_map_copy_region can't create a staging resource +- iris: don't create staging resources larger than half the aperture +- crocus: Don't print out XXX messages if resource creation fails +- crocus: Fall back if iris_map_copy_region can't create a staging resource +- crocus: Fix memory leaks on iris_resource_create failure paths +- anv: Fail to create a device on ver < 9 +- anv: Drop checks for version 8 or 9 +- anv: Delete image param support. +- anv: Delete batch buffer growing code. +- anv: Delete shader constants UBO from descriptor sets +- anv/tests: Don't use relocations in a test case +- anv: Delete "back" allocation from state pool +- anv: Delete relocation support from anv_block_pool +- anv: Delete "back" allocation from anv_block_pool +- anv: Drop state pool relocation munging +- anv: Delete use_relocations flag +- anv: Delete softpin checks +- anv: Delete anv_reloc_list_add() +- anv: Delete wrapper BOs for relocations +- anv: Delete relocation support from batch submission +- anv: Delete has_a64_buffer_access flag +- anv: Delete has_bindless_images and has_bindless_samples flags +- anv: Make a helper function for pinning a state pool's BOs +- anv: Drop offset from anv_reloc_list_append +- anv: Inline write_reloc into the only remaining caller +- anv: Remove anv_batch_emit_reloc and just open-code it +- intel/compiler: Use subgroup invocation for ICP handle loads +- iris: Ignore aux for copy_region source if there's no unresolved color +- intel/compiler: Use an existing URB write to end TCS threads when viable +- st/mesa: Optionally call nir_vectorize_tess_levels() +- intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes +- intel/genxml: Add XY_FAST_COLOR_BLT +- blorp: Make blitter_supports_aux accessible from multiple files. +- blorp: Implement blitter clears via XY_FAST_COLOR_BLT +- blorp: Fix typo in blorp_xy_block_copy_blt +- Revert "intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes" +- iris: Add miplevel parameters to iris_resource_texture_aux_usage +- intel/compiler: Vectorize gl_TessLevelInner/Outer[] writes [v2] +- iris: Use nir_intrinsic_load_global_constant for large constants +- intel/compiler: Run nir_opt_large_constants before scalarizing consts +- st/mesa: Let nir_opt_access() infer non-readable +- nir: Drop infer_non_readable option for nir_opt_access() + +Konrad Kleine (1): + +- Update docs/drivers/llvmpipe.rst + +Konstantin Kharlamov (5): + +- loader/dri3: simplify check for reply +- loader/dri3: remove a no-op free call +- meson: remove source_root() call in nir compiler path +- meson: remove source_root() call in main meson.build +- .mailmap: change spelling for Constantine Kharlamov + +Konstantin Seurer (63): + +- radv/nir_lower_abi: Use instructions_pass +- radv: Move accel struct structs to bvh/bvh.h +- radv: Move radv_acceleration_structure +- radv: Remove acceleration structure host builds +- radv: Always emulate fmin/fmax +- radv: Remove accel_struct_build +- radv: Remove unused push constant structs +- radv: Add create_build_pipeline_spv helper +- radv: Add the basics for GLSL bvh kerrnels +- radv: Add a GLSL morton kernel implementation +- radv: Switch to the GLSL morton implementation +- radv: Add a GLSL internal kernel implementation +- radv: Switch to the GLSL internal implementation +- radv: Add a GLSL leaf kernel implementation +- radv: Switch to the GLSL leaf implementation +- lavapipe: Remove st_shader_stage_to_ptarget +- radv: Fix stack size calculation with stage ids +- radv: Inline bvh build headers +- radv/ci: Remove host build related fails +- lavapipe: Set ss_dirty in emit_state +- radv: Advertise subgroup ops for rt stages +- vulkan: Add common physical device management +- radv: Use the common physical device enumeration +- anv: Use the common physical device enumeration +- turnip: Use the common physical device enumeration +- lvp: Use the common physical device enumeration +- panvk: Use the common physical device enumeration +- v3dv: Use the common physical device enumeration +- radv: Proper handling for inactive instance nodes +- radv: Deduplicate push constant structs +- radv/rra: Map accel struct VAs to handles +- radv/rra: Remove redundant bounds validation +- radv: Make the radv_buffer_get_va parameter const +- radv/rra: Replace aliasing assert with a warning +- radv: Explicitly store the VA of accel structs +- radv: Cleanup radv_GetInstanceProcAddr +- radv: Add a common traversal build helper +- radv/rq: Use the common traversal helper +- radv/rtpso: Use the common traversal helper +- radv: Use scalar layout for BDA references +- radv: Add and use AS and scratch layout structs +- radv: Fix incorrect build info indexing +- radv: Use cache_uuid for accel struct compatibility +- radv: Remove main_loop_case_visited +- radv/rra: Transcode nodes recursively +- radv: Rename node_internal to node_box32 +- radv: Add radv_bvh_node_box16 +- radv: Use half floats for box16 coords +- radv/rra: Handle box16 nodes +- radv/rra: Calculate bvh size requirements recursively +- radv: Remove create_accel_build_shader +- radv/rra: Fix dumps in the case of aliasing +- radv/rra: Validate before gathering bvh info +- radv/rra: Add basic header validation +- radv/rra: Continue dumping accel structs if validation fails +- radv/rra: Use the accel struct type for header validation +- radv/rt: Fix setting tmax for opaque AABBs +- radv/rt: Load instance id and custom index on demand +- radv/rt: Set vars.arg in the traversal shader +- radv/rt: Restore prev barycentrics when rejecting hits +- radv/rra: Fix copying accel structs that were not built yet +- radv/ray_queries: Fix AABB handling +- radv/rt: Check space before emitting descriptors + +Kostiantyn Lazukin (1): + +- vulkan/cmd_queue: Do not generate unreachable vk_free_* calls. + +Kuixi Ren (1): + +- radeonsi/vcn: Add ability to encode with ltr + +Laurent Bigonville (1): + +- Try to fix FTBFS on kfreebsd architecture + +Leandro Ribeiro (9): + +- vulkan/wsi/wayland: move some structs to beginning of code +- vulkan/wsi/wayland: remove unnecessary spaces in struct fields +- vulkan/wsi/wayland: introduce struct wsi_wl_surface +- vulkan/wsi/wayland: take ownership of wsi_wl_surface when creating chain +- vulkan/wsi/wayland: move wl_surface and wl_display from chain to struct wsi_wl_surface +- vulkan/wsi/wayland: remove refcount from struct wsi_wl_display +- vulkan/wsi/wayland: add default dma-buf feedback support +- vulkan/wsi/wayland: add per-surface dma-buf feedback support +- vulkan/wsi/wayland: avoid useless re-allocations when receiving per-surface dma-buf feedback + +Leo Liu (2): + +- meson: add with_gallium_virgl to allow it as VA backend driver +- frontends/va: fix build error for vaSyncBuffer with older VA + +LingMan (10): + +- rusticl: Fix compilation if stdout and/or stderr aren't symbols +- rusticl/bindgen: Use \`allowlist-\*\` instead of \`whitelist-\*\` switches +- docs/rusticl: Document minimum required bindgen version +- rusticl/api: Use iterators in create_program_with_source +- rusticl/api: Drop UTF-8 conversion of input source +- rusticl/api: Fix creating a program if a nul byte is within the specified string length +- rusticl/api: Factor pushing to the Vec out of the if +- rusticl/api: Shrink unsafe block +- rusticl/api: Don't check the program source for nul bytes needlessly +- rusticl/api: Interpret \`lengths\` as a slice of Option<NonZeroUsize> + +Lionel Landwerlin (152): + +- anv: fixup PIPE_CONTROL restriction on gfx8 +- intel/ds: track untyped dataport flushes +- intel/fs: store num of resume shaders in prog_data +- intel/fs: fixup simd selection with shader calls +- intel/compiler: document units of brw_ubo_range fields +- intel/nir: specify synchronous value for tracing op +- anv: don't return incorrect error code for vkCreateDescriptorPool +- intel/drm-shim: update shim to support DG2 +- anv: fix GetPipelineExecutableStatistics for ray tracing pipelines +- anv: fixup assertions on lowered storage formats +- anv: fix assert in memory budget code when extension is not supported +- anv: prevent trying to mmap non host visible memory +- anv: remove the LOCAL_MEM allocation bit +- intel/fs: fix load_scratch intrinsic +- intel/fs: fixup scratch load/store handling on Gfx12.5+ +- intel/fs: switch compute push constant loads to LSC +- intel/fs: remove unused opcode +- intel/fs: fixup SEND validation check on overlapping src0/src1 +- intel/fs: switch register allocation spilling to use LSC on Gfx12.5+ +- intel/fs: port block a64/surface messages to use LSC +- intel/fs: bump max SIMD size for A64 atomics with LSC +- intel: add a hasvk vulkan driver +- hasvk: remove entrypoints for gfx9+ +- hasvk: stop advertising Vk 1.3 on non-softpin +- hasvk: remove acceleration structure code +- hasvk: remove ray tracing code +- hasvk: remove mesh code +- anv: silence fedora build warning +- anv: remove support for gfx7/8 +- anv: remove unused gfx7 code +- anv/tests: remove back allocation tests +- anv/hasvk: tweak loading failure messages +- anv: remove non present field in upstream spec +- vulkan: fixup 1.3.226 update +- anv: enable EXT_depth_clamp_zero_one +- hasvk: expose VK_EXT_depth_clamp_zero_one +- anv: add a new NO_LOCAL_MEM allocation flag +- anv: don't export flush_compute_state +- anv: don't export gfx state flushing helper +- anv: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT +- anv: add missing tracepoint +- blorp: defined operations for debug purposes +- intel/utrace: make blorp tracepoints more readable +- anv: trace xfb queries +- anv: clear descriptorsets if AllocateDescriptorSets fails +- anv: add support for EXT_mutable_descriptor_type +- anv: add missing wokraround for texture cache invalidate +- intel/nir/rt: fixup generate hit +- intel/nir/rt: spill/fill the entire ray query data +- intel/nir/rt: remove ray query mem hit writes at initialization +- intel/nir/rt: fix ray query proceed level +- intel/nir/rt: change scratch check validation +- intel/nir/rt: store ray query state in scratch +- intel/fs: fixup a64 messages +- intel/nir/rt: fixup alignment of memcpy iterations +- ci: add python3-ply to debian/fedora images +- ci: disable intel-clc on fedora +- ci: build our own version of the LLVM SPIRV translator +- ci/debian: don't use libclc from the system +- ci: bump llvm to 13 for some builders +- ci: disable intel-clc on debian-vulkan +- anv: remove HDC flush from invalidate bits +- meson: bump required llvm-spirv version with intel-clc +- intel/devinfo: Rename & implement num_dual_subslices +- intel/rt: fix procedural primitive ID access +- intel/nir: reuse rt helper +- intel/fs: take a builder arg for resolve_source_modifiers() +- intel/nir: fixup preserved metadata in rayquery lowering +- intel/nir: fix potential invalid function impl ptr usage +- intel/nir: disable assert on async stack id +- intel/fs: disable split_array_vars on opencl kernels +- genxml: add missing no duplicate anyhit flag +- intel/mi_builder: allow half GP registers for dereferencing +- anv: disable SIMD16 for RT shaders +- anv: add new command buffer space allocation +- anv: setup scratch space correctly for RT shaders +- anv: bump client visible address heap to 32GiB +- anv: use the right dispatch size for tracing shaders +- anv: flag BO for write combine when CPU visible and potentially in lmem +- iris: better error message with GuC loading failures +- intel: add INTEL_DEBUG=capture-all to capture everything upon hang +- pps: enable track_event in intel.cfg +- anv: add grl build dependency on entrypoints +- nir: fix NIR_DEBUG=validate_ssa_dominance +- vulkan/runtime: don't lookup the pipeline disk cache if disabled +- anv: remove unused code +- anv: comment out the Gfx8/9 VB cache key workaround for newer Gens +- anv: limit calls into cmd_buffer_flush_dynamic_state +- anv: optimize 3DSTATE_VF emission +- anv: remove multiple push descriptors +- isl: avoid gfx version switch cases on the hot path +- anv: limit push constant reemission +- anv: track descriptor set layout flags +- anv: initialization pipeline layout to 0s +- anv: add analysis for push descriptor uses and store it in shader cache +- anv: reduce BT emissions & surface state writes with push descriptors +- anv: add a layer for hitman3 +- nir/lower_shader_calls: rematerialize more trivial values +- nir/lower_shader_calls: lower scratch access to format internally +- nir/lower_shader_calls: avoid respilling values +- nir/lower_shader_calls: rematerialize values in more complex cases +- nir/lower_shader_calls: add NIR_PASS_V internally +- nir/lower_shader_calls: cleanup shaders a bit more post split +- nir/lower_shader_calls: add an option structure for future optimizations +- nir/lower_shader_calls: add a pass to trim scratch values +- nir/lower_shader_calls: add a pass to sort/pack values on the stack +- nir/lower_shader_calls: move scratch loads closer to where they're needed +- nir/lower_shader_calls: run opt_cse after lower stack intrinsics +- radv: tweak lower_shader_calls parameters +- anv: enable localized loads for lower_shader_calls +- anv: disable mesh in memcpy +- anv: init major/minor before WSI +- vulkan/wsi/wl: correctly find whether the compositor uses the same GPU +- nir/divergence_analysis: add some missing RT intrinsics +- nir/divergence_analysis: add missing desc_set_address_intel +- intel/clc: assert when libclc shader is not found +- drm-uapi: bump headers +- isl: add new MOCS field for protected buffers +- intel/common: add detection of protected context support +- dri: rename PROTECTED_CONTENT in PROTECTED_SURFACE +- gallium: rename PROTECTED_CONTENT cap into PROTECTED_SURFACE +- egl: Add EGL_EXT_protected_content support +- st/gallium: plumb protected context creation +- iris: handle protected BO creation +- iris: Emit protection & session ID on protected command buffers +- iris: enable protected contexts +- intel/fs: require UNDEFs register offsets to be aligned to REG_SIZE +- intel/fs: make split_virtual_grfs deal with partial undefs +- intel/fs: reduce liveness of variables in lowering passes +- intel/fs: use fs implementation of dump_instructions +- intel/compiler: don't allocate compaction arrays on the stack +- anv: remove shader fp64 inspection after parsing +- anv: Reduce RHWO optimization (Wa_1508744258) +- anv: fix missing VkPhysicalDeviceExtendedDynamicState3PropertiesEXT handling +- anv: fixup invalid enum for nir environment +- anv: split internal surface states from descriptors +- anv: bump pool bucket max allocation size +- anv: get rid of ilog2_round_up +- intel/perf: allocate cleared counter infos +- intel/perf: fix B/C counters accumulation in non query mode +- nir/lower_explicit_io: fix metadata preserve +- nir/lower_shader_calls: update metadata before validation +- nir/lower_shader_calls: wrap only jumps rather than entire code blocks +- Revert "nir/lower_shader_calls: put inserted instructions into a dummy block" +- intel/fs: put scratch surface in the surface state heap +- anv: fix 3d state initialization +- anv: ensure CPS is initialized when KHR_fragment_shading_rate is disabled +- anv: fixup context initialization on DG2 +- intel/fs: improve Wa_22013689345 workaround +- blorp: support negative offsets in addresses +- anv: generate correct addresses for state pool offsets +- nir/divergence: add missing btd_shader_type_intel + +Liviu Prodea (5): + +- d3d12/dzn/spirv2dxil: Require version library +- Microsoft clc: strip lib prefix Otherwise OpenCLon12 ICD can't load it +- meson: Microsoft / maybe Intel CLC need the all-targets workaround just like clover +- meson: Only draw with llvm depends on native directly Tests, softpipe or AMD drivers don't depend on it directly +- clc/clover: Link clang statically when shared-llvm is disabled + +Lucas Stach (38): + +- etnaviv: properly check number of layers in surface creation +- etnaviv: fix TS buffer allocation for 3D textures +- etnaviv: allow 3D textures with TS in transfer +- etnaviv: don't expose array and 3D texture support on pre-halti GPUs +- etnaviv: expose ARB_draw_instanced +- etnaviv: mark instanced draw extensions as supported in docs/features.txt +- etnaviv: move checking for MC2.0 for TS into screen init +- etnaviv: use linear PE rendering only on properly aligned surfaces +- etnaviv: add debug option to disable linear PE feature +- etnaviv: fill alpha channel for DXT1_RGB textures +- etnaviv: don't use TS with multi-layer resources +- etnaviv: texture-desc: sample 1D textures using 2D mode +- etnaviv: texture-desc: drop superfluous condition +- etnaviv: rewrite sampler TS check +- etnaviv: slim down etna_shader_key +- etnaviv: pass shader key by reference +- etnaviv: properly reference flush_resources +- etnaviv: move etna_layout_multiple into etnaviv_resource.c +- etnaviv: assert valid layout in etna_layout_multiple +- etnaviv: compute linear resource Y alignment in etna_layout_multiple +- etnaviv: handle compressed texture formats in etna_layout_multiple +- etnaviv: increase alignment for MSAA resources +- etnaviv: rs: fix MSAA alignment adjustment +- etnaviv: rs: try to find exact format match first +- etnaviv: properly size TS buffer for MSAA resources +- etnaviv: set LOGIC_OP_UNK24 for MSAA rendering on SMALL_MSAA GPUs +- etnaviv: disable PE_COLOR_FORMAT_OVERWRITE with MSAA +- etnaviv: blt: scale operations by MSAA mode +- etnaviv: blt: try to find exact format match first +- etnaviv: update headers from rnndb +- etnaviv: fix late Z with MSAA active +- etnaviv: fix shader register control with MSAA +- etnaviv: switch to late Z when linear PE is used +- etnaviv: fix tile status interaction with write mappings +- etnaviv: always use RS align when GPU has TEXTURE_HALIGN feature +- etnaviv: rs: fix blits with insufficient alignment for dual pipe operation +- etnaviv: blt: use correct TS offset in clear operations +- etnaviv: fix wrong surface TS clear size + +Luis Felipe Strano Moraes (8): + +- anv: fix FTBFS on grl due to changes in clang 15 +- anv: reword info flag in intel_clc's getopt to avoid clash +- anv: add missing separator to help for intel_clc +- anv: fixing typo on description of output flag for intel_clc +- anv: added proper handling for input argument in intel_clc +- anv: adding parsetab.py to the .gitignore for grl +- meson: simplified meson for enabling ray-tracing on Intel +- meson: only enable intel-clc for x86_64 builds + +M Henning (1): + +- nv/nir: Set ssbo CacheMode from intrinsic access + +Marcin Ślusarz (53): + +- intel/compiler: fix mesh urb write regression +- nir/lower_task_shader: print shader after each step +- intel/compiler: add support for non-zero base in [load|store]_shared intrins +- Revert "nir/lower_task_shader: don't use base index for shared memory intrinsics" +- anv: disable task redistribution +- spirv, compiler: add "bool nv" to shader_info.mesh +- intel/compiler: implement EXT_mesh_shader +- anv: check EXT_mesh_shader whenever NV_mesh_shader is checked +- anv: replace VK_SHADER_STAGE_[TASK|MESH]_BIT_NV with VK_SHADER_STAGE_[TASK|MESH]_BIT_EXT +- anv: implement EXT_mesh_shader +- anv: implement draw calls for EXT_mesh_shader +- anv: enable EXT_mesh_shader +- anv: fix emission of primitive replication packet for mesh stage +- nir: add uses_wide_subgroup_intrinsics to task/mesh shader_info +- anv: add support for anv_assume_full_subgroups to task & mesh stages +- anv: small cleanup of anv_graphics_pipeline_compile +- nir, anv, hasvk, radv: pull uses_wide_subgroup_intrinsics into shader_info +- intel/compiler: remove second shading rate lowering for mesh +- intel/compiler: print shader after successful brw_nir_lower_shading_rate_output +- intel/compiler: use nir_lower_task_shader pass +- nir/lower_task_shader: lower small stores & loads to shared when requested +- intel/compiler/task: use shared memory for small task payload loads & stores +- intel/compiler: refactor brw_nir_lower_mem_access_bit_sizes +- intel/compiler: add support for 8/16 bits task payload loads +- radv: use nir_shader_instructions_pass in radv_nir_lower_ycbcr_textures +- iris: remove invalid nir_metadata_preserve from iris_fix_edge_flags +- crocus: remove invalid nir_metadata_preserve from crocus_fix_edge_flags +- glsl: use nir_shader_instructions_pass in gl_nir_lower_atomics +- glsl: use nir_shader_instructions_pass in gl_nir_lower_images +- glsl: use nir_shader_instructions_pass in gl_nir_lower_samplers_as_deref +- nir: use nir_shader_instructions_pass in nir_lower_alu +- nir: use nir_shader_instructions_pass in nir_lower_bool_to_bitsize +- nir: use nir_shader_instructions_pass in nir_lower_bool_to_float +- nir: use nir_shader_instructions_pass in nir_lower_bool_to_int32 +- nir: use nir_shader_instructions_pass in nir_lower_64bit_phis +- nir: use nir_shader_instructions_pass in nir_lower_clamp_color_outputs +- nir: use nir_shader_instructions_pass in nir_lower_clip_disable +- nir: use nir_shader_instructions_pass in nir_lower_clip_halfz +- nir: use nir_shader_instructions_pass in nir_lower_drawpixels +- nir: use nir_shader_instructions_pass in nir_lower_fb_read +- nir: use nir_shader_instructions_pass in nir_lower_frexp +- nir: use nir_metadata_none instead of its value +- nir: use nir_shader_instructions_pass in nir_lower_interpolation +- nir: use nir_shader_instructions_pass in nir_lower_samplers +- nir: use nir_shader_instructions_pass in nir_split_var_copies +- nir: use nir_shader_instructions_pass in nir_split_per_member_structs +- intel/compiler: fix loading of draw_id from task & mesh payload +- anv: add support for mesh shading in INTEL_MEASURE +- intel/ds: add new category/stage for draw mesh events +- anv: add mesh shading tracepoints +- intel/genxml: fix width of 3DSTATE_TASK_CONTROL.MaximumNumberofThreadGroups +- anv: set 3DSTATE_[MESH|TASK]_CONTROL.MaximumNumberofThreadGroups +- anv: program 3DSTATE_MESH_DISTRIB with the recommended values + +Marek Olšák (135): + +- nir: add nir_intrinsic_image_samples_identical +- nir: add nir_intrinsic_image_descriptor_amd +- nir: add nir_texop_descriptor_amd +- nir: add shader_info::uses_resource_info_query for txs, levels, samples, etc. +- ac/llvm: implement nir_intrinsic_image_deref_samples_identical +- ac/llvm: implement nir_intrinsic_image_deref_descriptor_amd +- ac/llvm: implement nir_texop_descriptor_amd +- ac/nir: add ac_nir_lower_resinfo +- radeonsi,radv: run ac_nir_lower_resinfo +- ac/llvm: remove all resinfo code now that it's lowered +- radeonsi: implement a non-scaled compute blit+resolve and use it on gfx11 +- radeonsi: don't assume that TC_ACTION_ENA invalidates L1 cache on gfx9 +- radeonsi/ci: update failing tests on navi21 +- radeonsi: fix a regression due to reordering PIPE_SHADER_* +- ac/llvm: handle external textures in ac_nir_lower_resinfo +- radeonsi: merge both fail paths in si_set_vb_descriptor +- radeonsi: add vertex buffers into the BO list in set_vertex_buffers +- radeonsi: remove vb_descriptors_gpu_list only used for debugging +- radeonsi: remove temporary si_context::vb_descriptor_user_sgprs +- radeonsi: use si_cp_dma_prefetch_inline for prefetching VBO descriptors +- radeonsi: use do..while loops and other cosmetic changes in display list path +- ttn: set the correct sampler declaration type in the presense of txs and lod +- gallivm: fix TXQ crash with MSAA samplers +- gallivm: initialize texture_unit_offset in emit_size_query +- gallium/u_blitter: remove unused code for integer MSAA resolve +- gallium/u_blitter: fix the has_txf support condition +- gallium/u_blitter: treat Z scaling as scaled blits +- gallium/u_blitter: clean up IMMs in util_make_fs_blit_msaa_gen +- gallium/u_blitter: make nearest filtering emulation using TXF conformant +- gallium/u_blitter: make the bilinear filter for MSAA resolving conformant +- glthread: unbind framebuffers in glDeleteFramebuffers +- glthread: call _mesa_glthread_DeleteBuffers unconditionally +- glthread: track GL_READ_FRAMEBUFFER bindings too +- glthread: track glBindFramebufferEXT, not just glBindFramebuffer +- glthread: add missing code for GL_ARB_sparse_texture +- util/xmlconfig: allow drivers to override option values +- radeonsi: rename stop_exec_on_failure -> allow_context_lost +- radeonsi: allow lost context with aux_contexts +- winsys/amdgpu: terminate process on CS rejection when unrobust context is lost +- winsys/amdgpu: flatten huge if and reorder code in amdgpu_cs_submit_ib +- winsys/amdgpu: change num_rejected_cs to a bool flag +- radeonsi/ci: make the running script easy to use +- Revert "mesa: implement a display list / glBitmap texture atlas" +- mesa: create glBitmap textures while creating display lists +- st/mesa: fix potential use-after-free in draw_bitmap_quad +- ci: update pass/fail results for spec@!opengl 1.0@gl-1.0-dlist-bitmap +- winsys/amdgpu: use cached GTT for command buffers and don't set the 32BIT flag +- ci: disable the freedreno farm. +- radeonsi/ci: add skips of tests not built on Linux +- radeonsi: don't pass num_patches via derived_tess_state, pass it via si_context +- radeonsi: make the primitive type constant with tessellation +- radeonsi: move \*rs to its only use in si_draw +- radeonsi: remove the prim_restart_tri_strips_only option +- radeonsi: move fixing ngg_culling into si_update_shaders +- radeonsi: unify the logic that sets rast_prim +- radeonsi: precompute GS_OUT_PRIM in advance +- radeonsi: set GS_STATE_OUTPRIM and PROVOKING_VTX_INDEX only when they change +- radeonsi: cosmetic changes in si_emit_rasterizer_prim_state +- radeonsi: remove 1 draw packet order codepath, keep the first one +- radeonsi: move patch_vertices-related tessellation updates out of si_draw +- radeonsi: move set_patch_vertices into si_state_shaders.cpp +- ac/gpu_info: handle LPDDR4 and 5 in ac_memory_ops_per_clock +- ac/surface: disallow 256KB swizzle modes on gfx11 APUs +- radeonsi: set VS_OUT_MISC_SIDE_BUS_ENA=1 for clip distance exports on gfx10.3 +- mesa: fix glDrawRangeElementsBaseVertex should be compiled into display list +- glthread: fix draws not compiled into a display list should generate an error +- glthread: generate errors for glGet functions between glBegin/End +- glthread: always sync for glShaderSource because invalid params can crash +- glthread: fix glGetIntegerv(GL_CLIENT_ACTIVE_TEXTURE) +- glthread: work around GL_INVALID_OPERATION with OpenGL ES 1.x draws +- frontend/dri: make the backgroundCallable extension optional +- frontend/dri: sync glthread when calling from the app side +- radeonsi: simplify radeonsi_zerovram implementation +- driconf: disable glthread for DeusEx:MD and F1 2015 due to a perf drop +- radeonsi/ci: add glx@glx-swap-event_async to CI failures for glthread +- radeonsi: don't flush asynchronously for fence_server_signal +- glthread: execute glSignalSemaphoreEXT synchronously +- radeonsi: enable glthread by default +- gallium/u_threaded: add an option for unsychronized create_fence_fd +- mesa: replace ALLOC_PRIMS with persistently-allocated context arrays +- mesa: remove _mesa_draw_gallium_fallback by adapting st_feedback_draw_vbo +- mesa: remove unused code using _mesa_prim +- mesa: clean up st_indirect_draw_vbo interface and callers +- mesa: trivial simplifications in _mesa_reference_buffer_object_ +- mesa: inline _mesa_reference_buffer_object +- mesa: inline _mesa_handle_bind_buffer_gen and correct things +- mesa: restructure bind_buffer_object for better performance +- mesa: remove unused gl_buffer_object::Data and Written +- mesa: reorder and pack gl_buffer_object +- vbo: don't call _glapi_set_dispatch for GL_COMPILE_AND_EXECUTE loopback +- mesa: don't restore the dispatch at the end of glCallList +- mesa: remove unused function _mesa_get_dispatch +- mesa: initialize OutsideBeginEnd directly instead of through Exec +- glapi: rename 'vtxfmt' to 'beginend' to make it clear +- api_hw_select_init_h.py: simplify the code +- mesa: remove deprecated comments +- mesa: rename GL dispatch initialization functions to make the intent clear +- glthread: use GLenum16 for enums, but clamp it to 0xffff to get correct errors +- glthread: use 8-bit GLenum for prim modes where it results in better packing +- glthread,hud: draw per-frame values and expose the number of batches per frame +- glthread: skip glMultMatrixf if it's identity +- mesa: use memcmp instead of floating-point comparisons in glMultMatrixf +- mesa: make glPopMatrix a no-op if the matrix hasn't changed +- radeonsi: rename si_create_multi_fence -> si_alloc_fence +- winsys/amdgpu: fix (enable) preemption for chained IBs +- radeonsi: fix max_dw computation for CS preambles +- gl_marshal.py: remove/simplify parameters +- gl_marshal.py: move the unmarshal table into a separately generated file +- gl_marshal.py: rework how the marshal dispatch table is initialized +- gl_marshal.py: remove the -O1 hack and manual SET_* inlining +- gl_marshal.py: inline functions for readability +- gl_marshal.py: simplify print_sync_call and rename to print_call +- glthread: use a constant expression instead of cmd_size in custom functions +- glthread: add more DrawArrays/Elements variants with fewer fields +- glapi: remove EXT and ARB suffixes from Draw functions +- glthread: demystify Draw function names +- glthread: rewrite CallList merging and do it in the app thread +- gallium/u_threaded: don't call simplify_draw_info redundantly +- gl_marshal.py: C style fixups +- glthread: merge and collapse glBindBuffer calls that unbind and then bind +- glthread remove the unused \*last pointer from unmarshal functions +- glthread: don't sync for glIsEnabled(GL_BLEND, GL_LIGHTING, GL_POLYGON_STIPPLE) +- cso: start without u_vbuf by default if it's not always used +- cso: constify some parameters to remove typecasts +- cso: fix broken optimization for sampler state lookups +- cso: don't destroy CSOs that are saved +- cso: inline more functions because some parameters like key_size are literals +- cso: make memcmp and hash computation use a literal key size for blend state +- driconf: add a workaround for a cursor issue with kwin_wayland +- driconf: add a workaround for Cossacks 3 +- nir: add nir_intrinsic_optimization_barrier_vgpr_amd for LLVM +- radeonsi: force the MSAA resolve shader to use 1 clause for MSAA loads +- radeonsi/gfx11: fix compute scratch buffer - WAVES is always per SE +- st/mesa: suppress async glthread flushing for GLX_EXT_texture_from_pixmap +- st/vdpau: fix interop with GL + +Marek Vasut (1): + +- etnaviv: Use old set of state registers for PE configuration on GC880 + +Mario Kleiner (1): + +- vulkan/wsi/display: Reset connector state in vkReleaseDisplay(). + +Mark Collins (8): + +- tu: Implement VK_EXT_non_seamless_cube_map +- tu: Allocate private memory per-device instead of per-pipeline +- tu: Clamp priority in DRM submitqueue creation +- tu: Expose VK_EXT_tooling_info using common implementation +- tu: Retain allocated CSes in tu_autotune_on_submit +- tu: Optimize hash_renderpass_instance by removing XXH64_update +- tu: Only write \`A6XX_PC_PRIMITIVE_CNTL_0` if changed +- tu: Clean up variable usage in \`tu6_draw_common` + +Mark Janes (6): + +- Revert "anv: Do not copy garbage to batch_bo" +- anv: Track BOs that need a write-combined mapping +- anv: Use WC mapped local memory for block pool BO +- anv: Allocate buffers with write-combined local memory +- anv: compile anv_acceleration_structure.c +- anv: work around improper buffer usage in hitman3 + +Martin Krastev (1): + +- meson: add svga gallium driver to 'auto' group on arm/aarch64 + +Martin Roukala (né Peres) (20): + +- radv/ci: document a recent regression +- radv/ci: move some tests from the vega10 fail to its flake list +- radv/ci: add more tests to the flake list of vega10 +- radv/ci: skip host_write_transfer_src.1048576 on renoir +- radv/ci: bump the console activity timeout of VanGogh to 3 minutes +- radv/ci: uprev boot2container +- radv/ci: disable the POLARIS10 jobs +- radv/ci: disable kabini's vkd3d job +- radv/ci: run vkcts on the two steam decks in parallel +- radv/ci: document an unstable test +- radv/ci: move some tests from the renoir fail to its flake list +- ci/b2c: add support for the new format of CI_RUNNER_TAGS +- radv/ci: bump the job count on navi21 +- radv/ci: run two jobs in parallel to reduce the execution time +- zink: update the RADV expectations +- zink: mark a test as a flake in RADV expectations +- zink/ci: mark another test as fixed after the frontend caching series +- zink/ci: add another subtest to the list of known failures for radv +- Revert "glx: Fix drawable refcounting for naked Windows" +- Revert "glx/dri: Fix DRI drawable release at MakeCurrent time" + +Matt Coster (3): + +- util: Include stddef.h in util/macros.h +- pvr: Add pvr_csb_unpack(). +- pvr: Fix assignment to ISP back_a.sref + +Matt Turner (4): + +- intel/tools: Also look for 'batch' tag +- freedreno/ir3: Don't use the broken idiv lowering +- freedreno/ir3: Use nir_opt_idiv_const +- intel/dev: Set display_ver = 13 on all ADL/RPL/DG2 + +Mauro Rossi (5): + +- util: glsl2spirv.py: ensure '#endif' is printed in new line +- hasvk: fix android build and reported API version +- Android.mk: Fix gnu++14 related build failures +- r600/sfn: allow building with clang 6 (Android 9) +- AOSP: Add intel_hasvk vulkan library suffix + +Max Kellermann (2): + +- gallium/u_threaded: add missing reference counts for draw_multi slots +- gallium/u_threaded: fix offset calculation for draw_multi slots + +Michael Skorokhodov (1): + +- egl: Return EGL_BAD_MATCH for invalid share_list + +Michael Tang (3): + +- microsoft/compiler: Add dxil_logger type and parameter to nir_to_dxil +- nir: add nir_instr_as_str +- microsoft/compiler: Change calls to NIR_INSTR_UNSUPPORTED to log to dxil_logger instead + +Michel Dänzer (6): + +- gallivm: Fix LLVMAtomicRMWBinOpFMax spelling +- lavapipe: Fix float32_atomic_min_max spelling +- winsys/amdgpu: Set RADEON_FLAG_32BIT again +- radeonsi/ci: Test piglit quick_shader profile as well +- Revert "egl/glx: add fallback for zink loading" +- vulkan/wsi/wayland: Set num_modifier_lists = 0 if num_drm_modifiers == 0 + +Mihai Preda (21): + +- gallivm: LLVM-15 opaque pointers: disable LLVMGetElementType(ptr_type) +- gallivm: drop unused "emit_load" in lp_llvm_buffer_member() +- gallium: drop unused macros in draw_llvm.h +- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_nir_aos.c +- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_nir_soa.c +- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_jit_types.c +- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_format_s3tc.c +- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_struct.c +- gallivm: fixes for LLVM-15 opaque pointers in lp_bld_format_soa.c +- gallium/draw: fix LLVM opaque pointer warnings in store_clip() +- gallium/draw: fix LLVM opaque pointer warnings in generate_clipmask() +- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_fetch_input() +- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_end_primitive() +- gallium/draw: fix LLVM opaque pointer warnings in draw_gs_llvm_epilogue() +- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_fetch_input() +- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_fetch_output() +- gallium/draw: fix LLVM opaque pointer warnings in draw_tcs_llvm_emit_store_output() +- gallium/draw: fix LLVM opaque pointer warnings in store_aos() +- gallivm: LLVM opaque pointer fixes in lp_bld_sample.c +- gallivm: LLVM opaque pointers: add lp_build_array_get[_ptr]2 in lp_bld_struct.c +- gallivm: LLVM opaque pointers: use lp_build_array_get_ptr2() in lp_bld_nir_soa.c + +Mike Blumenkrantz (443): + +- zink: handle mutable swapchain images with dmabuf +- zink: store VkFormatFeatureFlags on creation +- zink: use stored format features for blit functionality +- zink: use modifier feature flags during surface creation when necessary +- zink: init cache_put program fence on program creation +- zink: always set vertex dynamic states +- zink: hook up pipeline_library extensions +- zink: change zink_create_gfx_program signature +- zink: break out program primtype->idx conversion function +- zink: reorganize zink_gfx_pipeline_state a little +- zink: set program pipeline array idx earlier in zink_get_gfx_pipeline +- zink: add a graphics pipeline library implementation +- tgsi_to_nir: fix clipdistance store writemask +- tgsi_to_nir: handle compact arrays for clipdistance +- zink: force flush clears on fb change if fb geometry changes +- zink: force a new framebuffer for clear_depth_stencil if the clear region is big +- zink: use maintenance4 +- zink: allow sw loading with D3D_ALWAYS_SOFTWARE for nine +- zink: drop mode_changed check from linewidth/depthbias draw updates +- zink: handle unscaled depth bias from nine +- zink: handle !half_pixel_center +- zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE +- zink: explicitly set nir cursor in rewrite_and_discard_read +- zink: handle invalid gl_TexCoord reads +- zink: rewrite all undefined shader reads as 0001 instead of undef +- zink: export PIPE_CAP_MULTISAMPLE_Z_RESOLVE +- zink: support PIPE_QUERY_TIMESTAMP_DISJOINT +- nine: check return on resource_get_handle +- zink: add u_queue.h to kopper header +- zink: demote dmabuf tiling to linear if modifiers aren't supported +- zink: use modifier_aspect to check for modifier plane in zink_resource_get_param +- zink: don't add modifiers if EXT_image_drm_format_modifier isn't present +- zink: add all format modifiers when adding for dmabuf export +- zink: don't fixup sparse texops +- zink: remove an old RGBA4 nvidia workaround +- vulkan: Update the XML and headers to 1.3.224 +- radv: fix return type for meta resolve shaders +- nir/validate: clamp unsized tex dests to 32bit +- nir/validate: add some (light) validation for sampler type matching +- radv: remove stupid conditional from draw path +- radv: force inlining on another draw function +- mesa: fix blending when using luminance/intensity emulation +- mesa: require render target bind for A/L/I in format selection +- mesa/st: add ignore_srgb_decode param to st_convert_sampler +- mesa/st: plumb glsl130_or_later through sampler creation +- mesa/st: use plumbed value for srgb_skip_decode in sampler conversion +- gallium: convert PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE to enum +- gallium: add PIPE_QUIRK_TEXTURE_BORDER_COLOR_SWIZZLE_ALPHA_NOT_W +- util/format: handle SRGB in util_format_luminance_to_red() +- zink: stop modifying samplerview swizzle on base struct +- zink: add some format functions for emulating alpha formats +- zink: reject native blits for emulated alpha formats +- zink: handle emulated alpha format clears +- zink: support emulating alpha formats using new border color quirk +- zink: add a driver workaround for broken L4A4 on nvidia +- mesa: track format swizzles on teximage objects +- mesa/st: use pre-calculated format swizzle for samplerviews +- mesa: track gl_texture_object swizzle internally +- mesa/st: use tracked samplerview swizzle values +- mesa/st: simplify sampler border color swizzling +- zink: remove cached descriptor mode +- zink: remove checks for lazy descriptor mode + templates ext +- zink: remove descriptor vtable +- zink: delete a lot of descriptor code +- zink: move lazy descriptor functions into zink_descriptors.c +- zink: delete unused struct types +- zink: move all types/enums into zink_types.h +- zink: move lazy descriptor data types into zink_types.h +- zink: delete some unused descriptor struct members +- zink: remove descriptor dummy_set +- zink: merge context descriptor data structs +- zink: merge batch descriptor data structs +- zink: merge context descriptor data onto context struct +- zink: merge program descriptor data onto program struct +- zink: merge batch descriptor data onto batch state +- zink: reorder some descriptor structs +- zink: rename _lazy descriptor stuff +- zink: use statically sized array for descriptor allocation +- zink: ZINK_SHADER_COUNT -> ZINK_GFX_SHADER_COUNT +- zink: explicitly define ZINK_GFX_SHADER_COUNT +- zink: PIPE_SHADER_TYPES -> MESA_SHADER_STAGES +- zink: remove all pipe_shader_type usage +- zink: delete zink_shader_stage() +- zink: fix program cache comparisons +- zink: require EXT_non_seamless_cube_map for GPL support +- zink: (correctly) require extendedDynamicState2PatchControlPoints for GPL +- zink: use a maybe more accurate wild guess for pcp-less gpl +- zink: require VK_KHR_descriptor_update_template +- zink: add an id member for zink_descriptor_pool_key +- zink: use a dynarray instead of hash table for tracking pools +- zink: use a single allocation for zink_descriptor_layout_key +- zink: rework descriptor pool overflow +- nir: add uses_bindless flag for shader_info +- zink: bail out of dmabuf resource creation when srgb explosion is expected +- zink: iterate over all modifiers and nuke the ones that aren't supported +- zink: try to reuse swapchain modifier for dmabuf export +- ci: remove broken device-select-layer from build +- vk/render_pass: don't deref null resolve attachments +- Revert "zink: try to reuse swapchain modifier for dmabuf export" +- Revert "zink: add all format modifiers when adding for dmabuf export" +- zink: deduplicate some program creation code +- zink: deduplicate some program destroy code +- zink: move gfx program update/creation into zink_program.c +- zink: remove cpu conditional render hack +- zink: add c++ guards to zink_pipeline.h +- zink: move pipeline cache structs to zink_program.h +- zink: move gfx pipeline recalc to c++ +- zink: move pipeline state comparison to c++ template +- zink: use template to eliminate more instructions in pipeline state comparison +- zink: remove some gfx pipeline state members +- zink: fix shader key struct packing +- zink: rework bit iteration in update_gfx_shader_modules to use for loop +- zink: simplify shader key comparison by splitting out non-generated tcs +- zink: plumb some screen info through gfx shader creation +- zink: plumb screen info through shader key comparison +- zink: split out nonseamless/inline shader module info calculation +- zink: split out gfx shader module creation from cache searching +- zink: split out first-time shader module creation +- zink: use a list for free batch states +- zink: add templates for pipeline stage comparison +- zink: use stage comparison template to ignore vertices_per_patch +- zink: cache dynamic render vs renderpass pipelines separately +- zink: make program cache values more intelligible +- zink: change u_foreach_bit to regular for loop in zink_gfx_program_update +- zink: use dynarray instead of list for internal shader cache +- zink: pass shader stage as param for gfx module update +- zink: constify shader module variable +- zink: make zink_bo.h compile in c++ +- zink: add more c++ guards to headers +- zink: remove c++ guard from zink_resource.h +- zink: move zink_batch_resource_usage_set to be static inline +- zink: add is_buffer param to zink_batch_resource_usage_set +- zink: use PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND +- zink: merge all the batch state sets onto the struct +- zink: make batch internal api static again +- zink: split resource tracking into two sets for buffer/texture +- zink: track which stages in gfx programs are using inlined uniforms +- zink: add gfx pipeline lookup shortcut +- st_pbo/compute: handle download failures with fallback +- st_pbo/compute: fix 1D_ARRAY offsets +- st_pbo/compute: fix 1D coord dimension by pre-trimming vectors +- st_pbo/compute: pre-clamp shader geometry to 16bit +- st_pbo/compute: pre-clamp loaded geometry based on coord components +- zink: handle nir_intrinsic_sparse_residency_code_and mechanics +- zink: ignore nir_texop_lod for tex dest matching +- vulkan: Update the XML and headers to 1.3.225 +- zink: support PIPE_CAP_FBFETCH_COHERENT +- tu: fix invalid free on alloc failure +- lavapipe: stop tracking descriptor image layout +- lavapipe: use cso for compute samplers too +- lavapipe: remove lvp_sampler::state +- lavapipe: replace lvp_sampler internals with pipe_sampler_state +- lavapipe: use pipe_sampler_state directly in descriptor info +- lavapipe: delete lvp_buffer::offset +- lavapipe: use gallium buffer descriptor types directly +- lavapipe: apply VK_WHOLE_SIZE to bufferviews at creation +- lavapipe: create gallium descriptor image/sampler view types for views +- lavapipe: delete some code +- zink: use screen param for zink_descriptor_program_deinit +- zink: allow programs to solely manage descriptor deinit +- zink: create compute programs from compute shaders directly +- zink: initialize cache_get_thread unconditionally +- zink: add use_local_size flag to compute programs +- zink: don't call util_queue_fence_init in zink_screen_get_pipeline_cache() +- zink: add param to allow cache_get jobs to run directly +- zink: add a pipeline shortcut for basic compute programs +- zink: merge compute program hash table onto program struct +- zink: don't add compute base pipeline to hash table +- zink: remove use_local_size from compute pipeline state +- zink: stop zeroing local size if current compute doesn't use it +- zink: use different key comparison for local_size-using compute pipelines +- zink: track whether shader has cube samplers +- zink: move compute pipeline cache update to caller +- zink: add a param to allow zink_screen_update_pipeline_cache to run directly +- zink: move compute descriptor binds down in execution +- zink: add precompilation for compute shaders +- zink: add partial async shader compile support +- zink: add locking for descriptor layout caches +- zink: add a context pointer to zink_program struct +- zink: init fbfetch/bindless ctx stuff on shader create +- zink: generate sha1 for compute programs +- zink: add function to check whether a shader has cube samplers +- zink: defer all compute shader creation +- zink: don't emit entrypoints for function temp variables +- zink: check the variable mode before taking samplemask path in ntv +- zink: explicitly use unsigned types for bit shifts +- zink: store VkImageViewUsageCreateInfo for surface creation +- zink: only add srgb mutable for images with modifiers +- zink: avoid needing mutable for z/s sampling +- zink: don't set MUTABLE by default +- zink: don't emit illegal interpolation +- vulkan: initialize a variable +- llvmpipe: don't assume pipe_context is always available in flush_frontbuffer +- mesa: add GLSL_SOURCE +- zink: add spirv builder function for demote +- zink: hook up demote extension +- zink: fix sharedmem ops with bit_size!=32 +- zink: flag all assigned output slots as mapped +- zink: add a ci skip for anv +- mesa/st: split out builtin shader finish +- st_pbo/compute: use split shader finish funcs +- mesa/st: move compute pbo shutdown to compute pbo file +- gallium: add pipe_screen::driver_thread_add_job +- st_pbo/compute: use new shader interface to perform async shader creation +- mesa/st: add specialized pbo download shaders +- mesa/st: add MESA_COMPUTE_PBO env var +- zink: implement async nir creation +- lavapipe: VK_EXT_attachment_feedback_loop_layout +- lavapipe: ARM/EXT_rasterization_order_attachment_access +- docs: add more features +- radv: avoid bottlenecking on sequential sparse buffer binds +- zink: handle split acquire/present +- zink: uncap ssbo size +- zink: remove special-casing for 64bit runtime array emission +- zink: always set var used by get_ssbo_size to the 32bit var +- zink: simplify ntv shader descriptor emission +- zink: emit Aliased decoration for aliased bo descriptors +- zink: fix atomic ssbo indexing with non-32bit values +- zink: handle 64bit float atomics +- zink: export PIPE_CAP_IMAGE_ATOMIC_FLOAT_ADD +- zink: export PIPE_CAP_SHADER_ATOMIC_INT64 +- zink: don't add void clears if a full clear already exists +- zink: split up get_clear_data() +- zink: make void clears more robust +- zink: rewrite clears on fb bind if only the format has changed +- zink: add have_vulkan13 to device info +- zink: add functions for using '2' variants of pipeline barriers +- zink: add screen interfaces for pipeline barriers +- zink: use screen interfaces for pipeline barriers +- zink: fix/relax resolve geometry check +- zink: defer acquire semaphore destruction +- zink: fix atomic ssbo fadd offsets +- st/bitmap: use normalized coords in samplers with lowered rects +- st/drawpixels: use normalized coords in samplers with lowered rects +- zink: disable GPL when sample shading is in use +- anv: force inline more pipe flush functions +- ci: add a panfrost flake +- ci: add nouveau flakes +- egl/glx: add fallback for zink loading +- zink: don't call CmdBindVertexBuffers2EXT with no attributes +- zink: don't always set VK_DYNAMIC_STATE_VERTEX_INPUT_BINDING_STRIDE_EXT with gpl +- zink: clamp z/s formats to aspected format for border colors +- zink: move color channel clamping to zink_format.c +- zink: add border color clamping when customBorderColorWithoutFormat isn't supported +- zink: add srgb border color clamping +- zink: add stencil border color clamping +- zink: check format compatibility up front when creating framebuffer surface +- zink: rework flush_frontbuffer to always flush +- zink: defer more semaphore destruction +- zink: rework batch tracking for resources +- zink bump batch state limit a bit +- zink: split out bind_sampler_states hook for drivers without EXT_non_seamless_cube_map +- zink: remove batch tracking/usage from view types +- zink: remove batch usage from sampler states +- zink: delete more sw conditional render handling +- zink: delete some descriptor caching cruft +- glsl: fix gl_CullDistance xfb linking +- zink: handle culldistance xfb outputs like clipdistance +- zink: track a mask of gfx stages remaining in gfx programs +- zink: track the owner context for gfx programs +- zink: use screen params for program reference functions +- zink: don't try to delete programs from the cache if stages have been modified +- zink: add locking for program caches +- zink: use program's context for cache deletion +- zink: use screen param for zink_shader_free() +- zink: PIPE_CAP_SHAREABLE_SHADERS +- radv: ALWAYS_INLINE radv_after_draw +- radv: ALWAYS_INLINE radv_flush_descriptors +- radv: ALWAYS_INLINE radv_is_streamout_enabled() +- radv: store pointer to sgprs for last vertex stage +- radv: use direct access to last_vgt_api_stage_locs for sgpr emission +- zink: add a bool flag for decomposed vertex elements +- zink: check on init whether decomposed attrs are needed +- zink: wrap zink_shader_key_base access for nonseamless cubemap use +- zink: remove a bunch of casts from shader key inlines +- zink: handle inline uniforms during first-time shader generation +- zink: copy the non-dirty shader stages when updating gfx program +- zink: move to AoS for gfx program shader modules +- zink: ALWAYS_INLINE bind_gfx_stage +- zink: add some padding to small shader keys for byte-alignment +- zink: add 'optimal_keys' handling for shader keys +- zink: use optimal_keys to massively reduce size of pipeline cache comparisons +- zink: split dirty_shader_stages for gfx and compute +- zink: simplify bit tracking in zink_gfx_program_update() +- zink: be even more granular with optimal_key program updates +- zink: unspaghettify some program update code +- lavapipe: set writemask for compute shader buffers +- lavapipe: propagate shader access info across libraries +- zink: copy optimal key always +- zink: add asserts to verify optimal key state +- zink: fix some batch tracking leaks +- zink: free vertex_state_cache on shutdown +- zink: always unset var->data.explicit_xfb_buffer +- zink: fix sparse queue creation +- zink: fix handling for ssbos that are just runtime arrays +- zink: name bo variables using bitsize +- zink: avoid overflow when clamping bufferviews +- zink: allow reordered clear_buffer calls +- zink: remove color_write_missing driver workaround +- zink: only emit runtime array for ssbos if present +- mesa: set normalized_coords for bindless texture buffer samplers +- zink: enable VK_ATTACHMENT_LOAD_OP_DONT_CARE for zs in renderpasses +- zink: remove zink_context::new_swapchain +- zink: rename fb_binds -> fb_bind_count +- zink: reorder zink_resource a little +- zink: add a mask of fb attachment idx for resources +- zink: use feedback loop layout to correctly handle implicit feedback loops +- zink: always set VK_PIPELINE_CREATE_COLOR_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT sometimes +- nir/opt_undef: add a pass to clean up 64bit undefs +- zink: don't flatten 64bit arrays during rewrite +- zink: always run optimize_nir after lower_64bit_vars +- zink: split up lower_64bit_vars pass +- zink: also lower 64bit function temps +- aux/trace: dump line_rectangular member of rasterizer state +- aux/trace: dump depth_clamp for rasterizer state +- zink: always unflag unordered_write when binding image descriptors +- zink: set layouts before possibly reordering image copies +- zink: move zink_screen_get_pipeline_cache() calls out to callers +- zink: rework ZINK_PIPELINE_LIBRARY_FORCE into ZINK_DEBUG flag +- zink: move create_pipeline_lib to zink_program.c +- zink: hook up VK_EXT_depth_clamp_zero_one +- zink: pass actual screen->threaded to zink_tc_context_unwrap() +- vulkan: Update the XML and headers to 1.3.230 +- lavapipe: dynamic state3 +- zink: add a draw barrier when unbinding a sampler-bound fb surface +- zink: replace mixed_zs with zs feedback loops +- glthread: use signed vertex buffer offsets when available, don't require them +- glthread: handle DeleteBuffers(n=-1) gracefully +- zink: fix/improve handling for multi-component bitfield ops +- zink: export PIPE_CAP_ALLOW_MAPPED_BUFFERS_DURING_EXECUTION +- zink: export PIPE_CAP_MAP_UNSYNCHRONIZED_THREAD_SAFE +- pipe-loader: fix zink driinfo header path +- zink: enable glthread by default +- zink: call flush_resource when presenting garbage +- lavapipe: improve blend handling on pipeline binds +- zink: use dynamic state3 rasterization states when available +- zink: use more dynamic state3 when available +- zink: add tgl ci skip +- zink: don't access prog->shaders array during pipeline compile +- zink: compress zink_gfx_output_key members +- zink: further compress zink_gfx_output_key lookups +- zink: set VK_PIPELINE_CREATE_RETAIN_LINK_TIME_OPTIMIZATION_INFO_BIT_EXT +- zink: require optimal_keys for GPL +- zink: add param to disable optimization when combining pipeline libraries +- zink: move binding_map to hw_state part of vertex state +- zink: reorder zink_destroy_gfx_program() +- zink: don't deref prog->shaders during pipeline construction +- zink: use an extra pipeline state bit to track coherent fbfetch usage for gpl outputs +- zink: reenable GPL +- zink: define zink_pipeline_dynamic_state3 as zink_rasterizer_hw_state +- zink: set depth clamp +- remove xvmc +- delete rbug +- delete graw tests +- vk/graphics_state: don't set default sample locations if rast samples is dynamic +- zink: allow creating full pipelines unoptimized +- zink: gfx_pipeline_cache_entry -> zink_gfx_pipeline_cache_entry +- zink: change zink_gfx_program::last_pipeline -> struct zink_gfx_pipeline_cache_entry +- zink: implement async gfx precompile +- zink: add ZINK_DEBUG=shaderdb +- zink: move descriptor layout/pool stuff to screen object +- zink: add a safety assert in descriptor updates +- zink: add some bda handling +- zink: handle global and scratch vars +- zink: move descriptor function up in file +- zink: handle oom better during descriptor updating +- zink: fix pool overflow handling on batch reset +- zink: rework/improve descriptor pool overflow handling on batch reset +- zink: delete view objects when unsetting resource usage in batch reset +- zink: prevent ballooning of view object memory +- lavapipe: store compiler options to physical device +- zink: check core feature for pipeline cache control +- zink: don't add other usage bits for transient images +- zink: clamp line_stipple_factor to 1 if stipple is disabled +- zink: unset rp_changed after initializing renderpass attachments +- zink: disable fbfetch when flushing clears +- glthread: fix buffer allocation size with non-signed buffer offset path +- zink: enable tc cpu storage on some buffers +- zink: clamp feedback loop layout to GENERAL if the ext isn't supported +- zink: delete some big struct members from zink_context +- zink: reorder context init to handle COMPUTE_ONLY +- zink: clamp shader bindings for compute descriptors +- zink: add ci flake +- zink: move spec constant emission to the types/consts block +- zink: set default spec constant value to 1 +- zink: add docs for zink_batch.c +- zink: delete/cleanup some descriptor #defines +- zink: delete unused descriptor struct field +- zink: start on some type docs +- zink: fix duplicate VK_DYNAMIC_STATE_LINE_STIPPLE_EXT setting +- zink: fix ds3 line stipple enable handling +- driconf: add mesa_glthread=false for gfxbench +- lavapipe: store binding offset for buffers +- lavapipe: fix BufferDeviceAddress usage +- zink: fix gpl compile optimization +- zink: decrement ZINK_FBFETCH_BINDING +- zink: don't flag ubo0 as used in shaders with indirect ubo access +- zink: allow direct memory mapping for any COHERENT+CACHED buffer +- zink: allocate/place memory using memoryTypeIndex directly +- zink: support multiple heaps per memory type +- zink: delete unused zink descriptor mode +- zink: rename some variables in descriptor code +- zink: ZINK_DESCRIPTOR_TYPES -> ZINK_DESCRIPTOR_BASE_TYPES +- zink: add more zink_descriptor_type enum values to improve readability +- zink: delete unused descriptor function +- zink: stop passing pointer to changed_sets to populate_sets() +- zink: add some descriptor docs +- zink: add some docs for c++ pipeline state functions +- zink: simplify some program descriptor helpers +- zink: split out uniform decriptor bindings in shader data +- zink: flag push state changed in batch changed conditional +- zink: rescope/simplify some push descriptor code +- zink: elide a buffer samplerview update conditional +- zink: handle broken resource mapping deadlocks +- zink: simplify swapchain imageview handling +- zink: delete stale comment for zink_surface +- zink: simplify conditional for surface rebind no-ops +- zink: use zink_resource_object::views to defer deferred storage view deletion +- zink: delete unused zink_surface member +- zink: add some breadcrumbs for VK_EXT_multisampled_render_to_single_sampled +- zink: add docs for zink_surface +- zink: match bitsizes in bo rewriting +- zink: pass KERNEL shaders through successfully +- zink: rework sampler emission +- zink: pass image type to image emission +- zink: add a nir pass for CL image typing and sampler tracking +- zink: simplify image deref handling +- zink: add some spirv builder handling for sampled image ops +- zink: emit explicit samplers in ntv based on sampler_mask +- zink: always take the compact descriptor size path with kernel shaders +- zink: add handling on the vk side for discrete sampler descriptors +- zink: add handling for CL-style discrete shader samplers +- zink: rip out unused kernel push constant +- zink: fix weird formatting +- zink: don't force attachment usage for images when unsupported +- zink: always add MUTABLE with EXTENDED_USAGE for image creation +- zink: don't double-deref bindless texture arrays in shaders +- util/tc: split out flush and deferred flush calls +- util/tc: split out dsa and fs state cso handling +- util/tc: implement renderpass tracking +- zink: add a context flag to indicate when blitter is running +- zink: use tc renderpass tracking to optimize renderpasses +- zink: enable renderpass optimizing for turnip jobs + +Ming Li (1): + +- docs: Add d3d10umd and lavapipe to the doc for the gallium frontends. + +Mykhailo Skorokhodov (8): + +- util: Add glsl2spirv.py script +- nir: Make lower_double_ops recognize SPIR-V mangling +- nir: Add assert in nir_lower_doubles +- glsl/meson: Add variable to export float64.glsl +- anv/meson: Add float64_spv_h custom target +- anv: Add softtp64 workaround +- drirc: Add fp64_workaround_enabled option +- drirc: Apply fp64_workaround_enabled to DOOM Eternal + +Nanley Chery (8): + +- mesa/st: Set PIPE_MAP_DISCARD_RANGE in fallback_copy_image +- mesa/st: Add and use st_texture_image_insert_transfer +- mesa/st: Add and use st_texture_image_transfer::box +- mesa/st: Don't map compressed texture fallbacks for reads +- mesa/st: Fix indentation in st_MapTextureImage +- mesa/st: Drop st_texture_image_transfer::map +- iris: Enable INTEL_MEASURE for compute dispatches on XeHP +- iris: Reduce use of RHWO optimization (Wa_1508744258) + +Nicholas Bishop (2): + +- crocus: Advertise PIPE_CAP_NATIVE_FENCE_FD +- crocus: fix off-by-one error when clearing stale syncobjs + +Oleksii Bozhenko (2): + +- vulkan/wsi: Pass wsi_image_create_info into anv_GetPhysicalDeviceImageFormatProperties2 +- anv: Allow aliasing with modifiers for WSI images + +Omar Akkila (5): + +- ci: Merge common builds in debian test jobs +- ci: Move crosvm to the base test image +- ci: Build crosvm with opaque fd mapping support +- ci: Enable virtio-experimental in testing +- ci: Add vulkan dEQP tests for venus + +Paul Gofman (1): + +- vbo/dlist: keep buffers used in loopback_vertex_list() mapped. + +Pavel Ondračka (22): + +- r300: fix variables detection for paired ALU and TEX instructions in different branches +- r300: allow constant swizzles with inline constant +- r300: fix negate mask computation when merging movs +- r300: check for identical saturate mode when merging MOVs +- r300: run dataflow optimizations in separate loops +- r300: generalize the merge_movs pass +- r300: don't merge w channel in fragment shaders +- r300: merge MOVs into ADD using the 0 swizzle +- r300: merge MOVs with MULs or ADDs in merge channels +- r300: Set more shadow sampler lowering in precompiles. +- r300: allow presubtract when both ADD sources are negative +- r300: run copy propagate once more after merging channels +- r300: merge together MOV and MAD instructions +- r300: add special path for merging movs with the same source +- r300: add some synchronization for KIL +- r300: fix register rewrite when converting rbg instructions to alpha +- r300: reduce CPU overhead in IF transformation pass +- Revert "r300: be less agresive with copy propagate in loops" +- r300: fix reader detection with breaks +- r300: be more careful when pair merging with presubtract +- r300: fix generation of invalid swizzles in regalloc +- r300: respect buffer offset in r300_set_constant_buffer + +Pedro J. Estébanez (5): + +- nir2dxil: Lower texture projections +- nir2dxil: Use native helper lane intrinsic on SM >= 6.6 +- microsoft/spirv2dxil: Refactor to make SM and validator versions public +- microsoft/spirv2dxil: Change \`spirv_to_dxil()` to receive SM and validator versions +- microsoft/spirv2dxil: Support pack_32_2x16 and pack_32_4x8 + +Pierre Moreau (11): + +- nv50,nvc0: Do not resize global residents if unnecessary +- nv50: Rename fixups to relocs +- nv50: Rename interps to fixups +- nv50: Report actual VRAM size +- nv50: Disallow allocating more than VRAM size +- nv50: Mark RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY as unsupported +- nv50/ir: Handle non-32-bit values when cst folding SPLIT +- nv50/peephole: Disallow combining sub 4-byte ld/st for now +- nv50/ra: Fix the offset computation for compounds +- nv50/ir: Avoid generating splits of splits +- nv50/nir: A group barrier is CTA-level not global-level + +Pierre-Eric Pelloux-Prayer (72): + +- loader: don't return empty string in loader_get_dri_config_device_id +- loader: allow DRI_PRIME=vendor_id:device_id syntax +- vulkan/device_select: print the dri_prime warning only if needed +- vulkan/device_select: allow DRI_PRIME=vendor_id:device_id +- docs: document DRI_PRIME +- gallium/dri2: make protected buffer checks opt-in +- amdgpu/bo: update uses_secure_bos when importing buffers +- radeonsi: allocate BIND_RENDER_TARGET as tmz instead of SCANOUT +- radeonsi: print radeon_bo_flag when AMD_DEBUG=vm is used +- nir: add a nir_opt_if_options enum +- radeonsi/tests: remove left-over debug print +- radeonsi/tests: add testing of the khr-single tests +- radeonsi/tests: update raven expected results +- radeonsi/tests: update navi21 expected results +- radeonsi/tests: cosmetic changes +- radeonsi: prevent u_blitter recursion in si_update_ps_colorbuf0_slot +- radeonsi: use LOAD_CONTEXT_REG_INDEX for VGT_STRMOUT_DRAW_OPAQUE +- ac/llvm: add a return value to ac_nir_translate +- radeonsi: deal with ac_nir_translate failures +- radv: deal with ac_nir_translate failures +- radeonsi: use nir_opt_large_constants earlier +- radeonsi/sqtt: set stable pstate if possible +- mesa: avoid reading back textures from VRAM +- mesa: simplify _mesa_texstore_z24_s8 and s8_z24 +- mesa: remove fallback for GL_DEPTH_STENCIL +- radeonsi: invalidate L2 when using dcc stores +- egl/wayland: fix glthread crashes +- winsys/amdgpu: init 'r' before using it +- radeonsi/gfx11: don't set VERTS_PER_SUBGRP to 0 +- tc: do a GPU->CPU copy to initialize cpu_storage +- tc: don't use CPU storage for glBufferData +- ac/llvm: remove unused vars hash table +- radeonsi: remove USE_LDS_SYMBOLS +- radeonsi: use nir_lower_explicit_io to get rid of mem_shared deref +- ac/llvm: drop visit_deref +- ac/llvm: simplify get_memory_ptr +- ac/llvm: add helpers to get pointer types of ac_arg +- ac/llvm: remove LLVMBuildGEP usages +- ac/llvm: add a ac_llvm_pointer type +- ac/llvm: port functions to use ac_llvm_pointer +- ac/llvm: switch ac_build_gep_ptr to LLVMBuildGEP2 +- ac/llvm: switch ac_build_pointer_add to LLVMBuildGEP2 +- ac/llvm: add type param to ac_build_load_invariant +- ac/llvm: add xxxx2 functions to iteratively port to opaque pointers +- ac/llvm: store lds as ac_llvm_pointer +- ac/llvm: store constant_data and scratch as ac_llvm_pointer +- ac/llvm: remove unneeded casts +- radv/llvm: use ac_build_gep0_type to get args types +- radeonsi: use ac_get_arg_pointee_type to get the right type +- radeonsi: use LLVMBuildGEP2 in si_build_gep_i8_var +- radeonsi: make ngg_gs_get_vertex_storage return a ac_llvm_pointer +- radeonsi: port the remaining code to opaque pointers +- ac/llvm: remove gep_2 and others temporary functions +- radv/llvm: replace LLVMBuildLoad by LLVMBuildLoad2 +- ac/llvm: add ac_get_ptr_arg +- ac/llvm: pass ac_llvm_ptr instead of separate type/ptr +- Revert "ac: use LLVMContextSetOpaquePointers if available" +- radeonsi: zero init using the correct components count +- radeonsi/sqtt: notify sqtt when we skip a flush +- radeonsi/sqtt: re-export shaders in a single bo +- radeonsi/sqtt: allow to disable spm counters +- radeonsi/tests: handle flakes file for all test suite +- radeonsi/tests: simplify the script +- radeonsi/tests: update expected results for navi21 +- radeonsi/tests: update expected results for navi10 +- radeonsi/tests: update expected results for polaris11 +- radeonsi: simplify si_prefetch_shaders +- radeonsi/sqtt: simplify condition to determine if sqtt is on +- radeonsi: store the shader gpu adress in si_shader +- radeonsi: handle sqtt pipeline in shader prefetch +- radeonsi/gfx11: enable sdma copy DRI_PRIME +- radeonsi: update db_eqaa even if msaa is disabled + +PixelyIon (1): + +- tu: Make KGSL wait IOCTLs interrupt-safe + +Qiang Yu (62): + +- nir/lower_gs_intrinsics: fix primitive count for points +- radeonsi: fix tcs_out_lds_offsets arg alignment +- winsys/amdgpu: fix non-page-aligned sparse buffer creation +- nir,ac/llvm: add nir_intrinsic_load_half_line_width_amd +- nir,ac/llvm: add nir_intrinsic_load_viewport_xy_scale_and_offset +- ac/nir/ngg,radv: use nir_load_viewport_xy_scale_and_offset +- ac/nir/cull: support caller react when primitive is rejected +- ac/nir/ngg: support line culling +- ac/nir/ngg: fix and simplify gs store output lower +- ac/nir/ngg: use same driver location for gs output +- ac/nir/ngg: save and restore output bit size for gs +- ac/nir/ngg: save and restore position output base for nogs +- ac/nir/ngg: add gs culling +- ac/nir/ngg: support component position store +- ac/nir/ngg: support clipdist culling +- ac/llvm: cast tes_u/v_replaced to float +- radeonsi: implement nir_intrinsic_load_half_line_width_amd +- radeonsi: implement nir_intrinsic_load_viewport_xy_scale_and_offset +- radeonsi: implement cull nir intrinsics +- ac/llvm: move culling replaced registers to abi +- radeonsi: use replaced values when present +- radeonsi: vs load input re-calculate vertex index after culling +- radeonsi: add and implement load_user_clip_plane abi +- nir: fix nir_xfb_info buffer_to_stream length +- nir: add nir_intrinsic_load_num_vertices_per_primitive_amd +- nir: add nir_intrinsic_load_streamout_buffer_amd +- nir,ac/llvm: add nir_intrinsic_load_ordered_id_amd +- nir: add nir_intrinsic_ordered_xfb_counter_add_amd +- ac/llvm: implement nir_intrinsic_ordered_xfb_counter_add_amd +- ac/nir/ngg: cleanup prim id to prepare for streamout +- ac/nir/ngg: nogs support streamout +- ac/nir/ngg: ngg_gs_load_out_vtx_primflag support stream +- ac/nir/ngg: support multi stream per output slot for gs +- ac/nir/ngg: support gs streamout +- radeonsi: implement nir_intrinsic_load_num_vertices_per_primitive_amd +- radeonsi: implement nir_intrinsic_load_streamout_buffer_amd +- radeonsi: print out remove_streamout shader key +- nir,ac/llvm,radv: add nir_intrinsic_load_provoking_vtx_in_prim_amd +- ac/nir/ngg: use nir_load_provoking_vtx_in_prim_amd in ngg lower +- radv: count gen_prims_queries_enabled +- radv: split active_pipeline_gds_queries +- nir,ac/nir/ngg,radv: split shader_query_enabled_amd +- ac/nir/ngg,radv: use different counters for shader queries +- nir,ac,radv: add primitive count add intrinsics +- nir: add nir_load_prim_xfb_query_enabled_amd +- ac/nir/ngg: add query param to ac_nir_lower_ngg_gs +- ac/nir/ngg: add streamout emitted primitive query +- radeonsi: implement llvm abi atomic_add_prim_count +- radeonsi: implement nir shader query enabled intrinsics +- ac/llvm: get back intrinsics used by NGG +- ac/nir/ngg: support user edge flags for ngg lower +- ac/nir/ngg: pass primitive_id_location as param for nogs lower +- nir: add two amd ngg lds base load intrinsics +- ac/nir/ngg,radv: move LDS layout calculation out of nir ngg lowering +- ac/nir/ngg: allow passthrough with vs primitive id output +- ac/nir/ngg: save and restore no_varying/no_sysval_output +- ac/nir/ngg,ac/llvm,aco: save nogs ngg culling one lds dword +- ac/nir/ngg: add one odd dword to nogs culling pervertex lds +- ac/nir/ngg: fix nogs culling with nuw add +- ac/nir/ngg: remove nuw for negative value add +- ac/llvm: fix gfx11 fs input load for 16bit varying +- ac/llvm: fix 16bit varying llvm compile error + +Rajnesh Kanwal (48): + +- pvr: Implement vkResetDescriptorPool API. +- pvr: Fix clang-format issue. +- pvr: Implement vkGetPhysicalDeviceImageFormatProperties2 API. +- pvr: Implement vkCmdDrawIndirect API. +- pvr: Implement vkCmdDrawIndexedIndirect API. +- pvr: Fix multiple file descriptor leaks. +- pvr: Implement vkCreateEvent and vkDestroyEvent APIs. +- pvr: Implement vkCmdSetEvent2 API. +- pvr: Implement vkCmdResetEvent2 API. +- pvr: Implement vkCmdWaitEvents2 API. +- pvr: Implement vkSetEvent API. +- pvr: Implement vkResetEvent API. +- pvr: General reformatting and renaming some fields. +- pvr: Replace pvr_renderpass_surface_initop with VkAttachmentLoadOp. +- pvr: Add support for generating render pass hw setup data. +- pvr: Use ~0 for masks instead of ~0U. +- pvr: Add missing output register write in pvr_CreateRenderPass2. +- pvr: Rename userpass_spawn to isp_userpass. +- pvr: Add const to structs that are not supposed to be modified. +- pvr: Add attachment resolve support. +- pvr: Add support to create subpass load ops. +- pvr: Pass load-op struct instead of idx to support subpass load-ops. +- pvr: Implement vkCmdNextSubpass API. +- pvr: implement vkcmddispatchindirect api. +- pvr: Handle indirect buffer address in pvr_setup_descriptor_mappings. +- pvr: Convert attachment indexes to unsigned. +- pvr: Remove redundant instruction buffer check. +- pvr: Remove redundant assignment error.instruction. +- pvr: Remove framebuffer NULL check from pvr_CmdBeginRenderPass2. +- pvr: Remove dead code from pvr_perform_start_of_render_clears. +- pvr: Zero init pbe_cs_words to avoid passing on garbage. +- pvr: Fix Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) errors. +- pvr: Call VALGRIND_FREELIKE_BLOCK before unmapping to avoid use after free. +- pvr: Add assert to check for non-NULL binding_desc. +- pvr: Implement vkGetEventStatus API. +- pvr: Add pvr_csb_emit_link function to support secondary buffer links. +- pvr: Fix check in pvr_cmd_uses_deferred_cs_cmds. +- pvr: Add support for PVR_CMD_STREAM_TYPE_GRAPHICS_DEFERRED stream. +- pvr: Create deferred control stream for secondary command buffers. +- pvr: Reserve space for vdm state for deferred secondary buffer. +- pvr: Add support to copy a control stream to another control stream. +- pvr: Implement vkCmdExecuteCommands API. +- pvr: Update buffer type macro names for consistency. +- pvr: Fix heap type of availability_buffer allocation. +- pvr: Fix allocation size passed in pvr_cmd_buffer_alloc_mem. +- pvr: Remove double error reporting. +- pvr: Split pds compute shader create and upload code for reuse. +- pvr: Add vulkan shader factory headers for Query and clear APIs. + +Renato Pereyra (1): + +- venus: Increase vn_relax sleep time + +Rhys Perry (117): + +- aco: fix LdsBranchVmemWARHazard with 2+ branch chains +- aco: set has_VMEM,has_DS=false after a branch +- aco: only add vscnt wait when visiting VMEM/DS +- aco: improve VcmpxPermlaneHazard workaround +- aco: fix hash statistic +- docs/radv: advertise VK_KHR_global_priority +- aco: fix consecutive exec writes when finding exec_copy instruction +- aco: rename is_cmp to is_fp_cmp +- aco: fix assembly of vopc_sdwa writing exec +- aco: fix re-write of uses of exec_val's lo/hi half +- aco: test branch opcode if removing it in try_optimize_branching_sequence +- aco: remove val_and_copy_adjacent +- aco: improve vcc check for instructions between exec_val and exec_copy +- aco: test for one and_savexec opcode in try_optimize_branching_sequence +- aco: fix long-jump version of discard early exit +- radv: remove claimed support for sRGB vertex buffer formats +- radv: fix 16-bit support in radv_lower_vs_input +- aco: fix 16-bit VS inputs +- radv/llvm: fix packed VS inputs on GFX6/GFX10+ +- aco: don't expand vec3 VS input load to vec4 on GFX6 +- aco: allow direct_fetch=true for vec4 VS input loads +- nir/print: support nir_texop_descriptor_amd +- aco: add SCC clobber in build_cube_select +- nir/lower_tex: ignore width of cube textures +- radv: enable ac_nir_lower_resinfo for ACO +- aco: remove dead code for querying image size/samples/levels +- util/ralloc: add HEADER_ALIGN macro +- util: add freelist allocator with mark/sweep +- nir/serialize: remove unused parameter from read_src() +- Revert "nir: Drop the unused instr arg for src/dest copy functions." +- nir: adjust nir_src_copy signature to take a nir_instr * +- nir: use a GC context for instructions +- util/format,vulkan/format: update p_format for Vulkan vertex buffers +- ac: add ac_vtx_format_info +- radv: add radv_is_vertex_buffer_format_supported +- radv,aco: use pipe_format for static vertex input state +- radv,aco: use pipe_format for dynamic vertex input state +- radv: remove radv_translate_vertex_format +- radv: enable R8G8B8, B8G8R8 and R16G16B16 vertex formats +- radv,aco: implement 64-bit vertex inputs +- radv: enable 64-bit vertex formats +- aco/ra: handle empty def_reg interval in get_regs_for_copies +- aco/ra: remove bounds parameter from get_regs_for_copies() +- aco/ra: rework fixed operands +- nir/algebraic: optimize bits=umin(bits, 32-(offset&0x1f)) +- nir/algebraic: optimize fabs(bcsel(b, fneg(a), a)) +- radv: disable EXT_vertex_input_dynamic_state when using DGC +- radv: shrink zero-initialization in vkCmdSetVertexInputEXT +- radv: use nir_ubfe_imm +- radv: remove unnecessary .align_mul=4 +- radv: fix dynamic RT stack size with VGPR spilling +- aco: DCE ra_ctx::defs_done +- aco: rename Interp_instruction to VINTRP_instruction +- aco: add reg() helper to assembler +- aco: fix assembly of MUBUF-to-LDS loads +- aco: add GFX11 opcode numbers +- aco/gfx11: don't use more than 1 NSA dword +- aco: update assembler for GFX11 +- aco: limit GFX11 to 128 VGPRs for now +- aco: add LDSDIR instruction format +- aco: add VINTERP instruction format +- aco: omit read-only memory_sync_info when printing +- aco/tests: add GFX11 assembly tests +- aco: mostly implement FS input loads on GFX11 +- aco: fix VMEMtoScalarWriteHazard s_waitcnt mitigation +- aco: improve VMEMtoScalarWriteHazard s_waitcnt mitigation +- aco: use some helpers in GFX10 hazard workarounds +- aco: improve printing of sgpr_null +- aco: improve printing of s_waitcnt_depctr +- aco: add VMEMtoScalarWriteHazard tests +- aco/gfx11: swap ds_cmpst_* data operands +- aco: improve wait_imm unpack +- aco/gfx11: fix s_waitcnt printing +- aco: update sendmsg enum from LLVM +- aco/gfx11: deallocate VGPRs at the end of the shader +- aco/gfx11: update form_hard_clauses +- aco: limit hard clauses to 63 instructions +- aco: fix assembler.gfx11.vinterp test +- aco: add search_backwards helper +- aco/gfx11: workaround VcmpxPermlaneHazard +- aco/gfx11: workaround LdsDirectVALUHazard +- aco/gfx11: workaround LdsDirectVMEMHazard +- aco/gfx11: workaround VALUTransUseHazard +- aco/gfx11: workaround VALUPartialForwardingHazard +- aco/gfx11: workaround VALUMaskWriteHazard +- aco: add ACO_DEBUG=force-waitdeps +- nir/algebraic: optimize b<<a + c<<a +- ac/llvm: fix mixing non-uniform/uniform sampler/texture descriptors +- ac/llvm: ignore g16 if the image opcode doesn't use derivatives +- aco/gfx11: optimize LS/HS load_local_invocation_index +- aco: swap v_perm_b32 operands +- radv,nir: add intrinsics for streamout and GS copy shaders +- ac/nir: add legacy streamout and GS copy shader helpers +- ac/nir: micro-optimize boolean expression +- ac/nir: add ac_nir_lower_ngg_options +- aco: add storage_gds +- aco: insert waitcnt before/after ds_ordered_count +- nir,ac/nir,aco,radv: replace has_input_*_amd with more general intrinsics +- aco: don't split swizzled store_buffer_amd on GFX9+ +- ac/nir: lower gfx11 vertex parameter exports +- radv/gfx11: don't create fmask copy/expand pipelines +- radv/gfx11: don't create texop_samples_identical +- radv,aco: don't use lower_to_fragment_fetch_amd on GFX11+ +- aco: fix typo in branch lowering +- aco/gfx11: perform FS input loads in WQM +- aco/gfx11: fix FS input loads in quad-divergent control flow +- radv/llvm: fix GS shaders on GFX8/9 +- docs: update new_features.txt for non-extension additions to RADV +- radv: increase gfx1100/gfx1101 physical vgprs +- ac/gpu_info: increase gfx1100/gfx1101 physical vgprs +- aco/gfx11: increase gfx1100/gfx1101 physical vgprs +- radeonsi: increase gfx1100/gfx1101 physical vgprs +- nir/lower_bit_size: lower uadd_carry +- radv: lower 8/16-bit uadd_carry/usub_borrow +- radv: disable MRT compaction with dual-source blending +- aco: ensure MRT0 is written with dual source blending +- ac/nir: mask shift operands + +Riteo (1): + +- vulkan/device_select_wayland: fix a memory leak with DRM device handling + +Rob Clark (106): + +- gbm: Add USE_FRONT_RENDERING flag +- freedreno: Disable UBWC for frontbuffer rendering +- freedreno/drm: Fix potential bo cache vs export crash +- freedreno/gmem: Fix col0 calc +- freedreno: Drop fixed upper bound on # of tiles +- freedreno/drm: Add FD_VA_SIZE param +- freedreno: Cap reported video memory by VA size +- freedreno/drm: Avoid lock in fd_pipe_del() +- egl: atrace support +- freedreno/drm/virtio: Add some atrace +- util: Fix vs2019 build error +- egl: Drop _eglSetFuncName() return +- egl: Drop unused _EGL_FUNC_START() arg +- egl: Remove redundant fxn param +- egl: Convert to simple_mtx_t +- egl: Add some locking asserts +- egl/dri2: Indenting fix +- egl/dri2: Error path cleanups +- llvmpipe: Add some missing locking +- freedreno: Extract helper to check for batch dependency +- freedreno: Simplify add_dep logic +- freedreno: Avoid deferred-flush dependency loops +- Revert "ci: disable the freedreno farm." +- freedreno/a6xx: Random indent fixes +- freedreno/drm: Let backend hint about upload vs map +- freedreno/drm: Add offset param for fd_bo_upload() +- freedreno: Add support for upload transfers +- Revert "ci: disable the freedreno farm." +- egl/wgl: Make ref_count atomic +- egl/dri2: Make ref_count atomic +- egl/dri2: Add display lock +- egl: Make RefCount atomic +- egl: Introduce rwlock to protect eglTerminate() +- egl: Relax locking +- freedreno: Misc indent fix +- freedreno: nr_rings -> nr_priorities +- freedreno: Remap high/norm/low priorities +- freedreno/virtio: Don't upload if we have valid range +- freedreno/drm/virtio: Don't prefer upload for mapped buffers +- freedreno/drm/virtio: Handle read after upload +- freedreno: We really don't need aligned vbo's +- freedreno: Update github wiki links +- freedreno/ir3: GC unused macro +- freedreno/drm: Inline fd_bo_get_iova() +- freedreno/drm: Add fd_ringbuffer_attach_bo() +- freedreno/a6xx: Drop "hardpin" support +- freedreno/drm: Simplify emit_reloc_common +- freedreno/a6xx: Skip IBO state when unused +- freedreno/a6xx: Pre-calculate user const state size +- freedreno: Drop unused arg +- freedreno/a6xx: Move user const upload to bind +- freedreno/a6xx: Simplify fd6_build_user_consts() +- freedreno: Handle xfb invalidation +- freedreno: Use TC cpu-storage to shadow buffers +- freedreno/drm: Don't call kernel with no ops +- freedreno: Add perf-debug trace +- isaspec: Fix out of date comment +- gallium/u_threaded: Add some atrace/perfetto +- gallium/u_threaded: Add optional call-id tracing +- freedreno: Fix fence unref race +- docs: Update freedreno features +- mesa/st: ARB_vertex_attrib_64bit depend on glsl>=410 +- freedreno/a6xx: Remove unused param +- freedreno/ir3: Fix clipvertex with GS+tess +- freedreno/ir3: Lower all the 64b +- freedreno/a6xx: Fix MAX_GEOMETRY_OUTPUT_VERTICES cap +- freedreno/a6xx: Fix primitives-generated query +- freedreno/a6xx: Update caps +- freedreno/a6xx: Drop max_scissor tracking +- Revert "egl: Factor some common terminate cleanup up to common code" +- nir: Add helper to create passthrough TCS shader +- radeonsi: Use nir passthrough TCS helper +- freedreno: Dirty the TCS state if patch_vertices change +- freedreno/ir3: Passthrough TCS support +- freedreno: Add tess_state support +- freedreno: Passthrough TCS support +- v3d/ci: Add a flake +- Revert "mesa/st: ARB_vertex_attrib_64bit depend on glsl>=410" +- freedreno: Don't advertise TGSI support for GS/tess +- freedreno/a6xx: Move layer_zero handling to interp state +- freedreno/ir3: Drop unused view_zero/layer_zero lowering +- freedreno/a6xx: Fix buffer size clamping +- freedreno/a6xx: Support AMD_vertex_shader_layer +- freedreno: support multiple viewports +- freedreno: Massage scissor state at bind time +- freedreno: Move guardband calc to bind time +- freedreno/a6xx: Support GL_ARB_viewport_array + gl43 +- freedreno/a6xx: Enable GL_ARB_texture_mirror_clamp_to_edge +- freedreno: Enable GL_ARB_enhanced_layouts +- freedreno/ci: Small bit of xfails cleanup +- docs: Update features.txt +- nir: Add way to create passthrough TCS without VS nir +- freedreno: Add ARB_gl_spirv support +- freedreno/a6xx: Use box to pass 2d clear params +- freedreno/a6xx: Implement ARB_clear_texture +- freedreno/a3xx+: Enable ARB_shader_texture_image_samples +- freedreno/a3xx+: Enable ARB_derivative_control +- freedreno/ir3: Unconditionally lower subgroup ops +- freedreno/a6xx: Enable ARB_shader_group_vote +- freedreno/a6xx: Fix occlusion queries +- freedreno/a6xx: Remove unused field +- freedreno/a5xx+a6xx: Add base class for query samples +- freedreno/batch: Add a global epilogue +- freedreno: Core ARB_query_buffer_object support +- freedreno/a6xx: ARB_query_buffer_object support +- freedreno/a6xx: Mark gl45 supported + +Rohan Garg (3): + +- anv, iris: Disable pre fetching the binding table entries on DG2 +- intel/compiler: Support 16 bit float ops +- anv: Enable 16 bit float ops on devices that have a LSC + +Roland Scheidegger (4): + +- lavapipe: initialize index_bias to zero for non-indexed draws +- gallivm: always do per-pixel LOD for cube maps +- llvmpipe: fix front/back face system value in the fs +- gallivm: remove dead cube map lod calculation code + +Roman Gilg (2): + +- loader/dri3: Simplify buffer age query +- loader/dri3: Add DRI performance option to wait for next buffer on swap + +Roman Stratiienko (6): + +- v3dv: Enable sync_fd importing/exporting on Android +- v3dv: Limit API version to v1.0 for Android +- v3dv: Avoid swapchain_info handling on Android +- lima: Ensure jobs initialized before calling lima_job_fini() +- Android: Use libgbm_mesa name for SDK30+ +- meson: Enable system_has_kms_drm for android + +Ruijing Dong (30): + +- frontends/va: improve enc quality interface change +- radeonsi/vcn: add enc quality bits interface +- radeonsi/vcn: dpb change for supporting pre-encoding +- radeonsi/vcn: support encoding preset modes +- radeonsi/vcn: support VBAQ modes +- radeonsi/vcn: remove rate control double begin IBs +- radeonsi/vcn: add AUD syntax to h264 encoding +- frontend/va: remove some unnecessary code +- frontends/va: change getEncParamPreset location +- frontends/va: add max_frame_size into rate control +- frontends/va: add HRD, filler data enable and etc +- frontends/va: enable sao in hevc encoding +- radeon/vcn: process invalid frame rate in encoding +- radeonsi/vcn: support async +- radeonsi/vce: using fixed value for vce 52 +- gallium/pipe/video: group avc encoder seq structure +- radeonsi/vce: apply avc seq parameters +- frontends/omx: apply avc seq parameters +- frontends/va: apply avc seq parameters +- radeonsi/vcn: use avc seq paramters +- frontends/va: add aspect ratio and vui timing info +- radeonsi/vcn: support aspect ratio and vui timing info +- d3d12: Updating refactored fields in pipe_h264_enc_picture_desc +- frontends/va: fix av1 decoding image distortion issue +- gallium/pipe/video: add data struct for dbk in avc encoding. +- frontends/va: add h264 dbk enc parameters +- radeonsi/vcn: enable dbk in avc encoding +- radeonsi/vcn: enable multi-slice encoding +- frontends/va: fixed an av1 decoding image corruption issue. +- frontends/va: fixed an av1 dec image corruption. + +Ryan Houdek (1): + +- vulkan/wsi: Add dep_libudev to idep dependencies + +Sagar Ghuge (5): + +- intel/isl: Setting L1 caching policy to Write-back mode +- anv: Specify Untyped L1 cache policy for stateless accesses +- iris: Specify Untyped L1 cache policy for stateless accesses +- anv: Handle bits to flush data-port's Untyped L1 data cache +- iris: Handle new untyped dataport cache flush PIPE_CONTROL field + +Sajeesh Sidharthan (1): + +- radeonsi/vcn: set current pic index correctly + +Sami Kyöstilä (1): + +- intel/ds: Update to Perfetto API v28.0 + +Samuel Pitoiset (329): + +- vulkan: add support for VK_IMAGE_LAYOUT_ATTACHMENT_FEEDBACK_LOOP_OPTIMAL_EXT +- radv: implement VK_EXT_attachment_feedback_loop_layout +- radv: ignore out-of-order rasterization if stencil write mask is dynamic +- radv: rework shaders ref counting +- radv: use ref counting for VS prologs and PS epilogs +- radv: remove RADV_META_SAVE_SAMPLE_LOCATIONS +- radv: simplify saving/restoring all dynamic states +- radv: remove unused states parameter from some radv_emit_XXX() helpers +- radv: simplify radv_bind_dynamic_state() slightly +- radv: fix gathering XFB info if there is dead outputs +- radv: respect the render area for depth/stencil resolves +- radv: fix cleaning the meta query state if an error occured +- radv: properly ignore PSO states that are dynamic +- radv: remove one useless check about line width and dynamic state +- radv: simplify dynamic states uses when determining NGG culling settings +- radv: do not set registers for PSO states that are dynamic +- radv: remove radv_invariant_geom workarounds for DX games +- radv: explain why radv_invariant_geom is still needed for SotTR DX +- radv: remove unecessary radv_pipeline::uses_dynamic_stride +- radv: copy line stipple factor/pattern only if the state isn't dynamic +- radv: copy the sample locations only if the state isn't dynamic +- zink: stop skipping bunch of GLESx tests +- vulkan: initialize discardRectangleCount even if the state is dynamic +- vulkan: fix initializing the primitive topology when it's dynamic +- vulkan: add attachment sample count info AMD support +- radv: store the provoking vertex mode in the graphics pipeline info +- radv: simplify the depth clip enable/disable logic +- radv: do not translate polygon mode during gfx info initialization +- radv: do not translate logic op during gfx info initialization +- radv: do not translate blend op/factor during gfx info initialization +- radv: do not translate primitive topology during gfx info initialization +- radv: slightly change the color write enable mask +- radv: call vk_graphics_pipeline_state_fill() when initializing gfx pipelines +- radv: convert radv_tessellation_info to vk_tessellation_state +- radv: convert radv_viewport_info to vk_viewpoert_state +- radv: convert radv_discard_rectangle_info to vk_discard_rectangles_state +- radv: convert radv_rasterization_info to vk_rasterization_state +- radv: convert radv_depth_stencil_info to vk_depth_stencil_state +- radv: convert radv_fragment_shading_rate_info to vk_fragment_shading_rate_state +- radv: convert radv_input_assembly_info to vk_input_assembly_state +- radv: convert radv_multisample_info to vk_multisample_state +- radv: convert radv_color_blend_info to vk_color_blend_state +- radv: convert radv_rendering_info to vk_render_pass_state +- radv: convert radv_vertex_input_info to vk_vertex_input_state +- radv: remove radv_graphics_pipeline_info completely +- radv: advertise VK_KHR_global_priority +- radv: remove unused pCreateInfo from radv_pipeline_init_blend_state() +- radv: add radv_graphics_pipeline_import_info() helper +- radv: introduce a new pipeline type for graphics libs +- radv: store vk_graphics_pipeline_state to radv_graphics_pipeline +- radv: prevent accessing rasterization state when it's NULL +- radv: add a very initial implementation of VK_EXT_graphics_pipeline_library +- radv: do not compress DCC in presence of render loops on GFX10+ +- radv: remove useless check when creating the noop FS +- radv: copy linked I/O info for merged stages in radv_fill_shader_info +- radv: copy the whole radv_shader_info struct for merged stages +- radv: add a radv_postprocess_nir() helper +- radv: move lowering FS intrinsics to radv_postprocess_nir() +- radv: move dumping NIR shaders to the previous similar loop +- radv: remove unused parameter in radv_lower_io_to_mem() +- radv: use NIR_PASS with radv_export_implicit_primitive_id +- radv: remove useless parameter from radv_link_shaders() +- radv: add radv_remove_point_size() linking helper +- radv: rework linking shaders in NIR +- radv: move assigning driver locations to the new linking helpers +- radv: move lowering patch vertices to radv_pipeline_link_tcs() +- radv: move lowering the view index to radv_pipeline_link_shaders() +- radv: remove redundant assignment of tcs.tcs_vertices_out +- radv: remove unused tcs_vertices_out assignment for VS +- radv: remove unused num_tess_patches assignment for VS +- radv: merge gather_tess_info() with radv_fill_shader_info() +- radv: stop setting CB_COLOR_CONTROL.ROP3 from the pipeline +- radv: stop clearing bitfields for registers that are emitted dynamically +- radv: cleanup dynamic states in radv_emit_graphics_pipeline() +- radv: stop emitting RMW context registers for updating sample locations +- radv: re-emit viewports if negative one to one or depth clamp mode changed +- radv: remove bogus assertion about independent set layouts with GPL +- radv: fix missing initialization of the pipeline layout when creating a lib +- radv: destroy the pipeline layout if creating a library failed +- aco: fix wrong size for 1D images and A16 on GFX9 +- ac: constify ac_compute_cs_workgroup_size() +- radv: stop duplicating radv_es_output_info +- radv: use esgs_itemsize when calling ac_nir_lower_es_outputs_to_mem +- radv: compute the ESGS itemsize outside of radv_nir_shader_info_pass() +- radv: remove dead code about task ring when binding a compute pipeline +- radv: replace cs.uses_task_rings by ms.has_task +- radv/llvm: remove unused parameter in handle_vs_outputs_post() +- radv: stop duplicating radv_vs_output_info +- radv: fill radv_vs_output_info unconditionally for vertex related stages +- radv: remove redundant VS output parameter assignments +- radv: add a helper that links shader info between stages +- radv: link primitive ID/clip distance shader info from the new helper +- radv: stop gathering info for FS before other stages +- radv: move filling cs.block_size +- radv: add gather_shader_info_fs() helper +- radv: add gather_shader_info_mesh() helper +- radv: add gather_shader_info_gs() helper +- radv: add gather_shader_info_tes() helper +- radv: add gather_shader_info_tcs() helper +- radv: add gather_shader_info_vs() helper +- radv: add gather_shader_info_cs() helper +- radv: add gather_shader_info_task() helper +- radv: move more CS info to gather_shader_info_cs() +- radv: move more MS info to gather_shader_info_ms() +- radv: move determining NGG shader info to radv_fill_shader_info() +- radv: store binning settings into the physical device +- radv: make sure to emit BREAK_BATCH when color write enable is dynamic +- radv: remove useless parameter in si_write_scissors() +- radv: split emitting guardband into a separate helper +- radv: rework dirtying scissors when the line width changes +- radv: emit the guardband state separately from the scissor state +- radv: allow null descriptor set layouts in pipeline layouts +- radv: determine if the topology uses adjacency from the shaders on GFX9+ +- radv: fix bogus assertion with RADV_FORCE_VRS +- aco: remove unused isel_context::tcs_num_patches +- radv: stop setting as_ls for the TCS stage +- radv: stop setting as_es for the GS stage +- radv: stop setting num_tess_patches for the GS stage +- radv: stop setting esgs_itemsize to the GS stage +- radv: cleanup helpers that compute NGG info and GS info on GFX9+ +- radv: move computing wave_size/ballot_bit_size to the shader info pass +- radv: move computing NGG info and GS info to radv_nir_shader_info_link() +- radv: refactor radv_nir_shader_info_link() +- radv: move determining other NGG settings in radv_link_shaders_info() +- radv: simplify determining if NGG passthrough can be enabled +- radv: cleanup computing the workgroup size for all stages +- radv: add and use a helper that merges shader info for merged stages on GFX9+ +- radv: remove unused parameters in radv_fill_shader_info() +- radv: determine as_ls/as_es in radv_link_shaders_info() +- radv: constify radv_shader_info where it's missing +- radeonsi/ci: trigger radeonsi-raven-skqp for any RADV changes +- radv: gather MRTs that are written by the fragment shader +- radv: compact MRTs to save PS export memory space +- radv: only expose sparseResidencyImage3D on GFX9+ +- radv: emit SQTT markers for RT related commands +- radv: capture RT pipelines from the SQTT layer +- radv: fix reporting RT shaders in RGP +- radv: do not lower color exports for FS that need an epilog +- radv: add radv_remove_color_exports() helper +- radv: do not try to remove color exports for FS that need an epilog +- radv: keep track of the code size for VS prologs and PS epilogs +- radv: create a PS epilog from a library without the main FS +- radv: add support for emitting and prefetching PS epilogs +- radv: import PS epilog from libraries if present +- radv: fix hw remapping of MRT holes with color attachments without export +- radv: fix pipelineStageCreationFeedbackCount when it's 0 +- radv: add a pointer to radv_shader_binary in radv_shader +- radv/ci: re-enable dEQP-VK.renderpass2.depth_stencil_resolve.*_samplemask +- radv: store the VS prologs/PS epilogs VA at upload time +- radv: split upload_shader_part() in two parts +- radv: store the binary to radv_shader_part +- radv: upload the PS epilog in the existing pipeline BO +- radv: reword a comment about dynamic states when rasterization is disabled +- radv: emit the LDS size for TCS directly from the pipeline on GFX9+ +- radv: add radv_pipeline_key::dynamic_patch_control_points +- radv: skip computing some tess info when patch control points is dynamic +- radv: set workgroup_size to 256 when patch control points is dynamic +- aco: prevent a division by zero when patch control points is dynamic +- radv: add shader arguments for dynamic patch control points +- radv: add ABI lowering support for dynamic patch control points +- radv: pass the number of patch control points to si_get_ia_multi_vgt_param() +- radv: move emitting PRIMGROUP_SIZE for <= GFX9 from the cmdbuf +- radv: move emitting GE_CNTL for non-NGG pipelines from the cmdbuf +- radv: implement dynamic patch control points +- radv: advertise extendedDynamicState2PatchControlPoints +- radv: stop setting redundant viewport/scissor for internal operations +- radv: do not bind NULL graphics pipeline when restoring the meta state +- radv: reset the compute pipeline when the saved one was NULL +- radv: stop dirtying the graphics pipeline when restoring it +- radv: stop checking for NULL pipelines in radv_CmdBindPipeline() +- radv/ci: cleanup lists of failures/flakes +- radv: disable VK_EXT_graphics_pipeline_library with LLVM +- radv: rename radv_pipeline_key::vs::dynamic_vs_input to has_prolog +- radv: prepare the VS input state for prologs created with GPL +- radv: bind the VS input state for prologs created with GPL +- radv: enable the VS prologs cache if graphicsPipelineLibrary is enabled +- radv: do not remove PSIZ for VS when the topology is unknown +- Revert "radv: upload the PS epilog in the existing pipeline BO" +- nir: add nir_intrinsic_load_sample_positions_amd +- radv: add nir_intrinsic_load_sample_positions_amd in the ABI +- radv: run nir_opt_cse before lowering FS intrinsics +- radv,aco: lower barycentric_at_sample in NIR +- radv: do not link shaders when the next stage is unknown +- radv: allow to build the main FS in a graphics pipeline library +- radv,aco: do not compact MRTs if the pipeline uses a PS epilog +- radv: rename radv_thread_trace_set_pstate() to radv_device_set_pstate() +- radv: acquire pstate on-demand when capturing with RGP +- radv: constify radv_lookup_user_sgpr() +- radv: add radv_pipeline_key::dynamic_rasterization_samples +- radv: declare shader arguments for the number of samples for FS +- nir: add nir_load_rasterization_samples_amd +- radv: lower nir_load_rasterization_samples_amd in ABI +- nir,radv: pass the number of samples to load_sample_positions_amd +- radv: add barycentric_at_sample lowering when the number of samples is dynamic +- radv: emit the rasterization samples through an user SGPR if needed +- radv: make sure to initialize wd_switch_on_eop before checking its value +- radv: pass a VkSampler to write_sampler_descriptor() +- radv: remove unnecessary radv_buffer_view::vk_format +- radv: rework and rename radv_make_buffer_descriptor() +- zink/ci: skip ext_external_objects from Piglit with RADV +- zink/ci: skip arb_shader_clock with RADV +- zink/ci: skip GLX tests with RADV +- zink/ci: update list of expected failures with RADV on NAVI10 +- zink/ci: update list of flakes for Piglit with RADV on NAVI10 +- radv/ci: add piglit testing with Zink on NAVI10 +- radv: do not free the GS copy shader binary if created from a library +- radv: import the GS copy shader from a library if present +- radv: determine the last VGT api stage from the active_stages bitfield +- radv: allow to build the pre-rasterization stages in a library +- radv: remove useless gfx10_ngg_info::enable_vertex_grouping +- radv: determine the last VGT api stage also for GPL +- radv: use the maximum number of vertices per primitives for NGG with GPL +- radv: enable NGG culling unconditionally for GPL but disable it dynamically +- radv: rework multiview lowering in NIR slightly +- radv: fix handling ViewportIndex with GPL +- radv: fix handling primitive ID and clip/cull distances with GPL +- radv: fix handling multiview with GPL +- radv: fix emitting RBPLUS state when MRTs are compacted +- radv: remove NULL checks when binding a graphics pipeline +- radv: fix initializing the attribute mask for VS input state and GPL +- radv: fix initializing the blend state for NULL render pass with GPL +- radv: ignore shader stages that don't need to be imported with GPL +- radv: add a new drirc option to enable a unified heap on APUs +- radv: enable the unified heap on APUs for Red Dead Redemption 2 +- radv: fix and rework shaders upload with GPL +- radv: always re-emit patch control points when a pipeline with tess is bound +- radv: move the disk cache to the vk_physical_device struct +- radv: use SHA1_DIGEST_LENGTH in the pipeline cache +- radv: do not create a noop FS when the FS is imported from a library +- radv: discard the PS epilog when the pipeline doesn't use a fragment shader +- ac/llvm: add missing nir_intrinsic_bindless_image_atomic_{fmin,fmax} +- radv: set SAMPLE_MASK_TRACKER_WATERMARK to 15 on GFX11 +- radv: use correct VGT_TESS_DISTRIBUTION settings on GFX11 +- radv: do not use memory for waiting for cache flushes on GFX11 +- radv: drop the ES vertex count requirement on GFX11 +- radv: add RADEON_FLAG_DISCARDABLE +- radv: create and configure the ring BO for attributes +- radv: declare shader arguments for attributes via memory +- nir: add nir_intrinsic_load_ring_attr_{offset}_amd +- radv: add lowering for nir_intrinsic_load_ring_attr_{offset}_amd +- radv: allocate more space for pipeline statistics query on GFX11 +- ac: add radeon_info::has_vrs_ds_export_bug +- radv: advertise fragmentShadingRateWithShaderDepthStencilWrites on NAVI23+ +- aco: fix tcs_wave_id unpacking on GFX11 +- radv: declare new dynamic states +- radv: add support for dynamic polygon mode +- radv: add support for dynamic tessellation domain origin +- radv: add support for dynamic logic op enable +- radv: add support for dynamic line stipple enable +- radv: add support for dynamic alpha to coverage enable +- radv: add support for dynamic sample mask +- radv: add support for dynamic depth clip enable +- radv: add support for dynamic conservative rasterization mode +- radv: add support for dynamic depth clip negative one to one +- radv: add support for dynamic provoking vertex mode +- radv: add support for dynamic depth clamp enable +- radv: advertise VK_EXT_extended_dynamic_state3 +- radv: disable VK_AMD_texture_gather_bias_lod on GFX11 +- radv: disable AMD_shader_explicit_vertex_parameter on GFX11 +- radv/llvm: do not set COMPR for exports on GFX11 +- ac/nir,radv/llvm: add support for the attribute ring +- aco,radv/llvm: do not export parameters on GFX11 +- ac: fix has_vrs_ds_export_bug for VanGogh +- radv: disable {shader,sparse}ImageFloat32AtomicMinMax on GFX11 +- radv: disable shaderBufferFloat64AtomicMinMax on GFX11 +- radv: disable dual source blending in more situations +- ac/llvm: add support for device clock on GFX11 +- aco: add support for s_sendmsg_rtn_b{32,64} +- aco: split the sendmsg enumeration into sendmsg_rtn +- aco: add support for device clock on GFX11 +- ac/nir/ngg: fix emitting streamout output by using packed location +- radv: fix suspending/resuming pipeline statistics queries with GDS +- radv: fix VRS limit when attachmentFragmentShadingRate is disabled +- radv/llvm: fix dual source blending on GFX11 +- radv: move nir_opt_idiv_const/nir_lower_idiv after NGG lowering +- ac/nir/ngg: fix emitting streamout output by using packed location +- radv: implement transform feedback queries with NGG streamout +- Revert "radv: add a pointer to radv_shader_binary in radv_shader" +- aco: create a new builder variant for ds_add_rtn +- aco: implement NIR intrinsics for NGG streamout +- aco: remove invalid assertions for NGG streamout +- radv/llvm: prevent emitting streamout outputs for NGG +- radv: call nir_io_add_intrinsic_xfb_info() after IO lowering +- radv: lower NIR intrinsics for NGG streamout +- radv: do not unconditionally disable NGG streamout lowering in NIR +- radv: reword an incorrect comment about primitives generated query +- radv: use 32-bit GDS counters for primitives generated query with NGG +- radv: suspend/resume primitives generated query separately for legacy/NGG +- radv: fix primitives generated query with NGG only +- radv: set the correct buffer size for NGG streamout +- radv: enable all possible features with NGG streamout +- radv: allow to enable NGG streamout with RADV_PERFTEST=ngg_streamout +- radv: enable VK_EXT_transform_feedback with NGG streamout on GFX11 +- ac/nir: stop using nir_gather_xfb_info_from_intrinsics +- radv: assign IO var locations for VS/TES with NGG streamout +- radv/amdgpu: add a kernel GDS management workaround for -ENOMEM +- radv: use defines instead of magic values for GDS counters offset +- radv: move GDS counters after reserved GDS offsets for streamout +- radv: fix fallback for extreme geometry with tessellation on GFX11 +- radv: fix setting MIN_LOD for texture descriptors on GFX11 +- radv: make the GDS/GDS OA buffer objects resident +- radv: invalidate L2 instead of only writeback L2 when using DCC stores +- radv: re-emit the guardband state when restoring meta operations +- ac/nir,radv: rework and fix NGG queries enables for VS/TES +- ac/nir: do not convert GS outputs to the expected variable size on GFX11 +- radv: re-emit NGG culling settings when conservative rast mode is dynamic +- radv: enable lowering of subgroup shuffle in NIR on GFX11+ +- aco: fix p_interp_gfx11 to not overwrite SCC +- aco: fix missing SCC for p_interp_gfx11 in emit_interp_mov_instr() +- aco: add p_dual_src_export_gfx11 for dual source blending on GFX11 +- aco: fix dual source blending on GFX11 +- aco: fix FS inputs loads in WQM with 16-bit +- radv: suspend/resume XFB queries with NGG for meta operations +- zink: require extendedDynamicState3ColorBlendEquation for full ds3 +- radv: stop overallocating LDS for VS/TES when NGG streamout is enabled +- radv: fix computing the pervertex LDS size with NGG streamout +- radv: enable NGG XFB queries only if streamout is enabled +- radv: re-mit streamout buffers to unbind them when NGG streamout is disabled +- radv: fix NGG streamout when it's never enabled in runtime +- radv: fix possible hangs with NGG streamout and secondary cmdbuf +- aco: fix emitting DEALLOC_VGPRS in the discard block +- radv: re-emit dynamic depth clamp enable if depth clip enable changed +- radv: make sure to mark DCC as compressed on GFX11 +- radv: introduce RADV_DEBUG=nofmask +- radv: set missing SPI_SHADER_PGM_xxx registers on GFX11 +- radv: set INTERPOLATE_COMP_Z to 0 on GFX11 +- radv: do not enable DCC for MSAA images without FMASK +- radv: do not enable NGG culling on GFX11 +- radv: disable VRS entirely on GFX11 + +Sarah Walker (3): + +- pvr: Update FWIF compute register structure +- pvr: Remove STRIP_RENDERING members from FWIF +- pvr: Update FWIF 3d and compute register structures + +Sathishkumar S (2): + +- frontends/va: support yuv 400/444 rt_formats in vaconfig +- frontends/va: reallocate surface for yuv400/yuv444 picture + +Sergei Chernyadyev (1): + +- radv: fix setting results for initialization failures in thread trace and trap handler + +Shuicheng Lin (1): + +- iris: return failure if iris_resource_configure_main fail + +Sil Vilerino (76): + +- gallium/va: vaDeriveImage to check PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP +- d3d12: Implement cap PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP +- frontends/va: Add HEVC decode slice descriptors +- frontends/va: Add HEVC decode args: IntraPicFlag, no_pic_reordering_flag, no_bipred_flag +- frontends/vdpau: Fill HEVC Decode param IntraPicFlag +- frontends/omx: Fill HEVC Decode param IntraPicFlag +- frontends/va: Mark IsLongTerm in HEVC decode args +- frontends/va: Add HEVC Encode support multi slice and extend pipe args +- frontends/va: Extend single to multiple L0-L1 references for HEVC Encode +- frontends/va: Support HEVC caps regarding features, block sizes, prediction direction +- d3d12/va: Name convention rename PIPE_VIDEO_SUPPORTS_CONTIGUOUS_PLANES_MAP to PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP +- gallium/vl: Allow vl_zscan.h to be included from C++ +- gallium/vl: Rename s_addr variable in vl_idct.c as it conflicts with windows existing inaddr.h keyword definition +- d3d12: Add HEVC Decode/Encode +- d3d12: Avoid extra allocation, copies when generating DXVA_Slice_Hxxx_Short arrays +- d3d12: Avoid local allocations for D3D12_RESOURCE_BARRIER on hot paths +- d3d12: Avoid heap allocations on hot path d3d12_video_decoder_dxva_picparams_from_pipe_picparams_hevc +- d3d12: Fill feedback in d3d12_video_encoder_encode_bitstream so vaSyncSurface properly populates buf->coded_size +- d3d12: Fix leak in d3d12_video_proc when re-creating ID3D12VideoProcessor +- d3d12: Fix winsys displaytarget leak in d3d12_resource +- d3d12: Fix leak in d3d12_resource_from_resource and usage in d3d12 video dec, enc +- d3d12: Add support for importing d3d12_video_buffer from handle +- d3d12: Allow formats other than NV12 in d3d12_video_buffer +- d3d12: Allow video processing for formats other than NV12 +- d3d12: Add VPBlit processor check for D3D12_FEATURE_VIDEO_PROCESS_MAX_INPUT_STREAMS +- d3d12: d3d12_video_buffer_create_impl make resident after checking for resource creation +- frontends/va: Implement vaSyncBuffer +- d3d12: Update HEVC Encode GOP on I frames too +- d3d12: Video process - Remove unnecessary batches flush +- d3d12: Add initialization values for d3d12_video_encoder +- d3d12: Report PIPE_VIDEO_CAP_REQUIRES_FLUSH_ON_END_FRAME +- d3d12: Add support for PIPE_VIDEO_CAP_ENC_SUPPORTS_ASYNC_OPERATION +- d3d12: Change type of m_FenceValue to uint64_t in d3d12_video_encoder +- d3d12: Export some util functions from d3d12_fence for d3d12 video +- d3d12: Keep track of feedbacks from previous encode executions +- d3d12: Change d3d12_video_encoder to not automatically flush +- d3d12: Move encode configuration and capabilities into separate structs +- d3d12: Encoder to use independent command allocators per in flight resource set +- d3d12: d3d12_video_encoder add reset() to prevent leaks on re-allocation +- d3d12: Have d3d12_video_encoder keep separate references for enc, heap and dpb allocations for in flight resources +- d3d12: Make d3d12 encode operations async and do not block waiting +- d3d12: Add output buffer to inflight resourceset +- d3d12: Fix redundant/inconsistent initializations for d3d12_video_encoder +- d3d12: Enhance get_feedback not enough buffers message +- d3d12: Extract into common variable for metadata slot calculation +- d3d12: Make get_feedback return correctly on error when writing \*size=0 +- d3d12: Relax slice caps requirements for video encode +- d3d12: Fix out of bounds index check for texture array DPB +- d3d12: Add debug verbosity for output reconstructed picture +- d3d12: Fix texture array DPB encode path +- d3d12: Fix usage of %ld with PRIu64 instead for cross-platform build +- d3d12: Replace size_t with uint64_t in d3d12_video_encoder definitions for cross-plat compatibility +- frontends/va: Add more params from VADecPictureParameterBufferAV1 and VASliceParameterBufferAV1 to pipe_av1_picture_desc +- frontends/va: Support AV1 Decode with multiple tiles (num_elements > 1) in a single VASliceParameterBufferAV1 buffer +- frontends/va: Use is_video_format_supported in vlVaVidEngineBlit instead of format list +- frontends/va: Fix AV1 decode qm_* parameters reading from VADecPictureParameterBufferAV1 +- d3d12: Add support for AV1 Video Decode +- d3d12: Support more video formats +- d3d12: Fix aliased usage of m_SliceControlBuffer in d3d12_video_decoder_prepare_dxva_slices_control_XXXX +- d3d12: Style variable fix in d3d12_video_decoder_prepare_dxva_slices_control_av1 +- frontends/va: Add VP9 decode multi slice information +- frontends/va: Add VP9 decode ref/mode_deltas information +- d3d12: Add VP9 Decode support +- d3d12: Fix get_index7bits - Reuse previously freed indices +- d3d12: HEVC Encode workaround for edge case in caps reporting not contemplated by upper layer interface +- d3d12: Fix HEVC wrong caps detection due to bad parenthesis in condition +- d3d12: HEVC Set D3D12_VIDEO_ENCODER_CODEC_CONFIGURATION_HEVC_FLAG_USE_ASYMETRIC_MOTION_PARTITION when required by caps +- gallium/vl: Add vl_winsys_win32 support +- frontends/va: Add windows VA frontend support via vl_winsys_win32 and libva-win32 +- d3d12: resource_from_handle to validate importing resource ptr from same d3d12 device +- CI: Add gallium-va and video-codecs in windows-vs2019 and debian-mingw32-x86_64 +- d3d12: Create d3d12_resource objects for video usage keeping their DXGI\_..._UNORM formats +- d3d12: HEVC Decode - Fix slice size and offsets translation from VA to DXVA +- frontend/va: Fix WIN32 VA_DRIVER_INIT_FUNC declaration. Remove declspec as it uses .def file +- ci: Add va frontend to windows-build-rules +- d3d12: Video Screen - Do not crash if HEVC not supported, return no support instead + +Simon Fels (1): + +- venus: drop duplicated YCBCR_2_PLANE_444_FORMATS_FEATURES_EXT + +Simon Ser (2): + +- vulkan/wsi/x11: detect Xwayland via the XWAYLAND extension +- egl: fix error string for dri2_create_image_dma_buf() + +Simon Zeni (10): + +- mesa: fix EXT_EGL_image_storage target validation +- gallium: track if st_egl_image was created by a dmabuf +- mesa: simplify st_egl_image binding process for texture storage +- mesa/st: check egl image and texture target match before binding +- mesa: fix GL_INVALID_OPERATION in glEGLImageTargetTexStorageEXT +- mesa: unlock texture on error path in glEGLImageTargetTexStorageEXT +- egl: fix Xlib include in egldisplay.h +- egl: set API symbols public +- EGL: sync headers with Khronos +- meson: remove unused EGL_NO_X11 definition + +Sonny Jiang (1): + +- radeonsi/vcn: add decode support for gfx1101 and gfx1103 + +SoroushIMG (22): + +- zink: Fix incorrect emission of SPIR-V shift ops +- zink: track min_samples state for per sample shading +- zink: lower 64bit vars for drivers without float64 +- zink: optimise lower_64bit_vars for when int64 is supported +- zink: lower 64bit pack when float64 not supported +- zink: fix nir_op_unpack_64_2x32 emission +- zink: fix spirv temp func variable class emission +- nir: track whether a loop contains soft fp64 ops +- nir: allow to fine tune unrolling for loops with soft fp64 ops +- zink: enable loop unrolling for loops that have soft fp64 +- zink: enable native ETC2 if vk driver supports it +- zink: fix invalid Offset set for variables which do not need an offset +- zink: stop enabling minmax filtering when not supported +- zink: fix isNan mismatch between NIR and SPIR-V +- lavapipe: stop allocating 0 size const buffer +- zink: cleanup pushconst interface between driver/compiler +- zink: use unified pushconst layour for passthorugh tcs +- zink: cleanup zink_pipeline_layout_create +- zink: add pushconst only pipeline layout +- zink: add new framebuffer_is_layered state +- zink: clamp gl_Layer output to 0, if framebuffer is not layered +- zink: limit gl_Layer clamping to drivers that need it + +Sunil Khatri (1): + +- winsys/amdgpu: clamp up the alignment if zero + +Sviatoslav Peleshko (2): + +- iris: Always initialize shader compilation queue ready fence +- driconf/intel: Use fake vendor to WA bad detection in Source engine games + +Tapani Pälli (40): + +- glsl: allow image*Shadow keywords on ES and GLSL >= 420 +- glsl: add check for too large atomic counter buffer offset +- iris: implement Wa_14015946265 for DG2 +- anv: implement Wa_14015946265 for DG2 +- intel/compiler: implement Wa_14014595444 for DG2 +- intel/compiler: fix assert from ver to verx10 +- intel/genxml: add VFG_PREEMPTION_CHICKEN_BITS register +- anv: disable preemption on VFG, Wa_14015207028 for DG2 +- iris: disable preemption on VFG, Wa_14015207028 for DG2 +- anv: remove vk_sample_locations_state from emit_multisample +- intel/common: clamp sample location coordinate range +- anv: remove primitive_topology from 3DPRIMITIVE calls +- mesa/st: fix a set_sampler_views call parameter order +- anv: implement Wa_14016118574 +- iris: implement Wa_14016118574 +- anv: limit to default fast clear color when image used for transfer +- intel: revert preemption disable via VFG changes +- intel/fs: mark debug variables with ASSERTED +- anv: mark debug variables with ASSERTED +- mesa: mark debug variables with ASSERTED +- anv: dynamic state for polygon mode +- anv: dynamic state for tessellation domain origin +- anv: dynamic rasterization stream +- anv: dynamic line stipple enable +- anv: dynamic line rasterization mode +- anv: dynamic state for logic op enable +- anv: dynamic alpha to one enable +- anv: dynamic states for depth clip and clamp +- anv: dynamic provoking vertex mode +- anv: dynamic color write mask +- anv: dynamic color blend equation +- anv: dynamic multisample sample mask +- anv: toggle on EXT_extended_dynamic_state3 +- anv: move emit_shading_rate to gfx8_cmd_buffer +- anv: fill AlphaToCoverageEnable lazily from state +- intel/compiler: run nir_opt_idiv_const before nir_lower_idiv +- anv: remove some unused functions +- hasvk: remove some unused functions +- iris: setup stage bitmask for Wa_22011440098 +- anv: setup stage bitmask for Wa_22011440098 + +Tatsuyuki Ishi (4): + +- radv: Implement radv_flush_before_query_copy to workaround UE Vulkan bugs. +- radv: Remove unused functions. +- radv: Remove unused radv_deferred_queue_submission. +- radv,lvp,anv: Delete pre-common dispatch leftover. + +Thomas Debesse (7): + +- r600: set clear_buffer = u_default_clear_buffer +- r600: info.stage MESA_SHADER_KERNEL as MESA_SHADER_COMPUTE +- nv50: call nir_lower_flrp +- clover: implement CLOVER_DEVICE_TYPE like RUSTICL_DEVICE_TYPE +- Revert "gallium/clover: pass -no-opaque-pointers to Clang", opaque pointers are now implemented +- gallium/clover: pass -opaque-pointers to Clang on LLVM 15 and 16 +- crocus: set clear_buffer = u_default_clear_buffer + +Thomas H.P. Andersen (15): + +- nvc0: fix a warning -Wconstant-conversion +- radeonsi: avoid a use-after-free +- d3d12: fix warnings for missing-braces on clang +- r600: fix warnings for missing-braces on clang +- ci: remove missing-braces from no-error +- util: avoid deprecated builtin has_trivial_destructor +- spirv: avoid allocating memory twice +- hasvk: Switch to the common descriptor update template struct +- panvk: Implement VK_KHR_descriptor_update_template +- spirv: mark function unused +- llvmpipe: mark function unused +- llvmpipe: drop unused function +- panfrost: avoid warning about unused function +- vulkan: avoid warning about unused function +- vulkan: drop const for descriptor set + +Timothy Arceri (24): + +- Revert "nir: Preserve offsets in lower_io_to_scalar_early" +- glsl: dont lower precision for textureGatherOffsets +- glsl: make packed varying helper needs_lowering() external +- glsl: fix location for array subscript +- nir: update nir_is_supported_terminator_condition() +- nir: support loop unrolling with inot conditions +- glsl: remove GLSL IR inverse comparison optimisations +- nir/loop_analyze: delay instruction cost calculation +- nir/loop_analyze: remove cost of redundant selects +- util/conf: enable init to zero workaround for Exanima +- util/radeonsi: enable zerovram workaround for Exanima +- radv: add radv_zero_vram workarounds for OpenGL games +- glthread: fix matrix stack depth tracking +- mesa: add EXT_debug_label support +- glthread: leave dlist dispatch in place for Begin/End +- glsl/glsl_to_nir: remove unreachable code +- glsl: move rule inside lower_packing_builtins() +- glsl: drop sub to add neg lowering in GLSL IR +- glsl: always do {CARRY,BORROW}_TO_ARITH lowering +- glsl: move lower instructions logic inside that pass +- st/glsl_to_ir: remove stale comment +- st/glsl_to_ir: remove stale include +- mesa: fix typo from adding glGetObjectLabelEXT +- nir: fix typo in lower_double options handling + +Timur Kristóf (64): + +- ac/nir/cull: Move some code from cull_bbox into helper functions. +- ac/nir/cull: Move the contents of cull_bbox into ac_nir_cull_triangle. +- ac/nir/cull: Change if condition for bounding box culling. +- ac/nir/ngg: Move LDS store of accepted flag into the inner branch. +- ac/nir/cull: Make cull functions more consistent. +- aco: Optimize branching sequence during SSA elimination. +- aco: Remove branch instruction when exec is constant non-zero. +- ac/nir/cull: Tweak phi for cull_small_primitive branch. +- ac/nir/cull: Fix typo in bounding box culling. +- aco: Add faster code path to store_lds for consecutive write mask. +- aco: Fix invalidated reference in branching sequence optimization. +- aco: Check for instructions that inhibit the branching sequence optimization. +- aco/optimizer_postRA: Don't try to optimize dead instructions. +- aco: Support s_cselect_b64 in SCC no-compare optimization. +- aco: Improve SCC nocompare optimization when SCC is clobbered. +- aco: Fix p_init_scratch for task shaders. +- vulkan, spirv: Update to Vulkan 1.3.226 and latest SPIR-V headers. +- spirv: Support EXT_mesh_shader indices and mark them per-primitive. +- spirv: Add mesh_shading capability for EXT_mesh_shader. +- spirv: Support EXT_mesh_shader mesh/task stages. +- spirv: Support EXT_mesh_shader SetMeshOutputsEXT. +- spirv: Support the CullPrimitiveEXT mesh shader built-in. +- spirv: Support TaskPayloadWorkgroupEXT storage class. +- spirv, nir: Handle EmitMeshTasksEXT opcode. +- ac/nir/ngg: Add EXT_mesh_shader primitive indices. +- ac/nir/ngg: Add EXT_mesh_shader CullPrimitiveEXT output. +- ac/nir/ngg: Add EXT_mesh_shader vertex/primitive count. +- radv: Implement EXT_mesh_shader draw calls. +- radv: Enable EXT_mesh_shader on RDNA2 with RADV_PERFTEST=ext_ms +- nir/lower_system_values: Add shortcut for 1D workgroups. +- nir/gather_info: Clear cross-invocation output mask. +- radv: Replace NV_mesh_shader macros with EXT. +- radv: Enable subgroup feature bits for mesh and task shaders. +- radv: Change max preferred task workgroup invocations to 64. +- radv: Remove dead shader temps after linking. +- ac/nir/ngg: Don't use LDS for same-invocation indices and cull outputs. +- ac/nir/ngg: Don't initialize same-invocation mesh shader outputs. +- aco/tests: Add post-RA optimizer testcase for partially overwritten VCC. +- aco/tests: Add post-RA DPP test cases with control flow. +- aco/tests: Add post-RA SCC no-compare tests cases with control flow. +- aco/optimizer_postRA: Mark a register overwritten when predecessors disagree. +- aco/optimizer_postRA: Don't assume all operand registers were written by same instr. +- aco/optimizer_postRA: Fix logical control flow handling. +- aco/optimizer_postRA: Clarify terminology. +- aco: Change inverse-comparison optimization to work with s_not +- spirv: Treat EmitMeshTasksEXT as a terminating instruction. +- radv: Use a fallback for marketing name when libdrm doesn't know it. +- ac/nir/ngg: Fix cross-invocation indices and cull outputs. +- aco: Fix build error with std::max on GCC 12 +- nir: Document the flatten/dont_flatten selection control options. +- nir: Add selection control enum for always taken divergent branches. +- aco: Allow explicitly removing jumps on GFX10+ when beneficial. +- ac/nir/ngg: Remove jumps from some branches where we know LGKMCNT==0. +- ac/nir/tess: Remove jump from tess factor writes. +- nir, ac, aco: Add ACCESS intrinsic index to load/store_buffer_amd. +- aco: Cleanup load_vmem_mubuf and store_vmem_mubuf functions. +- nir, ac, aco: Add index src to load_buffer_amd/store_buffer_amd. +- aco: Optimize MUBUF 0 offset when idxen is also being used. +- aco/optimizer_postRA: Use unique_ptr + array for instruction indices. +- aco/optimizer_postRA: Speed up reset_block() with predecessors. +- aco/optimizer_postRA: Properly handle vccz/execz/scc in reset_block. +- aco/optimizer_postRA: Delete dead instructions more efficiently. +- aco: Move is_dead to aco_ir.h to allow it to get inlined. +- aco: Add ACO_DEBUG=novalidateir option. + +Tomeu Vizoso (16): + +- radeonsi/ci: Move libva and piglit jobs to Raven Ridge boards +- radv/ci: Put one more board to run the CTS on Stoney Ridge +- iris/ci: Test on Jasper Lake Chromebooks +- iris/ci: Set FDO_CI_CONCURRENT for all Chromebook jobs +- ci: Report failures to GitLab for LAVA jobs +- ci: Make sure that the Crosvm control socket has been removed +- Revert "ci: set venus on lavapipe to manual due to flakes" +- ci: Crosvm won't remove the control socket file on stop +- Revert "Revert "Revert "ci: set venus on lavapipe to manual due to flakes""" +- ci: Uprev ci-templates +- ci: Use --no-install-recommends to avoid problems with --no-remove +- ci: Install sysvinit-core without --no-remove +- ci: Stop explicitly passing env vars to FDO_DISTRIBUTION_EXEC command +- gallium: Add PIPE_CAP_QUERY_TIMESTAMP_BITS +- ci: Run RadeonSI jobs on changes to src/amd/ci +- radeonsi,radv/ci: Increase coverage + +Victor Hermann Chiletto (1): + +- radv: refactor descriptor set layout tracking + +Vinson Lee (10): + +- bin/perf-annotate-jit.py: Update to Python 3. +- zink: Remove duplicate variable zero. +- pan/bi: Fix memory leaks. +- egl/dri2: Fix missing return with dri2_egl_error_unlock. +- radv: Use count_tes_user_sgprs return value. +- d3d12: Initialize d3d12_video_encoder_references_manager_hevc::m_current_frame_idx. +- radv: Fix file descriptor leak. +- zink: Remove duplicate variable zm. +- r600/sfn: Initialize TestInstrFromString member m_instr_factory. +- v3d: Fix initializer-overrides warning. + +Väinö Mäkelä (1): + +- hasvk: Correctly set NonPerspectiveBarycentricEnable on gfx7 + +Xin LI (1): + +- meson: Improve detection of qsort_r(). + +Yiwei Zhang (52): + +- venus: increase busy wait order in vn_relax +- venus: double the abort timeout to allow long shader compiles +- venus: fix external memory ext filtering +- venus: require necessary extensions for common wsi support +- venus: avoid feedback for external fence +- venus: use a separate sync fence for Android wsi +- venus: lazily create queue wait fence and make it non-external +- venus: avoid pre-allocating the feedback pool +- venus: sync to latest venus protocol headers for syncFdFencing +- venus: stub out renderer sync_fd fencing features +- venus: put android wsi on the sub-optimal path +- venus: fix vn_GetFenceFdKHR +- venus: fix vn_GetSemaphoreFdKHR +- venus: query renderer sync_fd props to fill the feature stubs +- venus: re-implement sync_fd external fence +- venus: re-implement sync_fd external sempahore +- venus: avoid scrubing wsi/external sempahores +- venus: allow no external memory handle when renderers lacks support +- venus/ci: mark a test as flake +- venus: some clang format fixes +- venus: enable zink required extensions on Android +- venus: enable KHR_driver_properties on Android +- venus: double the abort timeout +- loader: use os_get_option for driver override +- zink: fix core support on Android +- zink: fix zink_create_fence_fd to properly import +- zink: fix in-fence lifecycle +- zink: implement fence_get_fd required by EGL android platform +- venus: clean up vn_QueueSubmit +- venus: force synchronous submission for external signal semaphore +- venus: fix an oob in descriptor pool state restoration +- venus: vn_ResetDescriptorPool to reset mutable type states +- venus: zero out the pipeline handles +- venus: avoid fixing pipeline if not derivative +- venus: ignore pInheritanceInfo if not secondary command buffer +- venus: ensure to clear VkPipelineCreationFeedback::flags if needed +- mesa/st: emit string marker based on PIPE_CAP_STRING_MARKER +- meson: libelf is optional +- venus/ci: remove redundant crosvm patch +- venus: move vendor ext to bottom and sort the ext list +- docs: update venus extension support +- docs: move VK_EXT_extended_dynamic_state3 out from 1.3 +- ci/venus: enable render server for venus testing +- venus: avoid accessing local var in VN_ADD_EXT_TO_PNEXT_OF +- venus: remove redundant codes +- venus: handle VkAndroidHardwareBufferFormatProperties2ANDROID +- venus: sync to latest venus protocol headers +- venus: add VK_EXT_primitives_generated_query support +- venus: add VK_EXT_depth_clip_control support +- docs: update to latest venus driver support +- venus: handle VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT +- venus: fix android wsi with global fencing disabled + +Yonggang Luo (218): + +- meson: Place all pre_args about HAVE_DRI* into single place +- meson: Cleanup usage dri_drivers as it's a dead option +- d3d12: Fixes compile error with mingw/gcc-x64 when static linkage to runtime library +- meson: clc compiler test only works on win32 +- microsoft/clc: Fixes compiling errors with clang/mingw64 in clc/clc_compiler_test.cpp +- util: open_memstream also support on MacOS +- c11: Fixes the declaration of timespec_get in time.h for macOS +- egl: Remove eglextchromium.h and import eglext_angle.h +- egl: wire up ANGLE_sync_control_rate +- egl/x11: Split declaration of dri2_format_for_depth into header platform_x11.h +- egl/x11: implement ANGLE_sync_control_rate +- ci/windows: Setup proper MESA_IMAGE_TAG for Windows build/test image +- c11: It's should detect PTHREAD_DESTRUCTOR_ITERATIONS when define TSS_DTOR_ITERATIONS +- c11: call impl_tss_dtor_invoke with tls callback for win32 +- c11: Implement and use of thrd_current properly on win32 +- util: Implement util_thread_get_time_nano on win32 +- util: Fixes typo in function os_get_total_physical_memory +- util: Implement os_get_available_system_memory on Win32 +- util: Use \`DETECT_OS_WINDOWS` instead \`WIN32` in u_process.c +- util: Enable tests that disabled for msvc/mingw +- zink: Use DETECT_OS_WINDOWS instead of 'WIN32' +- meson: Enable graw tests on mingw +- util: Add os_create_anonymous_file support on win32 +- nine: Fixes compile of nine_pipe.h with clang ../../src/gallium/frontends/nine/nine_pipe.h:377:9: error: unannotated fall-through between switch labels [-Werror,-Wimplicit-fallthrough] default: ^ ../../src/gallium/frontends/nine/nine_pipe.h:377:9: note: insert 'break;' to avoid fall-through default: +- panfrost: Do no use designated initializer for union +- intel: Fixes compile error of aubinator_viewer.cpp with gcc +- util: Fixes memory leak in create_cache_item_header_and_blob +- util: Fixes memory leak in u_printf_test.cpp +- util: Fixes invalid assumption that return non null by function util_format_fetch_rgba_func +- util: Disable usage of __attribute__((__const__)) when the compiler is clang +- ci: Improve MacOS github ci +- ci: Enable llvmpipe on MacOS +- c11: Remove mtx_try from mtx enums as it's not a part of c11 standard +- c11: #include <threads.h> when the os/platform provide it +- anv: Fixes struct anv_device::info is not initialized with struct anv_physical_device +- meson: Use different STRACEDIR folder for intel_devinfo_override_test +- meson: add enable-glcpp-tests option +- ci: disable glcpp tests for now +- Revert "ci: disable unit tests" +- util: Use literal instead call to list_inithead in u_queue.c +- mapi: Remove unused ENTRY_CURRENT_TABLE_GET +- mapi: Remove usage of _glapi_check_multithread and _glapi_destroy_multithread +- mapi: Remove reference to unused _gl_DispatchTSD +- mapi: Move shared _glapi_set_context and _glapi_set_dispatch into u_current.c +- mapi: Move shared stub into u_current.c +- mapi: Access _glapi_tls_Dispatch directly without need of macro +- docs: Remove document about USE_ELF_TLS +- mapi: Improve comment about _glapi_tls_Dispatch and _glapi_tls_Context +- aco: Use unreachable instead assert(false) +- amd/compiler: Fixes warning [-Wunused-variable] in test_to_hw_instr.cpp +- amd/compiler: Fixes warning [-Wunused-variable] in test_optimizer_postRA.cpp +- nir: Fixes [-Wdeprecated-declarations] in serialize_tests.cpp +- util: Fix [-Wdeprecated-declarations] in mesa-sha1_test.cpp +- glsl: Fixes [-Wdeprecated-declarations] in list_iterators.cpp +- intel/compiler: Fixes [-Wdeprecated-declarations] in test_eu_compact.cpp +- intel/compiler: Fixes [-Wdeprecated-declarations] in test_eu_validate.cpp +- virgl: Fixes [-Wdeprecated-declarations] in virgl_staging_mgr_test.cpp +- osmesa: Fixes [-Wdeprecated-declarations] in test-render.cpp +- ci: Remove -Wno-error=deprecated-declarations cpp option from debian-clang +- util: Remove unused rtasm_cpu_has_sse2 +- util: Rename util_cpu_detect to _util_cpu_detect_local +- util: Improve has_sse has_sse2 has_sse3 has_sse4_1 has_avx disable handling +- util: Handling GALLIUM_NOSSE in u_cpu_detect.c +- util: Handling LP_FORCE_SSE2 in u_cpu_detect.c +- llvmpipe: Remove the hack for LLVM4 in lp_bld_init.c +- gallium: Fixes memory leak of translate_test.c +- util: Use environment variable GALLIUM_OVERRIDE_CPU_CAPS to remove usage of util_cpu_caps in translate_test.c +- util: Move detect_done out of struct util_cpu_caps_t +- util: Add api util_call_once_with_context +- util: Add extern "C" guard in simple_mtx.h +- util: Remove the need of _MTX_INITIALIZER_NP in simple_mtx.h +- meson/panfrost: Add dep_valgrind for libpanfrost_pixel_format to fixes the compiling error: +- llvmpipe: Fixes warning [-Wimplicit-fallthrough] in lp_bld_swizzle.c +- llvmpipe: Fixes warning [-Wimplicit-fallthrough] in lp_bld_sample_soa.c +- radeonsi: Fixes warning [-Wimplicit-fallthrough] in si_descriptors.c +- freedreno: Fixes warning [-Wimplicit-fallthrough] in freedreno_util.h +- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd4_texture.h +- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd2_texture.c +- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd3_texture.c +- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd5_texture.h +- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd5_emit.c +- freedreno: Fixes warning [-Wimplicit-fallthrough] in fd6_emit.c +- vdpau: Fixes warning [-Wimplicit-fallthrough] in vdpau_private.h +- v3dv: Fixes warning [-Wimplicit-fallthrough] in v3dvx_pipeline.c +- v3dv: Remove redundant unreachable in v3dvx_pipeline.c +- st: Fixes warning [-Wimplicit-fallthrough] in st_texture.c +- llvmpipe: Remove llvm version guard LLVM_VERSION_MAJOR <=> 4 +- meson: bump the minimal required llvm version to 5.0.0 +- util: Refactoring check_cpu_caps_override out +- util: Add ssse3 and avx cpu caps override option +- util: Move the detect of max_vector_bits into u_cpu_detect.c +- util: add support for detecting avx512 vector bit size +- llvmpipe: Always using util_get_cpu_caps to get cpu caps for llvm on x86 +- llvmpipe: Enable avx512 support in llvm +- docs: document LP_NATIVE_VECTOR_WIDTH GALLIUM_OVERRIDE_CPU_CAPS GALLIUM_NOSSE LP_FORCE_SSE2 +- ci: EXTRA_OPTIONS -> EXTRA_OPTION for debian-clang +- ci: Enable GALLIUM_DUMP_CPU=true only in the clang job +- ci: Enable all possible meson build options for clang +- ci: Add debian-clang-release build pipeline +- ci/fedora: Re-enable vulkan-layers=device-select,overlay +- gallium/hud: Fixes compile error in hud/hud_context.c: +- util: Fixes comment about debug_get_option +- mesa: direct use of mtx_recursive is not c11 conformance +- radeonsi: direct use of mtx_recursive is not c11 conformance +- svga: direct use of mtx_recursive is not c11 conformance +- c11: Update the values of enum mtx_plain, mtx_recursive, mtx_timed to make sure mtx_recursive != mtx_plain | mtx_recursive +- drm-shim: drop gnu99 override +- clover: Fixes use of designated initializers requires in c++ that doesn't support by MSVC +- radv: Remove the redundant #include <gelf.h> and #include <libelf.h> in ac_binary.c +- aco: Convert to use u8 literal for Unicode character to fixes msvc warning +- util: Add util_call_once for optimize call to util_call_once_with_context out for hot path +- util: getting get_tracefile in u_trace.c to be thread safe +- util: Remove the macro DEBUG_GET_ONCE_FILE_OPTION +- radv/r600/clover: Getting libelf to be optional +- radv: Getting radeon_icd to be generated properly on win32 +- dzn: rename module_dir to icd_lib_path for consistence +- lavapipe/meson: rename module_dir to icd_lib_path and swap two lines for consistence +- Update 22.2.0.rst +- ci: debian-clang: drop -Wno-error=unused-function +- r600/sfn: Fixes 'numeric_limits' is not a member of 'std' error in sfn/sfn_instr.cpp +- aco: Fixes compiling error about char8_t with c++20 +- ci/docs: Stick to specific version of python packages +- util: Remove the include of windows.h when compiling with MSVC +- util: Implement atomic operations consistently across compilers and testing for it +- util/mesa/wide: Rename _SIMPLE_MTX_INITIALIZER_NP to SIMPLE_MTX_INITIALIZER +- util: Turn -DWINDOWS_NO_FUTEX to be pre_args +- ci/windows: Enable gles1 for msvc +- ci/windows: Remove -Dlibelf:warning_level=1 as libelf subproject are already removed +- ci/windows: Getting the default supported windows version to be 7 when using mingw +- gallium: delete graw +- nir: No need redefine snprintf anymore in nir.h +- d3d10ump: No need define snprintf anymore in InputAssembly.cpp +- util: Move the implementation of futex_wake and futex_wait from futex.h to futex.c +- docs: Remove graw related words +- docs: Remove rbug related documents +- nouveau: Remove XVMC_VL environment variable usage +- docs: Remove document about XvMC +- meson: add dep_valgrind dependency for 'dxil_compiler' +- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_stack.c +- util: Call function mtx_init on exit_mutex to replace the usage of _MTX_INITIALIZER_NP in u_queue.c +- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_symbol.c +- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_stack_android.cpp +- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in u_debug_memory.c +- util: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_M in u_debug_refcnt.c +- mapi: Use util_call_once to init exec_mem and mutex instead _MTX_INITIALIZER_NP +- mapi: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER in stub.c +- gallium/auxiliary: Remove the need of _MTX_INITIALIZER_NP by using simple_mtx_t/SIMPLE_MTX_INITIALIZER +- egl/wgl: Use util_call_once_data to replace usage of mtx_t glFlushMutex +- egl/dri2: Use util_call_once_data to replace usage of mtx_t glFlushMutex +- mesa: Use util_call_once_data instead of mtx_t init_once_lock +- util: #include "c11/threads.h" when necessary in simple_mtx.h +- util: #include "util/macros.h" in u_cpu_detect.h for ATTRIBUTE_CONST +- c11: Remove the code for Windows XP +- meson/mesa/wide: define WIN32_LEAN_AND_MEAN with pre_args +- util: Remove the guard WIN32_WINNT >= 0x0600 in u_thread.h +- c11: Getting the macro guard of HAVE_PTHREAD in c11/threads.h to be consistence with util/u_uthread.h +- util: #include <windows.h> in disk_cache_os.c +- llvmpipe: #include <windows.h> in lp_rast.c +- util: Remove include of windows.h in memstream.h +- util: Remove include of windows.h in rwlock.h +- util: Move the function implementation from u_thread.h to u_thread.c +- glx: Guard usage of infer_zink && !explicit_zink in glxext.c +- glx: Trim trailing spaces in in glx/glxext.c +- glx: Remove redundant #ifndef GLX_USE_APPLEGL in glx/glxext.c +- glx: Adopt guard of glx_direct and glx_accel in #if defined(GLX_DIRECT_RENDERING) && !defined(GLX_USE_APPLEGL) +- glx: Adot #ifdef GLX_USE_WINDOWSGL in #if defined(GLX_DIRECT_RENDERING) && !defined(GLX_USE_APPLEGL) +- st: Remove st_api_type because ST_API_OPENVG is not an option anymore +- ci: libxvmc-dev is not a dependence anymore +- meson: upgrade zlib wrap +- ci: upgrade zlib to 1.2.13 in mesa_deps_build.ps1 +- ci: Add Get-Date in each package build in mesa_deps_build.ps1 +- ci: Move building libva before clone llvm-project in mesa_deps_build.ps1 +- ci: Install the clang-cl compiler within mesa_deps_vs2019.ps1 +- ci: Trigger building visual studio docker image +- meson: with_glx never assigned to 'gallium-xlib', so need add with_glx == 'xlib' for dep_glproto +- meson: with_glx never assigned to 'gallium-xlib', remove 'gallium-xlib' +- xlib: Remove macro guard with GLX_INDIRECT_RENDERING in xlib.c +- ci/macos: Building both dri and xlib version of glx +- hgl: Remove non-nused hgl_create_st_api +- st: move function st_visual_have_buffers into st_manager.c +- st: OpenVG are removed, remove st_api::profile_mask as it's constant +- st: OpenVG are removed, remove st_api::feature_mask as it's constant +- st: Remove st_api::destroy +- st: Turn st_api::destroy_drawable to be direct accessed function st_api_destroy_drawable +- st: Turn st_api::get_current to be direct accessed function st_api_get_current +- st: Turn st_api::make_current to be direct accessed function st_api_make_current +- st: Turn st_api::create_context to be direct accessed function st_api_create_context +- st: Turn st_api::query_versions to be direct accessed function st_api_query_versions +- xlib: Implement xmesa_get_name by return literal "Mesa " PACKAGE_VERSION directly +- st: Remove struct st_api, st_gl_api_create and st_gl_api.h +- aco: Do not define NOMINMAX as it's already defined in pre_args now +- util: Remove non used UTIL_MAX_L3_CACHES in u_thread.h +- util: Merge duplicated function prototype in u_thread.h that named with util_barrier* +- util: Move UTIL_MAX_CPUS from u_thread.h to u_cpu_detect.h +- util: #include "c11/threads.h" instead of #include "util/u_cpu_detect.h" u_format.c +- xlib: Fixes compiling on linux +- xlib: Remove the linkage hack in lib.c by use meson link_whole option +- gallium: Remove mgl* prefix in linker script osmesa.sym and libgl-xlib.sym +- xlib: Indent with space and trim trailing spaces of xlib files +- ci/macOS: Getting the installed binary to be artifacts +- ci: Getting debian-clang-release to building with glx=xlib +- xlib: Getting xmesa_strict_invalidate to be function +- xlib: Remove the usage of global constructor in xlib.c, so the xm_public.h are removed +- docs: -Ddri-drivers= option is deprecated, do not use it anymore +- docs: There is only glx=xlib option now +- docs: There is no more swrast driver +- android: -Ddri-drivers= is not needed anymore +- glx: Fixes apple/apple_visual.c that include non-exist util/debug.h +- ci: Fixes macos.yml +- radv: Fixes prototypes +- meson: Refactoring shared gen_vs_module_defs_normal_command out +- meson: fixes mingw-clang32 building +- clover: Rename \*OpenCL.def to \*OpenCL.def.in +- clover: Fixes building with mingw-x86 +- meson: Fixes name_prefix for clover on mingw +- clover: empty soversion when on win32 +- util: Test __PPC64__ for getting PIPE_ARCH_PPC_64 respond to __PPC64__ take effect +- gallivm: use valid LLVM_VERSION_MAJOR instead of LLVM_VERSION + +Yusuf Khan (7): + +- nouveau: implement and enable PIPE_CAP_QUERY_MEMORY_INFO +- tu: remove tu_syncobj_to_fd +- nouveau: get timestamp from os_time_get_nano instead of os_time_get +- nouveau: put nv04 push macros in nouveau_winsys +- nv50/ir: nir_op_b2i8 and nir_op_b2i16 +- gallium/util: add a helper for get_timestamp +- nouveau: enable PIPE_CAP_UMA when appropriate + +jenneron (1): + +- docs/panfrost: Document Mali-T620 support + +noasakurajin (1): + +- disable zinks shader cache when the needed functions do not exist + +nyanmisaka (2): + +- gallium: Do not include offscreen winsys if glx is disabled +- radeonsi: Inherit the old modifier when reallocating the texture. + +semjon00 (4): + +- hasvk: don't export flush_compute_state +- hasvk: don't export gfx state flushing helper +- hasvk: combine flushes in Draw/DrawIndexed/DrawIndirectByteCountEXT +- hasvk: force inline more pipe flush functions + +sjfricke (2): + +- isl: fix bug where sb.MOCS is not being set +- anv: fix assert to build with shader cache disabled diff --git a/lib/mesa/docs/relnotes/22.3.1.rst b/lib/mesa/docs/relnotes/22.3.1.rst new file mode 100644 index 000000000..cadb451c8 --- /dev/null +++ b/lib/mesa/docs/relnotes/22.3.1.rst @@ -0,0 +1,207 @@ +Mesa 22.3.1 Release Notes / 2022-12-14 +====================================== + +Mesa 22.3.1 is a bug fix release which fixes bugs found since the 22.3.0 release. + +Mesa 22.3.1 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.3.1 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + 3c9cd611c0859d307aba0659833386abdca4c86162d3c275ba5be62d16cf31eb mesa-22.3.1.tar.xz + + +New features +------------ + +- None + + +Bug fixes +--------- + +- Intel Vulkan driver failed to handle robustness behavior for single-layered array images +- ANV: single-layer 2D_ARRAY VkImageViews read from base layer instead of out-of-bounds result with robustImageAccess2. +- RADV: Doom Eternal: missing reflection with raytracing +- DG2 raytracing incorrect rendering on Sascha Willems reflection demo +- Intel Vulkan driver: Depth-Stencil MSAA mask behavior changes +- [ANV] [A770] The Witcher 3 is a black screen and causes GPU reset +- vulkan/hasvk: vkquake: ../src/util/bitpack_helpers.h:61: util_bitpack_uint: Assertion \`v <= max' failed. +- [RADV] Death Stranding: Director's Cut (Steam) freezes 6800XT (ring gfx_0.0.0 timeout) +- [regression] [bisected] i915 0000:00:02.0: [drm] GPU HANG: ecode 12:1:859ffffb, in CanvasRenderer +- [ANV] Geometry corruption/flickering in GravityMark on GFX12+ +- NIR validation failed after nir_lower_subgroups + + +Changes +------- + +Alyssa Rosenzweig (1): + +- pan/mdg: Emulate 8-bit with the 16-bit pipe + +Bas Nieuwenhuizen (1): + +- radv: Handle nodes with 2 invalid children in internal node converter. + +Boyuan Zhang (1): + +- radeonsi: disable av1 decode for navi24 + +Chia-I Wu (1): + +- nir: fix nir_link_varying_precision + +Connor Abbott (1): + +- freedreno/fdl: Set sRGB bit for storage images + +Daniel Stone (1): + +- ci: Rebalance radv/grunt testing + +Danylo Piliaiev (3): + +- ir3: Reduce the maximum allowed imm offset for shared var load/store +- nir/nir_opt_offsets: Prevent offsets going above max +- meson: Do not enable drm for KGSL Turnip build + +Dave Airlie (1): + +- glx: fix xshm check to init xshm_opcode. + +Dawn Han (1): + +- venus: fix deqp tests failed on iub descriptor type + +Eric Engestrom (3): + +- docs: add release notes for 22.3.0 +- .pick_status.json: Update to 264a0cabd1dd2a619f24e34f21f767c6bf36fe76 +- v3dv: skip some invalid tests + +Erik Faye-Lund (1): + +- zink: don't use defunct custom-flag + +Friedrich Vock (2): + +- nir: Do not consider phis with incompatible dests equal +- radv: Handle NULL miss shaders + +Georg Lehmann (1): + +- aco: Use wave size specific opcode for s_or in cube map coord code. + +Iago Toral Quiroga (2): + +- v3dv: make single-sync paths more explicit +- v3dv: fix job serialization for single sync path + +Ian Romanick (1): + +- glsl_to_nir: Fix NIR bit-size of ir_triop_bitfield_extract and ir_quadop_bitfield_insert + +Iván Briano (2): + +- vulkan: VkPolygonMode has a bit more than two values +- hasvk: pipelineStageCreationFeedbackCount is allowed to be 0 + +Jan Alexander Steffens (heftig) (2): + +- d3d12: Don't crash when libd3d12.so can't be found +- dzn: Don't crash when libd3d12.so can't be found + +Jan Palus (1): + +- vdpau: allow building vdpau st with virgl only + +Jason Ekstrand (4): + +- nir: Use nir_const_value_for_int in nir_lower_subgroups +- st/mesa: Use nir_const_value_for_bool() in ATIFS +- r600/nir: Fix u64vec2 immediate lowering +- dxil: Use nir_const_value_for_uint in dxil_nir_lower_int_samplers + +Kenneth Graunke (1): + +- intel/compiler: Set NoMask on cr0 access for float controls mode + +Konstantin Seurer (1): + +- radv/rra: Set the metadata size correctly + +Lionel Landwerlin (11): + +- iris: move bindless surface state heap inside the surface state heap +- isl: don't report I915_FORMAT_MOD_Y_TILED_CCS on Gfx8 +- anv: correctly predicate ray tracing +- intel: factor out dispatch PS enabling logic +- intel: add missing restriction on fragment simd dispatch +- Revert "anv: compile anv_acceleration_structure.c" +- intel/nir/rt: fixup primitive id +- anv: fixup descriptor copies +- isl: make Wa_1806565034 conditional to non robust access +- intel/fs: make Wa_1806565034 conditional to non robust access +- anv: disable Wa_1806565034 when robustImageAccess is enabled + +Marcin Ślusarz (4): + +- intel/compiler: adjust [store|load]_task_payload.base too +- nir/lower_task_shader: allow offsetting of the start of payload +- intel/compiler: user payload starts after TUE header & its padding +- nir/lower_task_shader: fix task payload corruption when shared memory workaround is enabled + +Otavio Pontes (1): + +- anv/hasvk: Clamping Scissor Rect values in a valid range + +Peng Huang (1): + +- intel: Fix crashes for importing drm buffer + +Rhys Perry (2): + +- aco: more carefully apply constant offsets into scratch accesses +- aco/ra: don't swap p_create_vector operand with definition blocker for scc + +Ruijing Dong (2): + +- radeonsi/vcn: adding av1 decoding film grain block +- frontends/va: fix gst videotestsrc h264 enc fail issue. + +Samuel Pitoiset (6): + +- radv: fix emitting invalid color attachments +- radv: do not set ZPASS_INCREMENT_DISABLE on GFX11 +- radv: use LATE_Z for depth/stencil attachments used in feedback loops +- radv: fix guardband if the polygon mode is points or lines +- radv: fix hashing descriptor set layout +- radv: do not remove the PointSize built-in for polygon mode as points + +Sathishkumar S (4): + +- radeonsi/vcn: enable yuv422 jpeg decode +- radeonsi/vcn: fix regression in yuv422 jpeg decode +- frontends/va: fix crash during grayscale rendering +- gallium/vl: return the buffer plane order for yuv444p format + +Sviatoslav Peleshko (1): + +- anv: Defer flushing PIPE_CONTROL bits forbidden in CCS while in GPGPU mode + +Tapani Pälli (2): + +- intel/fs: implement Wa_14017989577 +- anv: emit sample mask state independent of fragment stage diff --git a/lib/mesa/docs/relnotes/22.3.2.rst b/lib/mesa/docs/relnotes/22.3.2.rst new file mode 100644 index 000000000..9ff126bcb --- /dev/null +++ b/lib/mesa/docs/relnotes/22.3.2.rst @@ -0,0 +1,115 @@ +Mesa 22.3.2 Release Notes / 2022-12-29 +====================================== + +Mesa 22.3.2 is a bug fix release which fixes bugs found since the 22.3.1 release. + +Mesa 22.3.2 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.3.2 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + c15df758a8795f53e57f2a228eb4593c22b16dffd9b38f83901f76cd9533140b mesa-22.3.2.tar.xz + + +New features +------------ + +- None + + +Bug fixes +--------- + +- Mesa 22.3.0 SEGFAULT in nir shader creation for r600 cards on FreeBSD +- radv: Doom Eternal randomly hangs GPU with RT on +- etnaviv: mesa-22.3.0 regression on i.MX8M Mini +- Static linking LLVM 15 fails with unresolved symbols if not building CLC +- Microsoft CLC/clover: Fails to link against Clang 15 libraries + + +Changes +------- + +Alyssa Rosenzweig (1): + +- panfrost: Align WSI strides for tiled AFBC + +Emma Anholt (1): + +- zink: Fix reversed cap declarations for ImageBuffer + +Eric Engestrom (3): + +- docs/relnotes: add sha256sum for 22.3.1 +- docs/relnotes: add 22.3.1 to index +- .pick_status.json: Update to def474e916b69b86ed49c895c7418ba54a0e8252 + +Friedrich Vock (3): + +- radv/rt: PLOC fixes for active_leaf_count +- radv/rt: Execute memory barrier before updating the phase end count +- radv/rt: Refactor exiting PLOC + +Gert Wollny (1): + +- r600/sfn: Don't try to re-use the iterator when uses is updated + +Iago Toral Quiroga (6): + +- v3dv: always store/restore attachment state during meta operations +- v3dv: don't resolve by averaging samples on depth/stencil resolves +- v3dv: handle depth/stencil resolves we can't implement via TLB +- v3dv: honor render area in subpass resolve fallback +- v3dv: pipeline creation feedback may not request all stages +- v3dv: be more careful when restoring dirty state after meta operations + +Konstantin Seurer (1): + +- radv/rra: Emit leaf node ids for leaf nodes instead of internal nodes + +Lionel Landwerlin (2): + +- anv: fixup another dirty issue with gpu_memcpy +- anv: handle null push descriptors in deferred optimization + +Liviu Prodea (1): + +- OpenCL/draw module: Support linking with LLVM and clang 15 static libraries + +Lucas Stach (1): + +- etnaviv: disable linear PE by default + +Marek Vasut (1): + +- etnaviv: Do expose 2D texture support on pre-halti GPUs + +Pavel Ondračka (1): + +- r300: be more careful with presubtract and non-native swizzles + +Rhys Perry (2): + +- aco/gfx11: export mrtz in discard early exit for non-color shaders +- radv/gfx11: disable mesh shaders + +Rohan Garg (1): + +- anv: Ensure we clear ANV_PIPE_PSS_STALL_SYNC_BIT on flush + +Timur Kristóf (3): + +- aco: Don't accept constants on p_bpermute. +- radv: Destroy mesh shader scratch ring in radv_queue_state_finish. +- radv: Include view index SET_SH_REG in ace_predication_size. diff --git a/lib/mesa/docs/relnotes/22.3.3.rst b/lib/mesa/docs/relnotes/22.3.3.rst new file mode 100644 index 000000000..f74090454 --- /dev/null +++ b/lib/mesa/docs/relnotes/22.3.3.rst @@ -0,0 +1,181 @@ +Mesa 22.3.3 Release Notes / 2023-01-11 +====================================== + +Mesa 22.3.3 is a bug fix release which fixes bugs found since the 22.3.2 release. + +Mesa 22.3.3 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.3.3 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + bed799788bf2bd9ef079d97cd8e09348bf53cb086818578e40773b2b17812922 mesa-22.3.3.tar.xz + + +New features +------------ + +- None + + +Bug fixes +--------- + +- radeonsi: Broken graphics in game ibb & obb with mesa_glthread=true +- Hotline Miami 2 crashes on startup. (\`mesa_glthread` defaults true) +- [RadeonSI] Dune: Spice Wars blue/purple screen +- iris: assertion failure in iris_resource_from_memobj +- Limbo (48000) goes white when starting a new game (Raven Ridge 2400G) +- plasmashell sometimes hangs with mesa_glthread +- virgl: enum pipe_shader_type reorder completely breaks virglrenderer +- radv: Crash when creating graphics pipeline libraries with module identifier +- Segmentation fault in KiCAD with version mesa-22.3+ +- radeonsi glthread crash regression +- Uninitialized barrier.dst_stage_mask in radv_cmd_buffer_resolve_subpass_fs + + +Changes +------- + +Adam Stylinski (1): + +- nv30: Fix an offset for vbos being applied to a buffer twice + +Asahi Lina (1): + +- kmsro: Fix renderonly_scanout BO aliasing + +Daniel Schürmann (1): + +- aco: fix reset_block_regs() in postRA-optimizer + +Danylo Piliaiev (1): + +- tu: Fix varyings interpolation reading stale values + +Emma Anholt (2): + +- zink: Only expose PIPE_CAP_IMAGE_ATOMIC_FLOAT_ADD if we can actually add. +- zink: Only expose PIPE_CAP_SHADER_ATOMIC_INT64 if we can do shared and ssbos. + +Eric Engestrom (4): + +- docs/relnotes: add sha256sum for 22.3.2 +- .pick_status.json: Update to 38d6185432d1f19a5653b3892069cd350187f5b8 +- gen_release_notes: strip second newline in new features +- meson: add missing dependency + +Felix DeGrood (2): + +- anv: Emit CS stall on INTEL_MEASURE timestamp +- hasvk: Emit CS stall on INTEL_MEASURE timestamp + +Gert Wollny (1): + +- virgl: Use virgl host side shader stage IDs when reading caps + +Iago Toral Quiroga (1): + +- v3dv: fix alpha-to-one for single sample setup + +Konstantin Kharlamov (5): + +- bin/gen_release_notes.py: do not end "features" with "None" +- bin/gen_release_notes.py: do not fail on confidential features +- bin/gen_release_notes.py: don't fail if "Closes" refers to an MR +- bin/gen_release_notes.py: parse "Fixes" tags as well as "Closes" +- bin/gen_release_notes.py: read Closes/Fixes tags case-insensitively + +Lionel Landwerlin (4): + +- anv: don't nullify entries +- anv: check that push range actually match binding considered +- anv: return properly typed value for no ubo promoted +- anv: use the null surface with unused push descriptor binding table entries + +Lucas Stach (1): + +- etnaviv: blt: properly use upper half of clear value + +Marek Olšák (1): + +- radeonsi: rewrite si_update_ps_colorbuf0_slot to fix crashes and recursions + +Maíra Canal (1): + +- v3dv: initialize fd variable for proper error handling + +Mike Blumenkrantz (5): + +- zink: only update framebuffer object during swapchain update if framebuffer exists +- zink: protect against invalid scissored renderpass clears +- lavapipe: add some safety asserts when beginning rendering +- zink: stop using VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT +- zink: set surface->dt when updating swapchain + +Pierre-Eric Pelloux-Prayer (4): + +- egl: handle NULL loaderPrivate in dri_is_thread_safe +- util/00-mesa-defaults: add Limbo workaround +- mesa: add missing count_scale attribute +- mesa: add missing count_scale attributes + +Qiu Wenbo (2): + +- vc4: Fix RADDR_A field extraction of branch instruction +- vc4: Fix running process_mux_deps on irrelevant type of instructions + +Rhys Perry (1): + +- aco/gfx11: update s_code_end padding + +Ruijing Dong (5): + +- gallium: add film_grain_target into av1 dec desc +- frontends/omx: initialize film_grain_target +- frontneds/va: use current_display_picture from VA for film grain +- frontends/va: pass in film_grain_target as new output +- radeonsi/vcn: av1 film_grain output fix + +Samuel Pitoiset (4): + +- radv: fix missing initialization of radv_resolve_barrier::dst_stage_mask +- radv: fix multiple resolves in the same subpass +- radv: fix returning VK_PIPELINE_COMPILE_REQUIRED from library +- radv: fix re-emitting tessellation domain origin when it's dynamic + +Sil Vilerino (1): + +- frontends/va: Update state var frame_num disregarding cap check + +Tapani Pälli (1): + +- iris: let isl set tiling mode for external resources + +Timothy Arceri (2): + +- util/00-mesa-defaults: add Metal Slug XX workaround +- util/driconf: add Dune: Spice Wars workaround + +Väinö Mäkelä (1): + +- intel: Fix a hang caused by invalid dispatch enables on gfx6/7 + +Xaver Hugl (1): + +- driconf: add a workaround for plasmashell freezing + +Yiwei Zhang (2): + +- lvp: properly ignore sampler write for immutable sampler +- venus: properly ignore the sampler for immutable sampler diff --git a/lib/mesa/docs/relnotes/22.3.4.rst b/lib/mesa/docs/relnotes/22.3.4.rst new file mode 100644 index 000000000..6c49bd7f8 --- /dev/null +++ b/lib/mesa/docs/relnotes/22.3.4.rst @@ -0,0 +1,212 @@ +Mesa 22.3.4 Release Notes / 2023-01-26 +====================================== + +Mesa 22.3.4 is a bug fix release which fixes bugs found since the 22.3.3 release. + +Mesa 22.3.4 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. + +Mesa 22.3.4 implements the Vulkan 1.3 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA256 checksum +--------------- + +:: + + TBD. + + +New features +------------ + +- None + + +Bug fixes +--------- + +- anv: Performance issue with Vulkan on Wayland KWin +- VAAPI HEVC encode broken since 22.3 +- GPU HANG: ecode 12:1:859ffffb (Resetting rcs0 for stopped heartbeat on rcs0) - reproducible +- zink: src/gallium/auxiliary/pipebuffer/pb_slab.c:138: Assertion failed: \`heap < slabs->num_heaps` +- [zink] Assertion \`heap < slabs->num_heaps' failed on Pascal (bisected) +- [RADV] Incorrect copies to/from compressed textures with mipmaps +- mesa_glthread=true and probably ANY id Tech 3 engine games, offroad... +- radeonsi: VRAM Leak/abnormally high usage in Minecraft mod pack +- libmesa_util depends on gallium +- debug build compilation failed: inlining failed in call to ‘always_inline’ ‘src_is_ssa’: indirect function call with a yet undetermined callee +- radv: Missing implementation of VkImageSwapchainCreateInfoKHR and VkBindImageMemorySwapchainInfoKHR + + +Changes +------- + +Alejandro Piñeiro (1): + +- vulkan/wsi: check if image info was already freed + +Alyssa Rosenzweig (5): + +- pan/bi: Fix incorrect compilation of fsat(reg.yx) +- panfrost: Enable NV_primitive_restart on Valhall +- panfrost: Fix logic ops on Bifrost +- panfrost: Fix clears with conditional rendering +- mesa: Set info.separate_shader for ARB programs + +Bas Nieuwenhuizen (1): + +- aco: Pass correct number of coords to Vega 1D LOD instruction. + +Danylo Piliaiev (1): + +- tu/kgsl: do not use kgsl_command_object::offset + +Dave Airlie (5): + +- lavapipe: fix descriptor set layout reference counting in layout merge +- lavapipe: fix merged layout ref counting +- llvmpipe: unref images correctly in the fragment shader path +- vk_cmd_queue: add a callback to free push descriptors set khr +- lavapipe: fix one more descriptor set reference issue + +Emma Anholt (5): + +- docs/perfetto: Fix the name of the i915 render stages data sources. +- zink: Add missing Flat decorations on some inputs. +- zink: Fix validation failure for maxLod < minLod. +- zink: Fix up mismatches of memory model vs addressing model. +- zink: Re-emit the SpvBuiltInSampleMask access chain each load. + +Eric Engestrom (4): + +- docs/relnotes: add sha256sum for 22.3.3 +- .pick_status.json: Update to 9f51340b99a39be80bf1bc6d4be15a87161c7a6b +- .pick_status.json: Mark e527f686caaf688f0a185e2641fc96413dd502be as denominated +- gen_release_notes: include links in relnotes.rst when generating the new release note + +Erik Faye-Lund (2): + +- zink: fix depth-clip disable cap +- radeonsi: respect smoothing_enabled + +Francisco Jerez (2): + +- intel/fs/gfx12: Ensure that prior reads have executed before barrier with acquire semantics. +- intel/fs: Fix src and dst types of LOAD_PAYLOAD ACP entries during copy propagation. + +Friedrich Vock (1): + +- radv/rt: Divide by the correct workgroup size + +Gert Wollny (2): + +- r600/sfn: Fix readport check +- virgl: drop the separable flag for cases that can't be handled + +Italo Nicola (1): + +- panfrost: fix off-by-one when exporting format modifiers + +Jason Ekstrand (1): + +- gallium,util: Pull u_indices and u_primconvert back into gallium + +Jonathan Gray (1): + +- egl/dri2: avoid undefined unlocks + +Julia Tatz (3): + +- zink: zink_heap isn't 1-to-1 with memoryTypeIndex +- zink: trival renames heap_idx -> memoryTypeIndex +- zink: correct sparse bo mem_type_idx placement + +Kenneth Graunke (1): + +- intel/compiler: Drop redundant 32-bit expansion for shared float atomics + +Lionel Landwerlin (4): + +- nir/divergence: add missing RT intrinsinc handling +- nir/lower_io: fix bounds checking for 64bit_bounded_global +- intel/fs: avoid cmod optimization on instruction with different write_mask +- vulkan/wsi/wayland: improve same gpu detection + +Marcin Ślusarz (1): + +- intel/compiler/mesh: handle const data in task & mesh programs + +Marek Olšák (4): + +- glthread: fix an upload buffer leak +- util: fix util_is_vbo_upload_ratio_too_large +- mesa: allow GL_UNSIGNED_INT64_ARB as vertex format for ARB_bindless_texture +- glthread: handle GL_*_ARRAY in glEnable/Disable + +Maíra Canal (1): + +- v3dv: remove unused clamp_to_transparent_black_border property + +Mike Blumenkrantz (7): + +- zink: store drm format as internal_format for imported resources +- zink: handle modifier nplanes queries correctly for planar formats +- zink: preserve present resources during async presentation +- zink: delete need_blend_constants +- zink: don't use ds3 blend states without color attachments +- zink: use actual swapchain object for surface comparison +- zink: fix VK_DYNAMIC_STATE_LINE_WIDTH usage + +Paulo Zanoni (4): + +- anv: don't leave undefined values in exec->syncobj_values +- anv: check the return value of anv_execbuf_add_bo_bitset() +- hasvk: don't leave undefined values in exec->syncobj_values +- hasvk: check the return value of anv_execbuf_add_bo_bitset() + +Pierre-Eric Pelloux-Prayer (4): + +- vbo: remove bogus assert +- vbo: lower VBO_SAVE_BUFFER_SIZE to avoid large VRAM usage +- glthread: fix glArrayElement handling +- radeonsi/gfx11: fix ge_cntl programming + +Rhys Perry (1): + +- radv/gfx11: increase radeon_check_space for occlusion query begin + +Rob Clark (1): + +- freedreno: Fix tracking of enabled SSBOs + +Rose Hudson (1): + +- radeonsi: report 0 block size for Polaris HEVC encoding + +Samuel Pitoiset (5): + +- radv: fix missing implementation of creating images from swapchains +- radv: fix setting MAX_MIP for BC views +- radv: fix buffer to image copies with BC views on the graphics queue +- radv: fix creating BC image views when the base layer is > 0 +- radv: fix ignoring graphics shader stages that don't need to be imported + +Simon Ser (1): + +- zink: fix PIPE_RESOURCE_PARAM_NPLANES with format modifier + +Tapani Pälli (3): + +- anv: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable +- hasvk: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable +- iris: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable + +t0b3 (1): + +- nir/nir_opt_move: fix ALWAYS_INLINE compiler error |