Ryujinx

Author	SHA1	Message	Date
riperiperi	9ac66336a2	GPU: Use lazy checks for specialization state (#4004 ) * GPU: Use lazy checks for specialization state This PR adds a new class, the SpecializationStateUpdater, that allows elements of specialization state to be updated individually, so that the state is checked only when it changes between draws, instead of being built and checked on every draw. This also avoids building spec state when Most state updates have been moved behind the shader state update, so that their specialization state updates make it in before shaders are fetched. Downside: Fields in GpuChannelGraphicsState are no longer readonly. To counteract copies that might be caused by this, I pass it as `ref` when possible, though maybe `in` would be better? Not really sure about the quirks of `in` and the difference probably won't show on a benchmark. The result is around 2 extra FPS on SMO in the usual spot. Not much right now, but it will remove costs when we're doing more expensive specialization checks, such as fragment output type specialization for macos. It may also help more on other games with more draws. * Address Feedback * Oops	2022-12-04 18:41:17 +01:00
riperiperi	4965681e06	GPU: Swap bindings array instead of copying (#4003 ) * GPU: Swap bindings array instead of copying Reduces work on UpdateShaderState. Now the cost is a few reference moves for arrays, rather than copying data. Downside: bindings arrays are no longer readonly. * Micro optimisation * Add missing docs * Address Feedback	2022-12-04 18:18:40 +01:00
riperiperi	458452279c	GPU: Track buffer migrations and flush source on incomplete copy (#3952 ) * Track buffer migrations and flush source on incomplete copy Makes sure that the modified range list is always from the latest iteration of the buffer, and flushes earlier iterations of a buffer if the data has not been migrated yet. * Cleanup 1 * Reduce cost for redundant signal checks on Vulkan * Only inherit the range list if there are pending ranges. * Fix OpenGL * Address Feedback * Whoops	2022-12-01 16:30:13 +01:00
gdkchan	4905101df1	Remove shader dependency on SPV_KHR_shader_ballot and SPV_KHR_subgroup_vote extensions (#3943 ) * Remove shader dependency on SPV_KHR_shader_ballot and SPV_KHR_subgroup_vote extensions * Shader cache version bump	2022-11-30 18:24:15 -03:00
gdkchan	8750b90a7f	Ensure that vertex attribute buffer index is valid on GPU (#3942 ) * Ensure that vertex attribute buffer index is valid on GPU * Remove vertex buffer validation code from OpenGL * Remove some fields that are no longer necessary	2022-11-30 18:06:40 -03:00
riperiperi	476b4683cf	Fix CB0 alignment with addresses used for 8/16-bit LDG/STG (#3897 ) This replacement is meant to be done with the original identified byteOffset, not the one assigned later on by the below conditionals (that already has the constant offset added, for instance). This fixes videos being pixelated in Xenoblade 3, and other regressions that might have happened since #3847.	2022-11-25 14:39:03 +00:00
riperiperi	65778a6b78	GPU: Don't trigger uploads for redundant buffer updates (#3828 ) * Initial implementation * Actually do The Thing * Add remark about performance to IVirtualMemoryManager	2022-11-24 15:50:15 +01:00
riperiperi	ece36b274d	GAL: Send all buffer assignments at once rather than individually (#3881 ) * GAL: Send all buffer assignments at once rather than individually The `(int first, BufferRange[] ranges)` method call has very significant performance implications when the bindings are spread out, which they generally always are in Vulkan. This change makes it so that these methods are only called a maximum of one time per draw. Significantly improves GPU thread performance in Pokemon Scarlet/Violet. * Address Feedback Removed SetUniformBuffers(int first, ReadOnlySpan<BufferRange> buffers)	2022-11-24 07:50:59 +00:00
riperiperi	f3cc2e5703	GPU: Access non-prefetch command buffers directly (#3882 ) * GPU: Access non-prefetch command buffers directly Saves allocating new arrays for them constantly - they can be quite small so it can be very wasteful. About 0.4% of GPU thread in SMO, but was a bit higher in S/V when I checked. Assumes that non-prefetch command buffers won't be randomly clobbered before they finish executing, though that's probably a safe bet. * Small change while I'm here * Address feedback	2022-11-24 01:56:55 +00:00
riperiperi	5a39d3c4a1	GPU: Relax locking on Buffer Cache (#3883 ) I did this on ncbuffer2 when we were using it for LDN 3, but I noticed that it can apply to the current buffer manager too, and it's an easy performance win. The only buffer access that can come from another thread is the overlap search for buffers that have been unmapped. Everything else, including modifications, comes from the main GPU thread. That means we only need to lock the range list when it's being modified, as that's the only time where we'll cause a race with the unmapped handler. This gives a significant performance improvement in situations where FIFO is high, like the other two PRs. Joined together they give a nice boost (73.6 master -> 79 -> 83 fps in SMO).	2022-11-24 01:41:16 +00:00
gdkchan	f088c3d344	Do not update shader state for DrawTextures (#3876 )	2022-11-21 18:16:00 +01:00
gdkchan	5de6ae426e	Unsubscribe MemoryUnmappedHandler even when GPU channel is destroyed (#3872 )	2022-11-19 23:54:33 -03:00
gdkchan	69ced3a6e8	Fix shader cache on Vulkan when geometry shaders are inserted (#3868 )	2022-11-19 10:24:23 +01:00
gdkchan	2e43d01d36	Move gl_Layer from vertex to geometry if GPU does not support it on vertex (#3866 ) * Move gl_Layer from vertex to geometry if GPU does not support it on vertex * Shader cache version bump * PR feedback	2022-11-18 23:27:54 -03:00
riperiperi	de162a648b	Gpu: Fix thread safety of ReregisterRanges (#3865 ) A quick fix to prevent reading the wrong value of Count when reregistering ranges for a new target buffer. Buffer flushes from another thread can modify the range list when the lock isn't active, which can change the count. This prevents some crashes in Pokemon Scarlet/Violet. It's likely that buffer migration during flush is causing some other issues in this game, but this at least prevents the crashing.	2022-11-18 21:47:29 +01:00
riperiperi	187372cbde	Prune ForceDirty and CheckModified caches on unmap (#3862 ) * Prune ForceDirty and CheckModified caches on unmap Since we're now using this for modified checks on the HLE indirect draw method, I'm worried that leaving these to forever gather cache entries isn't the best idea for performance in the long term, and it could keep old buffer objects alive for longer than they should be. This PR adds the ability to prune invalid entries before checking these caches, and queues it whenever gpu memory is unmapped. It also aligns modified checks to the page size, as I figured it would be possible for a huge number of overlapping entries to accumulate over a game's runtime. This prevents Super Mario Odyssey from having tens of thousands of entries in the modified cache in Metro Kingdom, and them duplicating when entering and leaving a building (should be cleared, as they were unmapped). * Address Feedback	2022-11-18 14:58:24 +00:00
riperiperi	7c53b69c30	SPIR-V: Fix unscaling helper not being able to find Array textures (#3863 ) The type in the `texOp` in the textureSize instruction doesn't have the exact type on SPIR-V (for example, it is missing the Array flag). This PR gives it the proper type before giving it to the unscaling helper. This fixes the ground textures being broken on Pokemon Scarlet/Violet when scaling. It wasn't finding the texture, so the descriptor index it provided was -1...	2022-11-18 02:37:37 +00:00
riperiperi	33a4d7d1ba	GPU: Eliminate CB0 accesses when storage buffer accesses are resolved (#3847 ) * Eliminate CB0 accesses Still some work to do, decouple from hle? * Forgot the important part somehow * Fix and improve alignment test * Address Feedback * Remove some complexity when checking storage buffer alignment * Update Ryujinx.Graphics.Shader/Translation/Optimizations/GlobalToStorage.cs Co-authored-by: gdkchan <gab.dark.100@gmail.com>	2022-11-17 18:47:41 +01:00
gdkchan	f1d1670b0b	Implement HLE macro for DrawElementsIndirect (#3748 ) * Implement HLE macro for DrawElementsIndirect * Shader cache version bump * Use GL_ARB_shader_draw_parameters extension on OpenGL * Fix DrawIndexedIndirectCount on Vulkan when extension is not supported * Implement DrawIndex * Alignment * Fix some validation errors * Rename BaseIds to DrawParameters * Fix incorrect index buffer and vertex buffer size in some cases * Add HLE macros for DrawArraysInstanced and DrawElementsInstanced * Perform a regular draw when indirect data is not modified * Use non-indirect draw methods if indirect buffer was not GPU modified * Only check if draw parameters match if the shader actually uses them * Expose Macro HLE setting on GUI * Reset FirstVertex and FirstInstance after draw * Update shader cache version again since some people already tested this * PR feedback Co-authored-by: riperiperi <rhy3756547@hotmail.com>	2022-11-16 14:53:04 -03:00
gdkchan	9daf029f35	Use vector transform feedback outputs if possible (#3832 )	2022-11-12 20:20:40 -03:00
gdkchan	51a27032f0	Fix VertexId and InstanceId on Vulkan (#3833 ) * Fix VertexId and InstanceId on Vulkan * Shader cache version bump	2022-11-11 13:22:49 -03:00
gdkchan	a6a67a2b7a	Minor improvement to Vulkan pipeline state and bindings management (#3829 ) * Minor improvement to Vulkan pipeline state and bindings management * Clean up buffer textures too * Use glBindTextureUnit	2022-11-10 13:38:38 -03:00
Mary-nyan	c6d05301aa	infra: Migrate to .NET 7 (#3795 ) * Update readme to mention .NET 7 * infra: Migrate to .NET 7 .NET 7 is still in preview but this prepares for the release coming up next month. * Use Random.Shared in CreateRandom * Move UInt128Utils.cs to Ryujinx.Common project * Fix inverted parameters in System.UInt128 constructor * Fix Visual Studio complaints on Ryujinx.Graphics.Vic * time: Fix missing alignment enforcement in SystemClockContext Fixes at least Smash * time: Fix missing alignment enforcement in SteadyClockContext Fixes games (like recent versions of Smash) using time shared memory * Switch to .NET 7.0.100 release * Enable Tiered PGO * Ensure CreateId validity requirements are met when doing random generation Also enforce correct packing layout for other Mii structures. This fixes Mario Kart 8 crashes related to the default Miis.	2022-11-09 20:22:43 +01:00
gdkchan	647de4cd31	Ensure all pending draws are done before compute dispatch (#3822 )	2022-11-03 19:54:30 -03:00
gdkchan	f82309fa2d	Vulkan: Implement multisample <-> non-multisample copies and depth-stencil resolve (#3723 ) * Vulkan: Implement multisample <-> non-multisample copies and depth-stencil resolve * FramebufferParams is no longer required there * Implement Specialization Constants and merge CopyMS Shaders (#15) * Vulkan: Initial Specialization Constants * Replace with specialized helper shader * Reimplement everything Fix nonexistent interaction with Ryu pipeline caching Decouple specialization info from data and relocate them Generalize mapping and add type enum to better match spv types Use local fixed scopes instead of global unmanaged allocs * Fix misses in initial implementation Use correct info variable in Create2DLayerView Add ShaderStorageImageMultisample to required feature set * Use texture for source image * No point in using ReadOnlyMemory * Apply formatting feedback Co-authored-by: gdkchan <gab.dark.100@gmail.com> * Apply formatting suggestions on shader source Co-authored-by: gdkchan <gab.dark.100@gmail.com> * Support conversion with samples count that does not match the requested count, other minor changes Co-authored-by: mageven <62494521+mageven@users.noreply.github.com>	2022-11-02 18:17:19 -03:00
gdkchan	59cdf310bd	SPIR-V: Fix tessellation control shader output types (#3807 ) * SPIR-V: Fix tessellation control shader output types * Shader cache version bump	2022-10-29 13:45:30 -03:00
gdkchan	5fdc46ac7f	Vulkan: Fix vertex position Z conversion with geometry shader passthrough (#3781 ) * Vulkan: Fix vertex position Z conversion with geometry shader passthrough * Shader cache version bump	2022-10-21 04:48:21 +00:00
gdkchan	2df16ded9b	Improve shader BRX instruction code generation (#3759 ) * Improve shader BRX instruction code generation * Shader cache version bump, add some comments and asserts	2022-10-15 23:20:16 +00:00
gdkchan	88a8d1e567	Fix disposed textures being updated on TextureBindingsManager (#3750 ) * Fix disposed textures being updated on TextureBindingsManager * PR feedback	2022-10-09 15:23:52 -03:00
riperiperi	bf77d1cab9	GPU: Pass SpanOrArray for Texture SetData to avoid copy (#3745 ) * GPU: Pass SpanOrArray for Texture SetData to avoid copy Texture data is often converted before upload, meaning that an array was allocated to perform the conversion into. However, the backend SetData methods were being passed a Span of that data, and the Multithreaded layer does `ToArray()` on it so that it can be stored for later! This method can't extract the original array, so it creates a copy. This PR changes the type passed for textures to a new ref struct called SpanOrArray, which is backed by either a ReadOnlySpan or an array. The benefit here is that we can have a ToArray method that doesn't copy if it is originally backed by an array. This will also avoid a copy when running the ASTC decoder. On NieR this was taking 38% of texture upload time, which it does a _lot_ of when you move between areas, so there should be a 1.6x performance boost when strictly uploading textures. No doubt this will also improve texture streaming performance in UE4 games, and maybe a small reduction with video playback. From the numbers, it's probably possible to improve the upload rate by a further 1.6x by performing layout conversion on GPU. I'm not sure if we could improve it further than that - multithreading conversion on CPU would probably result in memory bottleneck. This doesn't extend to buffers, since we don't convert their data on the GPU emulator side. * Remove implicit cast to array.	2022-10-08 12:04:47 -03:00
gdkchan	2068445939	Fix shader SULD (bindless) instruction using wrong register as handle (#3732 ) * GLSL: Do not generate scale helpers if we have no textures * Fix shader SULD (bindless) instruction using wrong register as handle	2022-10-03 20:40:22 -03:00
gdkchan	81f848e54f	Allow Surface Flinger frame enqueue after process has exited (#3733 )	2022-10-02 21:50:03 +00:00
gdkchan	9c2500de5f	Fix incorrect tessellation inputs/outputs (#3728 ) * Fix incorrect tessellation inputs/outputs * Shader cache version bump	2022-10-01 02:35:52 -03:00
gdkchan	0cb1e926b5	Allow bindless textures with handles from unbound constant buffer (#3706 )	2022-09-19 15:35:47 -03:00
Emmanuel Hansen	6f0395538b	Avalonia - Use embedded window for avalonia (#3674 ) * wip * use embedded window * fix race condition on opengl Windows * fix glx issues on prime nvidia * fix mouse support win32 * clean up * addressed review * addressed review * fix warnings * fix software keyboard dialog * Update Ryujinx.Ava/Ui/Applet/SwkbdAppletDialog.axaml.cs Co-authored-by: gdkchan <gab.dark.100@gmail.com> * remove double semi Co-authored-by: gdkchan <gab.dark.100@gmail.com>	2022-09-19 15:05:26 -03:00
gdkchan	66f16f4392	Fix bindless 1D textures having a buffer type on the shader (#3697 ) * Fix bindless 1D textures having a buffer type on the shader * Shader cache version bump	2022-09-13 08:53:55 +02:00
riperiperi	36172ab43b	Scale SamplesPassed counter by RT scale on report (#3680 ) * Scale SamplesPassed counter by RT scale on report Adds a scale factor for samples passed counter report based on the render target scale at the time. This ensures that when a game reads this counter, it appears similar to the result at 1x. This doesn't cover cases where the render target scale changes during the queried draws, though that might be better to handle along with other scope related issues in a future rework of counters. Games generally don't count for occlusion queries over render target changes anyways. Fixes an issue in the Splatoon games where the special charge would scale too quickly at high res, points at the end of the game would be broken (but still provide a correct winner), and playing at a low res would make it impossible to swim in ink. May also affect LOD scaling in The Witcher 3. * Update Ryujinx.Graphics.Gpu/Engine/Threed/SemaphoreUpdater.cs Co-authored-by: gdkchan <gab.dark.100@gmail.com>	2022-09-11 15:58:15 +00:00
gdkchan	619ac86bd0	Do not output ViewportIndex on SPIR-V if GPU does not support it (#3644 ) * Do not output ViewportIndex on SPIR-V if GPU does not support it * Bump shader cache version	2022-09-10 13:20:23 +00:00
riperiperi	dc4ba3993b	Rebind textures if format changes or they're buffer textures	2022-09-10 14:12:50 +02:00
gdkchan	408bd63b08	Transform shader LDC into constant buffer access if offset is constant (#3672 ) * Transform shader LDC into constant buffer access if offset is constant * Shader cache version bump	2022-09-07 20:25:22 -03:00
mageven	311c2661b8	Replace image format magic numbers with enums (#3631 ) * Replace magic constants with enums * Extra formatting * Lower case ASTC dimensions * Use uint for VertexAttributeFormat	2022-08-28 01:56:26 +00:00
gdkchan	923089a298	Fast path for Inline-to-Memory texture data transfers (#3610 ) * Fast path for Inline-to-Memory texture data transfers * Only do it for block linear textures to be on the safe side	2022-08-26 02:16:41 +00:00
gdkchan	88a0e720cb	Use RGBA16 vertex format if RGB16 is not supported on Vulkan (#3552 ) * Use RGBA16 vertex format if RGB16 is not supported on Vulkan * Catch all shader compilation exceptions	2022-08-20 16:20:27 -03:00
Nicholas Rodine	7defc59b9d	A few minor documentation fixes. (#3599 ) * A few minor documentation fixes. * Removed more invalid inheritdoc instances.	2022-08-19 18:21:06 -03:00
Nicholas Rodine	951700fdd8	Removed unused usings. (#3593 ) * Removed unused usings. * Added back using, now that it's used. * Removed extra whitespace.	2022-08-18 18:04:54 +02:00
gdkchan	e87e8b012c	Fix texture bindings using wrong sampler pool in some cases (#3583 )	2022-08-14 14:00:30 -03:00
gdkchan	ad47bd2d4e	Fix blend with RGBX color formats (#3553 )	2022-08-11 18:23:25 -03:00
gdkchan	a5ff0024fb	Rename ToSpan to AsSpan (#3556 )	2022-08-11 18:07:37 -03:00
gdkchan	1080f64df9	Implement HLE macros for render target clears (#3528 ) * Implement HLE macros for render target clears * Add constants for the offsets	2022-08-04 21:30:08 +00:00
riperiperi	c48a75979f	Fix Multithreaded Compilation of Shader Cache on OpenGL (#3540 ) This was broken by the Vulkan changes - OpenGL was building host caches at boot on one thread, which is very notably slower than when it is multithreaded. This was caused by trying to get the program binary immediately after compilation started, which blocks. Now it does it after compilation has completed.	2022-08-03 19:37:56 -03:00


448 commits