Linoy Tsaban commited on
Commit
2412a2b
·
1 Parent(s): 086a6c0

Update index.html

Browse files
Files changed (1) hide show
  1. index.html +42 -6
index.html CHANGED
@@ -208,9 +208,21 @@
208
  <section class="section">
209
  <div class="container is-max-desktop">
210
  <div class="columns is-centered has-text-centered">
211
- <img src="static/images/ledits_teaser.jpg"
212
  class="interpolation-image"
213
- style="max-height:800px; max-width:1200px"
 
 
 
 
 
 
 
 
 
 
 
 
214
  alt="examples"/>
215
  </div>
216
  </div>
@@ -221,13 +233,12 @@
221
  </div>
222
  <p>
223
  The methodology of LEDITS++ can be broken down into three components: (1) efficient image
224
- inversion, (2) versatile textual editing, and (3) semantic grounding of image changes. More in-depth
225
- details and mathematical derivations of each component can be found in App
226
  </p>
227
 
228
  <div class="columns is-centered has-text-centered">
229
- <img src="static/images/diagram.jpg"
230
- style="max-height:620px; max-width:700px"
231
  alt="diagram"/>
232
  </div>
233
 
@@ -250,6 +261,18 @@
250
  <div class="content">
251
  <h2 class="title is-4">Component 1: Perfect Inversion</h2>
252
  <p>
 
 
 
 
 
 
 
 
 
 
 
 
253
 
254
  </p>
255
 
@@ -262,6 +285,13 @@
262
  <div class="columns is-centered">
263
  <div class="column content">
264
  <p>
 
 
 
 
 
 
 
265
 
266
  </p>
267
 
@@ -275,6 +305,12 @@
275
  <div class="columns is-centered">
276
  <div class="column content">
277
  <p>
 
 
 
 
 
 
278
 
279
  </p>
280
 
 
208
  <section class="section">
209
  <div class="container is-max-desktop">
210
  <div class="columns is-centered has-text-centered">
211
+ <img src="static/images/removal.png"
212
  class="interpolation-image"
213
+ style="max-height:800px; max-width:800px"
214
+ alt="examples"/>
215
+ </div>
216
+ <div class="columns is-centered has-text-centered">
217
+ <img src="static/images/replacement.png"
218
+ class="interpolation-image"
219
+ style="max-height:800px; max-width:800px"
220
+ alt="examples"/>
221
+ </div>
222
+ <div class="columns is-centered has-text-centered">
223
+ <img src="static/images/style_transfer.png"
224
+ class="interpolation-image"
225
+ style="max-height:800px; max-width:800px"
226
  alt="examples"/>
227
  </div>
228
  </div>
 
233
  </div>
234
  <p>
235
  The methodology of LEDITS++ can be broken down into three components: (1) efficient image
236
+ inversion, (2) versatile textual editing, and (3) semantic grounding of image changes.
 
237
  </p>
238
 
239
  <div class="columns is-centered has-text-centered">
240
+ <img src="static/images/ledits_teaser.jpg"
241
+ style="max-height:620px; max-width:1000px"
242
  alt="diagram"/>
243
  </div>
244
 
 
261
  <div class="content">
262
  <h2 class="title is-4">Component 1: Perfect Inversion</h2>
263
  <p>
264
+ Utilizing text-to-image models for editing real images is usually done by inverting the sampling process to
265
+ identify a noisy xT that will be denoised to the input image x0.
266
+ We propose an efficient inversion method that greatly reduces the required number
267
+ of steps while maintaining no reconstruction error.
268
+ First, DDPM can be viewed as a first-order
269
+ stochastic differential
270
+ equation
271
+ (SDE) solver when
272
+ formulating the reverse diffusion process as an SDE. This
273
+ SDE can be solved more efficiently—in fewer steps—
274
+ using a higher-order differential equation solver, hence we present here dpm-solver++
275
+ Inversion.
276
 
277
  </p>
278
 
 
285
  <div class="columns is-centered">
286
  <div class="column content">
287
  <p>
288
+ After creating our re-construction sequence, we can edit the image by manipulating
289
+ the noise estimate εθ based on a set of edit instructions. We devise a dedicated
290
+ guidance term for each concept based on conditioned and unconditioned estimate. We
291
+ define LEDITS++ guidance such that it both reflects the direction of the edit (if we
292
+ want
293
+ to push away from/towards the edit concept) and maximizes fine-grained control over
294
+ the effect of the desired edit
295
 
296
  </p>
297
 
 
305
  <div class="columns is-centered">
306
  <div class="column content">
307
  <p>
308
+ With LEDITS++, we empirically demonstrate that these maps can also capture regions 290
309
+ of an image relevant to an editing concept that is not already present.
310
+ Specifically for multiple edits, calculating a
311
+ dedicated mask for each edit prompt ensures that the corresponding
312
+ guidance terms remain largely isolated, limiting
313
+ interference between them.
314
 
315
  </p>
316