Skip to content

Latest commit

 

History

History
90 lines (67 loc) · 3.75 KB

README.md

File metadata and controls

90 lines (67 loc) · 3.75 KB

FlexiEdit:
Frequency-Aware Latent Refinement for
Enhanced Non-Rigid Editing

Gwanhyeong Koo, Sunjae Yoon, Ji Woo Hong, Chang D. Yoo
KAIST

         

FlexiEdit

Release

  • [07/03] Initial Preview Release 🔥 Coming Soon!
  • [09/07] Official Release of Code 🔥 Available Now!

Contents

🐶 Introduction

Current image editing methods using DDIM Inversion face challenges with non-rigid edits due to high-frequency components in DDIM latent, which hinder structural changes. We introduce FlexiEdit, which improves fidelity to input text prompts by refining these components in targeted areas. FlexiEdit features Latent Refinement for layout adjustments and Edit Fidelity Enhancement for accurate text prompt reflection, demonstrating superior performance in complex edits through comparative experiments.

💻 Installation

It is recommended to run our code on a Nvidia GPU with a linux system. We have not yet tested on other configurations. Currently, it requires around 20 GB GPU memory to run our method.

To install the required libraries, simply run the following command:

git clone https://github.com/kookie12/FlexiEdit.git
cd FlexiEdit
conda env create -f environment.yml
conda activate flexiedit

🚀 Usage

To start using FlexiEdit, execute the following commands:

python run_flexiedit.py

Note: Within run_flexiedit.py, you can set parameters such as model_path, out_dir, and source_image_path.

To use a user-defined mask for edits, obtain coordinates using draw_bbox.py and input them into bbx_start_point and bbx_end_point. If you prefer an automatic mask, simply set these values to None.

Compatibility: FlexiEdit has been primarily tested on Stable Diffusion versions v1-4, but is also compatible with newer versions like v1-5 and v2-1. You can download these checkpoints from their official repository.

📆 TODO

The repo is still being under construction, thanks for your patience.

  • Release of code.

🙌🏻 Acknowledgement

Our code is based on these awesome repos:

📖 BibTeX

If you find our repo helpful, please consider leaving a star or cite our paper :)

@article{koo2024flexiedit,
  title={FlexiEdit: Frequency-Aware Latent Refinement for Enhanced Non-Rigid Editing},
  author={Koo, Gwanhyeong and Yoon, Sunjae and Hong, Ji Woo and Yoo, Chang D},
  journal={arXiv preprint arXiv:2407.17850},
  year={2024}
}